Changelog

Fri, 31 Oct 2025

v3.0.0

New

  • Transcription Studio: Brand new interactive workspace for reviewing transcriptions with professional tools

  • Audio Player Integration: Full-featured audio player with play/pause, seek, skip (±10s), and volume controls

  • Interactive Transcript: Click any segment to jump to that exact moment in the audio

  • Real-time Segment Highlighting: Current segment automatically highlights during playback

  • OpenAI Whisper Model: Upgraded to OpenAI's official Whisper model for better accuracy and timestamp precision

  • Timestamp-Based Segments: Each transcription now includes precise start/end times for every segment

  • Enhanced Export Formats: Download transcriptions in TXT, SRT, VTT, and DOCX with proper timestamps

  • SRT/VTT Generation: Professional subtitle file generation with accurate timestamps from the API

  • Transcript Search: Search functionality with result highlighting in segments

  • Responsive Studio Layout: Two-column design optimized for both desktop and mobile
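
For reference, timestamped segments map onto SRT cues roughly as follows. This is a minimal sketch, not the shipped exporter: the Segment shape and the toSrtTime and toSrt helper names are assumptions made for illustration.

```typescript
// Hypothetical segment shape; the actual API response may differ.
interface Segment {
  start: number; // seconds from the start of the audio
  end: number;   // seconds from the start of the audio
  text: string;
}

// Left-pad a number with zeros to the given width.
function pad(n: number, width: number): string {
  let s = String(n);
  while (s.length < width) s = "0" + s;
  return s;
}

// Format seconds as an SRT timestamp: HH:MM:SS,mmm
function toSrtTime(seconds: number): string {
  const totalMs = Math.round(seconds * 1000);
  const ms = totalMs % 1000;
  const totalSec = Math.floor(totalMs / 1000);
  const s = totalSec % 60;
  const m = Math.floor(totalSec / 60) % 60;
  const h = Math.floor(totalSec / 3600);
  return pad(h, 2) + ":" + pad(m, 2) + ":" + pad(s, 2) + "," + pad(ms, 3);
}

// Build an SRT document: numbered cues separated by blank lines.
function toSrt(segments: Segment[]): string {
  return segments
    .map(function (seg, i) {
      return (i + 1) + "\n" +
        toSrtTime(seg.start) + " --> " + toSrtTime(seg.end) + "\n" +
        seg.text.trim() + "\n";
    })
    .join("\n");
}
```

A VTT export would follow the same pattern, with a leading WEBVTT header line and a period instead of a comma before the milliseconds.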

Improved

  • Better Timestamp Accuracy: OpenAI Whisper provides more precise segment boundaries

  • Streamlined UI: Removed speaker diarization option to simplify the interface

  • Segment Data Storage: All transcriptions now store segment arrays with timestamps

  • Modal-Based Studio: Studio opens in a dedicated modal for focused work without losing context

  • Click-to-Seek: Instant audio navigation by clicking transcript segments

  • Session Persistence: Segments are saved in session history for later playback
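
Segment highlighting and click-to-seek both come down to mapping between a playback time and a segment. One possible approach is a binary search over the stored segment array, sketched below. The TimedSegment shape and the findActiveSegment name are hypothetical, and the sketch assumes segments are sorted by start time and do not overlap.

```typescript
// Hypothetical segment shape; times are in seconds.
interface TimedSegment {
  start: number;
  end: number;
  text: string;
}

// Return the index of the segment whose [start, end) range contains `time`,
// or -1 when the time falls in a gap or outside the transcript.
function findActiveSegment(segments: TimedSegment[], time: number): number {
  let lo = 0;
  let hi = segments.length - 1;
  while (lo <= hi) {
    const mid = (lo + hi) >> 1;
    const seg = segments[mid];
    if (time < seg.start) {
      hi = mid - 1; // target is earlier than this segment
    } else if (time >= seg.end) {
      lo = mid + 1; // target is later than this segment
    } else {
      return mid;
    }
  }
  return -1;
}
```

During playback the player's current time would be passed in on each time update; clicking a segment goes the other way and seeks the audio to that segment's start.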

Fixed

  • Audio Playback: Fixed audio player to properly connect with transcript

  • Time Synchronization: Resolved timing issues between audio and transcript highlighting

  • Export Quality: Improved SRT/VTT file format generation with proper timestamps

Tue, 22 Jul 2025

v2.1.0

New

  • Universal Audio Format Support: Added automatic audio format conversion powered by CloudConvert API

  • Seamless Format Conversion: M4A, AAC, MP4, WMA, AIFF, and CAF files are now automatically converted to MP3 before transcription

  • iPhone Recording Support: Native support for M4A files from iPhone Voice Memos and other iOS recording apps

  • Windows Media Audio Support: Added WMA file support for Windows users

  • macOS Audio Support: Full compatibility with AIFF and CAF audio formats from macOS applications

  • CloudConvert API Integration: New /api/convert/cloud endpoint with comprehensive job management and error handling

  • Intelligent Format Detection: Advanced file format detection utility with 15+ supported audio formats

  • Real-Time Conversion Transparency: Added detailed API response logging visible in UI details panel during conversion

  • Conversion Progress Tracking: Live status updates with step-by-step conversion process visibility

  • Secure Temporary File Handling: Automatic cleanup of conversion files with privacy-focused temporary storage

  • Enhanced File History: Converted files now properly appear as 'file uploads' in transcription history with original filenames

  • Smart Conversion Routing: Automatic detection and routing based on file format without user intervention

  • CloudConvert Health Monitoring: Built-in API health checks and service availability validation

  • Comprehensive Format Documentation: Updated documentation with detailed format support matrix and conversion explanations

  • Multi-Format Download System: Complete download dropdown with support for TXT, Markdown, DOCX, and PDF formats

  • Professional Document Generation: Real DOCX files using the docx library with proper formatting, titles, and paragraph structure

  • PDF Export with Text Wrapping: PDF generation using jsPDF with automatic text wrapping and professional layout

  • Enhanced Markdown Export: Structured Markdown files with proper headings and formatting

  • Arabic Language Detection: Automatic detection of Arabic text using comprehensive Unicode character ranges

  • Arabic Format Beta Warnings: Clear beta ribbons on DOCX and PDF downloads for Arabic content with explanatory notices

  • Format Compatibility Notices: User-friendly warnings explaining format limitations for Arabic text with recommendations
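
As an illustration of Unicode-range detection for Arabic text, the check can be a single regular expression. The ranges below are the standard Arabic blocks; whether the app uses exactly this set is an assumption, and containsArabic is a hypothetical name.

```typescript
// Unicode blocks covered:
//   U+0600-U+06FF  Arabic
//   U+0750-U+077F  Arabic Supplement
//   U+08A0-U+08FF  Arabic Extended-A
//   U+FB50-U+FDFF  Arabic Presentation Forms-A
//   U+FE70-U+FEFC  Arabic Presentation Forms-B (stops before U+FEFF, the byte order mark)
const ARABIC_CHARS = /[\u0600-\u06FF\u0750-\u077F\u08A0-\u08FF\uFB50-\uFDFF\uFE70-\uFEFC]/;

// True when the text contains at least one character from the ranges above.
function containsArabic(text: string): boolean {
  return ARABIC_CHARS.test(text);
}
```

A result of true would be the trigger for showing the beta ribbon on DOCX and PDF downloads.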

Improved

  • Enhanced User Experience: Conversion process is completely transparent with immediate 'Converting...' UI feedback

  • Processing Time Optimization: Direct formats (MP3, WAV, FLAC, OGG) process immediately; converted formats add only 30-60 seconds

  • Automatic Workflow Integration: Conversion seamlessly flows into existing transcription pipeline without user interaction

  • Mobile Conversion Support: All conversion features fully functional on mobile devices with touch-optimized interfaces

  • Visual Format Indicators: Color-coded format categories (green for direct support, blue for auto-conversion) in documentation

  • Enhanced Progress Visualization: Updated TranscriptionProcessing component with conversion-specific status messages

  • Improved Error Handling: Better error messages and fallback options for conversion failures

  • Updated Help Documentation: Completely rewritten audio format support section with visual guides and process explanations

  • File Format Classification: Clear distinction between 'directly supported' and 'auto-converted' formats for user expectations

  • Enhanced File Validation: Improved file type detection with both extension and MIME type checking

  • README Documentation: Updated project README with comprehensive CloudConvert setup instructions and format support matrix

  • Environment Configuration: Enhanced environment variable documentation for CloudConvert API setup

  • Unified Transcription Results UI: Complete redesign with mobile-inspired card layout replacing complex tabs system

  • Streamlined Download Interface: Single dropdown button with organized format options and visual icons

  • Improved Dropdown Visibility: Solid backgrounds with proper shadows and borders for better readability

  • Enhanced Download Performance: Optimized file generation with proper memory cleanup and blob URL management

  • Internationalization Awareness: Smart language detection with format-specific recommendations for non-Latin scripts
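
The split between directly supported and auto-converted formats, combined with extension and MIME type checks, could look roughly like the sketch below. The format lists come from this changelog; the classifyAudioFile name, the return values, and the loose MIME rule are assumptions for illustration only.

```typescript
type FormatClass = "direct" | "convert" | "unsupported";

// Formats listed in this changelog as processed directly.
const DIRECT_FORMATS = ["mp3", "wav", "flac", "ogg"];
// Formats listed in this changelog as converted to MP3 first.
const CONVERTED_FORMATS = ["m4a", "aac", "mp4", "wma", "aiff", "caf"];

function classifyAudioFile(fileName: string, mimeType: string): FormatClass {
  const dot = fileName.lastIndexOf(".");
  const ext = dot >= 0 ? fileName.slice(dot + 1).toLowerCase() : "";

  // File.type can be an empty string when the browser cannot determine
  // the type, so an empty MIME type is tolerated and the extension decides.
  const mimeOk =
    mimeType === "" ||
    mimeType.indexOf("audio/") === 0 ||
    mimeType.indexOf("video/") === 0;
  if (!mimeOk) return "unsupported";

  if (DIRECT_FORMATS.indexOf(ext) !== -1) return "direct";
  if (CONVERTED_FORMATS.indexOf(ext) !== -1) return "convert";
  return "unsupported";
}
```

Files classified as "convert" would be routed through the conversion step before transcription, while "direct" files skip it.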

Fixed

  • Session History Persistence: Fixed transcription history keeping only the most recent entry; it now accumulates all transcriptions chronologically

  • Session ID Management: Modified createSession() to always generate unique session IDs instead of reusing cookie values

  • SSL Certificate Issues: Resolved development environment SSL certificate errors in Replicate API calls

  • UI State Management: Fixed conversion status not appearing immediately; the UI now shows the 'Converting...' state instantly

  • File History Categorization: Resolved converted files appearing as 'URL audio'; they are now categorized as file uploads with original filenames

  • Conversion Flow Metadata: Enhanced data flow to preserve original file information throughout conversion process

  • File Input Validation: Updated file input to accept all supported formats including M4A, AAC, and other convertible types

  • UnsupportedFormatHelp Component: Updated component to handle conversion failures rather than format rejection

Mon, 21 Jul 2025

v2.0.0

New

  • Added V2 Announcement modal for first-time visitors with celebratory confetti animation

  • Implemented localStorage-based persistence to show announcement only once per user

  • Added debug function seenV2(false) for developers to re-enable the announcement modal

  • Documentation Page: Created comprehensive documentation at /documentation with getting started guide, features overview, troubleshooting, and FAQ

  • Mobile-First UI System: Developed a complete mobile-optimized interface with hamburger navigation, touch-friendly components, and responsive layouts

  • Mobile Navigation: Added slide-out hamburger menu with backdrop overlay and organized navigation sections

  • Mobile-Specific Components: Created MobileHeader, MobileFooter, MobileButton, MobileInput, MobileDialog, and MobileTranscriptionResult components

  • Touch-Optimized Interactions: Implemented 44px minimum touch targets, active state feedback, and gesture-friendly animations

  • Responsive Transcription Results: Added mobile-specific transcription display with task-oriented layout, preview mode, and expandable full text view

  • Mobile-Optimized Changelog: Created accordion-based mobile changelog with version and category sections for better mobile navigation

  • Safe Area Support: Enhanced mobile experience with proper safe area handling for modern smartphones

  • Analytics Opt-Out: You can now opt out of analytics tracking in the settings.

  • Improved Session History: Added ranged query support for fetching your transcription history.
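
The show-once behaviour of the V2 announcement can be sketched as a flag in a key-value store. This is an assumption-laden illustration: the storage key, the KeyValueStore interface, and the setSeenV2 signature are hypothetical and only mirror the idea behind the seenV2(false) debug helper.

```typescript
// Minimal subset of the Web Storage API used by this sketch.
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
  removeItem(key: string): void;
}

// Hypothetical storage key; the real key name is not documented here.
const SEEN_KEY = "seen-v2-announcement";

// The announcement is shown until the flag has been stored.
function shouldShowAnnouncement(store: KeyValueStore): boolean {
  return store.getItem(SEEN_KEY) !== "true";
}

// Passing false clears the flag so the modal appears again.
function setSeenV2(store: KeyValueStore, seen: boolean): void {
  if (seen) {
    store.setItem(SEEN_KEY, "true");
  } else {
    store.removeItem(SEEN_KEY);
  }
}

// In-memory stand-in so the sketch also runs outside a browser.
class MemoryStore implements KeyValueStore {
  private data: { [key: string]: string } = {};
  getItem(key: string): string | null {
    return Object.prototype.hasOwnProperty.call(this.data, key) ? this.data[key] : null;
  }
  setItem(key: string, value: string): void {
    this.data[key] = value;
  }
  removeItem(key: string): void {
    delete this.data[key];
  }
}
```

In a browser, window.localStorage satisfies the same interface and can be passed in place of MemoryStore.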

Improved

  • Enhanced user onboarding experience with welcoming V2 announcement

  • Integrated confetti animation with proper z-index layering for visual celebration

  • Complete Migration to Next.js: The entire application has been migrated from Vite to Next.js, including the move from Netlify Functions to Next.js API Routes and the adoption of native Next.js routing.

  • Modernized Styling: Adopted CSS variables for theming and migrated to Tailwind CSS v4 syntax for a more maintainable and modern codebase.

  • Mobile-First Responsive Design: Extended Tailwind configuration with mobile breakpoints, touch detection utilities, and custom spacing for optimal mobile experience

  • Conditional Mobile/Desktop Rendering: Implemented smart device detection to serve optimized interfaces without affecting existing desktop functionality

  • Enhanced Touch Feedback: Added scale animations, haptic-style feedback, and improved button states for better mobile interaction

  • Mobile Layout Optimization: Redesigned headers, footers, and form layouts specifically for mobile devices while preserving desktop experience

  • Documentation Organization: Migrated documentation from component-based routing to proper Next.js App Router structure

  • Component Architecture: Cleaned up legacy Vite routing code and consolidated documentation into /app/documentation/page.tsx

  • Data Architecture: Centralized changelog data in dedicated /data/changelog.ts module for consistency across components

  • TypeScript Improvements: Fixed ButtonProps export issues and enhanced type safety across UI components

  • Simplified PDF Generation: Removed the external 'printerz' PDF generation service, streamlining the architecture.

  • Smoother UI: Replaced layout transitions with more fluid, spring-based animations.

  • Codebase Health: Performed a major refactoring across the entire application, improving type safety, removing unused code, simplifying components, and enforcing a consistent code style with ESLint and Prettier.

Fixed

  • Resolved a type error for the fileInputRef prop.

  • Fixed various linting errors across the application and API.

  • Corrected the environment variable for the Replicate API.

  • Enabled proper client-side routing within Next.js.

  • Fixed TypeScript compilation errors in AnimatedButton component by properly importing ButtonProps interface

  • Resolved module export issues with UI component type definitions

  • Updated Footer navigation links to point to correct /documentation route instead of /docs

  • Cleaned up old Vite LazyRoutes.tsx file that was causing build failures

  • Fixed mobile component integration compilation errors in TranscriptionForm.tsx

  • Resolved Framer Motion animation type issues in mobile components

  • Fixed conditional rendering logic for mobile vs desktop component display

Thu, 26 Jun 2025

v1.4.7

Improved

  • Refactored generatePdfLocally function to reduce cognitive complexity.

Fixed

  • Addressed cognitive complexity issue in PDF generation logic.

Thu, 26 Jun 2025

v1.4.6

New

  • Added basic content to the Documentation page.

  • Added a "Back to Home" button on the Documentation page.

Thu, 26 Jun 2025

v1.4.4

Improved

  • Removed FFmpeg dependency and all related audio conversion logic.

Fixed

  • Fixed build errors caused by FFmpeg dependency issues.

Wed, 23 Apr 2025

v1.4.3

Improved

  • Enhanced transcription reliability with automatic batch size reduction

  • Added smart retry logic for handling GPU memory limitations

Fixed

  • Fixed "CUDA out of memory" errors by automatically reducing batch size and retrying

  • Resolved transcription failures on larger audio files with dynamic resource allocation
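
The retry behaviour can be sketched as a loop that shrinks the batch size whenever the run fails with an out-of-memory error. The TranscribeFn type, the halving strategy, and the minimum batch size are assumptions; the changelog only states that the batch size is reduced and the job retried.

```typescript
// Hypothetical runner: resolves with the transcript or rejects with an Error.
type TranscribeFn = (batchSize: number) => Promise<string>;

// Detect the GPU memory failure by its error message.
function isOutOfMemory(err: unknown): boolean {
  return err instanceof Error && err.message.indexOf("CUDA out of memory") !== -1;
}

// Retry with half the batch size after each out-of-memory failure.
// Any other error, or a failure at the minimum batch size, is rethrown.
async function transcribeWithBackoff(
  run: TranscribeFn,
  initialBatchSize: number,
  minBatchSize: number = 1
): Promise<string> {
  let batchSize = initialBatchSize;
  for (;;) {
    try {
      return await run(batchSize);
    } catch (err) {
      if (!isOutOfMemory(err) || batchSize <= minBatchSize) throw err;
      batchSize = Math.max(minBatchSize, Math.floor(batchSize / 2));
    }
  }
}
```

Halving keeps the number of retries small (64 reaches 1 in six steps) while still finding a batch size that fits in memory.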

Wed, 23 Apr 2025

v1.4.2

New

  • Added improved multilingual document generation for better international language support

  • Implemented true PDF generation with proper multilingual text support

Improved

  • Enhanced document export to properly handle Arabic, Hebrew, and other non-Latin scripts

  • Optimized PDF generation with automatic RTL text direction detection

  • Added automatic fallback to HTML format when PDF generation fails

  • Implemented proper page layout with headers, footers, and multi-page support

Fixed

  • Fixed issue with Arabic and other non-Latin text displaying as gibberish in exported documents

  • Fixed document generation issues when Printerz API is unavailable

  • Fixed inconsistent file extensions in document downloads

Mon, 21 Apr 2025

v1.4.1

New

  • Added dialog component for improved user interactions

  • Implemented session persistence for transcriptions

  • Added transcription history feature

Improved

  • Refactored code to reduce cognitive complexity in transcription processing

  • Enhanced exception handling in Firebase proxy service

  • Optimized Replicate client by removing unused variables

  • Enhanced transcription progress tracking with cleaner percentage ranges

  • Redesigned layout components for better user experience

Fixed

  • Fixed ignored exceptions in Firebase proxy service

  • Removed useless variable assignment in Replicate client

  • Reduced complexity in transcribe function for better maintainability

  • Resolved typo where meta tag had placeholder information

  • Fixed decimal values in progress percentages for a cleaner UI experience

Sun, 13 Apr 2025

v1.4.0

New

  • Implemented audio upload and transcription functionality with URL support

  • Added SEO and social meta tags

  • Integrated branding assets (favicon, social image)

  • Introduced LoadingFallback component for lazy-loaded routes.

  • Created cleanup service for managing temporary files in Firebase.

Improved

  • Enhanced polling mechanism to fix timing issues

  • Refactored polling logic into custom hook useTranscriptionPolling

  • Replaced MainApp component with MainLayout for better organization.

  • Refactored UploadAudio component to utilize new FileUploadInput and UrlInput components.

  • Added cleanup functionality for temporary files in Firebase after transcription.

  • Implemented lazy loading for transcription-related components in MainLayout.

  • Enhanced URL validation logic in UploadAudio component.

  • Updated Firebase upload utility to handle base64 data uploads.

  • Improved error handling and logging in Replicate API interactions.

Fixed

  • Resolved issue where polling stopped prematurely due to state update timing

Sat, 12 Apr 2025

v1.3.0

New

  • Added OGG support

  • Optimized analytics loading

  • Enhanced chunking strategy for improved performance

Improved

  • Performance optimizations across the application

Fri, 11 Apr 2025

v1.0.0

New

  • Added changelog and feedback components

  • Added Google site verification HTML file

  • Implemented dynamic imports for performance optimization

  • Added Google Analytics integration

  • Added TranscriptionProcessing and TranscriptionResult components

  • Added PDF and DOCX generation capabilities

Improved

  • Enhanced header and footer for changelog access

Wed, 9 Apr 2025

v0.5.0

New

  • Integrated cookie consent management with analytics tracking

  • Added ad blocker detection

  • Added Terms of Service and Privacy Policy components

Improved

  • Updated analytics initialization to use the new Clarity library

Tue, 8 Apr 2025

v0.4.0

New

  • Added feedback form and modal for user feedback collection

  • Added Firebase configuration files

  • Added HTML template and instructions for template generation

  • Added .hintrc configuration file

  • Enhanced PDF generation with Firebase upload

Improved

  • Updated feedback modals to use specific IDs

  • Enhanced feedback form handling with device info detection

  • Updated README with environment variables section

Mon, 7 Apr 2025

v0.3.0

New

  • Added PDF generation support with pdfMake

  • Integrated Printerz API for PDF generation

  • Added customizable title input for PDFs

Improved

  • Refactored code structure for improved readability

Fixed

  • Refactored PDF generation to remove unused code

Mon, 31 Mar 2025

v0.2.0

New

  • Added Firebase integration for file storage

  • Implemented FFmpeg download script

  • Added audio conversion functionality

  • Added CloudConvert function for audio conversion

  • Implemented cookie consent management

Improved

  • Enhanced dark mode support with improved text colors

  • Updated audio format support and user guidance

  • Refactored download-ffmpeg script to use ESM imports

Fixed

  • Firebase file upload handling and cleanup process

  • Implemented alternative FFmpeg download script

  • Updated Netlify configuration

  • Added debug environment function and enhanced file conversion error handling

Tue, 18 Mar 2025

v0.1.1

New

  • Added Netlify deployment support with API functions

Improved

  • Updated .gitignore to include .env and .netlify folders

Fixed

  • Disabled dark mode by changing Tailwind configuration

  • Updated Netlify build command and TypeScript configuration

Mon, 17 Mar 2025

v0.1.0

New

  • Implemented custom file input hook

  • Added audio upload component with improved error handling

  • Enhanced Tailwind CSS configuration with backdrop filter and typography plugin

  • Added Prettier configuration

  • Updated README with project overview and setup instructions

Improved

  • Audio transcription handling with improved error management

  • Audio upload component layout

  • TypeScript settings for module interoperability

Sun, 16 Mar 2025

v0.0.1

New

  • Initial project setup with Vite, React, and TypeScript

  • Added environment configuration, PostCSS, and ESLint