Fri, 31 Oct 2025
v3.0.0
Transcription Studio: Brand new interactive workspace for reviewing transcriptions with professional tools
Audio Player Integration: Full-featured audio player with play/pause, seek, skip (±10s), and volume controls
Interactive Transcript: Click any segment to jump to that exact moment in the audio
Real-time Segment Highlighting: Current segment automatically highlights during playback
OpenAI Whisper Model: Upgraded to OpenAI's official Whisper model for better accuracy and timestamp precision
Timestamp-Based Segments: Each transcription now includes precise start/end times for every segment
Enhanced Export Formats: Download transcriptions in TXT, SRT, VTT, and DOCX with proper timestamps
SRT/VTT Generation: Professional subtitle file generation with accurate timestamps from API
Transcript Search: Search functionality with result highlighting in segments
Responsive Studio Layout: Two-column design optimized for both desktop and mobile
Better Timestamp Accuracy: OpenAI Whisper provides more precise segment boundaries
Streamlined UI: Removed speaker diarization option to simplify the interface
Segment Data Storage: All transcriptions now store segment arrays with timestamps
Modal-Based Studio: Studio opens in a dedicated modal for focused work without losing context
Click-to-Seek: Instant audio navigation by clicking transcript segments
Session Persistence: Segments are saved in session history for later playback
Audio Playback: Fixed audio player to properly connect with transcript
Time Synchronization: Resolved timing issues between audio and transcript highlighting
Export Quality: Improved SRT/VTT file format generation with proper timestamps
Tue, 22 Jul 2025
v2.1.0
Universal Audio Format Support: Added automatic audio format conversion powered by CloudConvert API
Seamless Format Conversion: M4A, AAC, MP4, WMA, AIFF, and CAF files are now automatically converted to MP3 before transcription
iPhone Recording Support: Native support for M4A files from iPhone Voice Memos and other iOS recording apps
Windows Media Audio Support: Added WMA file support for Windows users
macOS Audio Support: Full compatibility with AIFF and CAF audio formats from macOS applications
CloudConvert API Integration: New /api/convert/cloud endpoint with comprehensive job management and error handling
Intelligent Format Detection: Advanced file format detection utility with 15+ supported audio formats
Real-Time Conversion Transparency: Added detailed API response logging visible in UI details panel during conversion
Conversion Progress Tracking: Live status updates with step-by-step conversion process visibility
Secure Temporary File Handling: Automatic cleanup of conversion files with privacy-focused temporary storage
Enhanced File History: Converted files now properly appear as 'file uploads' in transcription history with original filenames
Smart Conversion Routing: Automatic detection and routing based on file format without user intervention
CloudConvert Health Monitoring: Built-in API health checks and service availability validation
Comprehensive Format Documentation: Updated documentation with detailed format support matrix and conversion explanations
Multi-Format Download System: Complete download dropdown with support for TXT, Markdown, DOCX, and PDF formats
Professional Document Generation: Real DOCX files using the docx library with proper formatting, titles, and paragraph structure
PDF Export with Text Wrapping: PDF generation using jsPDF with automatic text wrapping and professional layout
Enhanced Markdown Export: Structured Markdown files with proper headings and formatting
Arabic Language Detection: Automatic detection of Arabic text using comprehensive Unicode character ranges
Arabic Format Beta Warnings: Clear beta ribbons on DOCX and PDF downloads for Arabic content with explanatory notices
Format Compatibility Notices: User-friendly warnings explaining format limitations for Arabic text with recommendations
Enhanced User Experience: Conversion process is completely transparent with immediate 'Converting...' UI feedback
Processing Time Optimization: Direct formats (MP3, WAV, FLAC, OGG) process immediately, converted formats add only 30-60 seconds
Automatic Workflow Integration: Conversion seamlessly flows into existing transcription pipeline without user interaction
Mobile Conversion Support: All conversion features fully functional on mobile devices with touch-optimized interfaces
Visual Format Indicators: Color-coded format categories (green for direct support, blue for auto-conversion) in documentation
Enhanced Progress Visualization: Updated TranscriptionProcessing component with conversion-specific status messages
Improved Error Handling: Better error messages and fallback options for conversion failures
Updated Help Documentation: Completely rewritten audio format support section with visual guides and process explanations
File Format Classification: Clear distinction between 'directly supported' and 'auto-converted' formats for user expectations
Enhanced File Validation: Improved file type detection with both extension and MIME type checking
README Documentation: Updated project README with comprehensive CloudConvert setup instructions and format support matrix
Environment Configuration: Enhanced environment variable documentation for CloudConvert API setup
Unified Transcription Results UI: Complete redesign with mobile-inspired card layout replacing complex tabs system
Streamlined Download Interface: Single dropdown button with organized format options and visual icons
Improved Dropdown Visibility: Solid backgrounds with proper shadows and borders for better readability
Enhanced Download Performance: Optimized file generation with proper memory cleanup and blob URL management
Internationalization Awareness: Smart language detection with format-specific recommendations for non-Latin scripts
Session History Persistence: Fixed transcription history only keeping most recent entry - now properly accumulates all transcriptions chronologically
Session ID Management: Modified createSession() to always generate unique session IDs instead of reusing cookie values
SSL Certificate Issues: Resolved development environment SSL certificate errors in Replicate API calls
UI State Management: Fixed conversion status not appearing immediately - UI now shows 'Converting...' state instantly
File History Categorization: Resolved converted files appearing as 'URL audio' - now properly categorized as file uploads with original filenames
Conversion Flow Metadata: Enhanced data flow to preserve original file information throughout conversion process
File Input Validation: Updated file input to accept all supported formats including M4A, AAC, and other convertible types
UnsupportedFormatHelp Component: Updated component to handle conversion failures rather than format rejection
Mon, 21 Jul 2025
v2.0.0
Added V2 Announcement modal for first-time visitors with celebratory confetti animation
Implemented localStorage-based persistence to show announcement only once per user
Added debug function seenV2(false) for developers to re-enable the announcement modal
Documentation Page: Created comprehensive documentation at /documentation with getting started guide, features overview, troubleshooting, and FAQ
Mobile-First UI System: Developed a complete mobile-optimized interface with hamburger navigation, touch-friendly components, and responsive layouts
Mobile Navigation: Added slide-out hamburger menu with backdrop overlay and organized navigation sections
Mobile-Specific Components: Created MobileHeader, MobileFooter, MobileButton, MobileInput, MobileDialog, and MobileTranscriptionResult components
Touch-Optimized Interactions: Implemented 44px minimum touch targets, active state feedback, and gesture-friendly animations
Responsive Transcription Results: Added mobile-specific transcription display with task-oriented layout, preview mode, and expandable full text view
Mobile-Optimized Changelog: Created accordion-based mobile changelog with version and category sections for better mobile navigation
Safe Area Support: Enhanced mobile experience with proper safe area handling for modern smartphones
Analytics Opt-Out: You can now opt-out of analytics tracking in the settings.
Improved Session History: Added ranged query support for fetching your transcription history.
Enhanced user onboarding experience with welcoming V2 announcement
Integrated confetti animation with proper z-index layering for visual celebration
Complete Migration to Next.js: The entire application has been migrated from Vite to Next.js, including the move from Netlify Functions to Next.js API Routes and the adoption of native Next.js routing.
Modernized Styling: Adopted CSS variables for theming and migrated to Tailwind CSS v4 syntax for a more maintainable and modern codebase.
Mobile-First Responsive Design: Extended Tailwind configuration with mobile breakpoints, touch detection utilities, and custom spacing for optimal mobile experience
Conditional Mobile/Desktop Rendering: Implemented smart device detection to serve optimized interfaces without affecting existing desktop functionality
Enhanced Touch Feedback: Added scale animations, haptic-style feedback, and improved button states for better mobile interaction
Mobile Layout Optimization: Redesigned headers, footers, and form layouts specifically for mobile devices while preserving desktop experience
Documentation Organization: Migrated documentation from component-based routing to proper Next.js App Router structure
Component Architecture: Cleaned up legacy Vite routing code and consolidated documentation into /app/documentation/page.tsx
Data Architecture: Centralized changelog data in dedicated /data/changelog.ts module for consistency across components
TypeScript Improvements: Fixed ButtonProps export issues and enhanced type safety across UI components
Simplified PDF Generation: Removed the external 'printerz' PDF generation service, streamlining the architecture.
Smoother UI: Replaced layout transitions with more fluid, spring-based animations.
Codebase Health: Performed a major refactoring across the entire application, improving type safety, removing unused code, simplifying components, and enforcing a consistent code style with ESLint and Prettier.
Resolved a type error for the fileInputRef prop.
Fixed various linting errors across the application and API.
Corrected the environment variable for the Replicate API.
Enabled proper client-side routing within Next.js.
Fixed TypeScript compilation errors in AnimatedButton component by properly importing ButtonProps interface
Resolved module export issues with UI component type definitions
Updated Footer navigation links to point to correct /documentation route instead of /docs
Cleaned up old Vite LazyRoutes.tsx file that was causing build failures
Fixed mobile component integration compilation errors in TranscriptionForm.tsx
Resolved Framer Motion animation type issues in mobile components
Fixed conditional rendering logic for mobile vs desktop component display
Thu, 26 Jun 2025
v1.4.7
Refactored generatePdfLocally function to reduce cognitive complexity.
Addressed cognitive complexity issue in PDF generation logic.
Thu, 26 Jun 2025
v1.4.6
Added basic content to the Documentation page.
Added a "Back to Home" button on the Documentation page.
Thu, 26 Jun 2025
v1.4.4
Removed FFmpeg dependency and all related audio conversion logic.
Fixed build errors caused by FFmpeg dependency issues.
Wed, 23 Apr 2025
v1.4.3
Enhanced transcription reliability with automatic batch size reduction
Added smart retry logic for handling GPU memory limitations
Fixed "CUDA out of memory" errors by automatically reducing batch size and retrying
Resolved transcription failures on larger audio files with dynamic resource allocation
Tue, 23 Apr 2025
v1.4.2
Added improved multilingual document generation for better international language support
Implemented true PDF generation with proper multilingual text support
Enhanced document export to properly handle Arabic, Hebrew, and other non-Latin scripts
Optimized PDF generation with automatic RTL text direction detection
Added automatic fallback to HTML format when PDF generation fails
Implemented proper page layout with headers, footers, and multi-page support
Fixed issue with Arabic and other non-Latin text displaying as gibberish in exported documents
Fixed document generation issues when Printerz API is unavailable
Fixed inconsistent file extensions in document downloads
Mon, 21 Apr 2025
v1.4.1
Added dialog component for improved user interactions
Implemented session persistence for transcriptions
Added transcription history feature
Refactored code to reduce cognitive complexity in transcription processing
Enhanced exception handling in Firebase proxy service
Optimized Replicate client by removing unused variables
Enhanced transcription progress tracking with cleaner percentage ranges
Redesigned layout components for better user experience
Fixed ignored exceptions in Firebase proxy service
Removed useless variable assignment in Replicate client
Reduced complexity in transcribe function for better maintainability
Resolved typo where meta tag had placeholder information
Fixed decimal values in progress percentages for a cleaner UI experience
Sun, 13 Apr 2025
v1.4.0
Implemented audio upload and transcription functionality with URL support
Added SEO and social meta tags
Integrated branding assets (favicon, social image)
Introduced LoadingFallback component for lazy-loaded routes.
Created cleanup service for managing temporary files in Firebase.
Enhanced polling mechanism to fix timing issues
Refactored polling logic into custom hook useTranscriptionPolling
Replaced MainApp component with MainLayout for better organization.
Refactored UploadAudio component to utilize new FileUploadInput and UrlInput components.
Added cleanup functionality for temporary files in Firebase after transcription.
Implemented lazy loading for transcription-related components in MainLayout.
Enhanced URL validation logic in UploadAudio component.
Updated Firebase upload utility to handle base64 data uploads.
Improved error handling and logging in Replicate API interactions.
Resolved issue where polling stopped prematurely due to state update timing
Sat, 12 Apr 2025
v1.3.0
Added OGG support
Optimized analytics loading
Enhanced chunking strategy for improved performance
Performance optimizations across the application
Fri, 11 Apr 2025
v1.0.0
Added changelog and feedback components
Added Google site verification HTML file
Implemented dynamic imports for performance optimization
Added Google Analytics integration
Added TranscriptionProcessing and TranscriptionResult components
Added PDF and DOCX generation capabilities
Enhanced header and footer for changelog access
Wed, 9 Apr 2025
v0.5.0
Integrated cookie consent management with analytics tracking
Added ad blocker detection
Added Terms of Service and Privacy Policy components
Analytics initialization to use new Clarity library
Tue, 8 Apr 2025
v0.4.0
Added feedback form and modal for user feedback collection
Added Firebase configuration files
Added HTML template and instructions for template generation
Added .hintrc configuration file
Enhanced PDF generation with Firebase upload
Updated feedback modals to use specific IDs
Enhanced feedback form handling with device info detection
Updated README with environment variables section
Mon, 7 Apr 2025
v0.3.0
Added PDF generation support with pdfMake
Integrated Printerz API for PDF generation
Added customizable title input for PDFs
Refactored code structure for improved readability
Refactored PDF generation to remove unused code
Mon, 31 Mar 2025
v0.2.0
Added Firebase integration for file storage
Implemented FFmpeg download script
Added audio conversion functionality
Added CloudConvert function for audio conversion
Implemented cookie consent management
Enhanced dark mode support with improved text colors
Updated audio format support and user guidance
Refactored download-ffmpeg script to use ESM imports
Firebase file upload handling and cleanup process
Implemented alternative FFmpeg download script
Updated Netlify configuration
Added debug environment function and enhanced file conversion error handling
Tue, 18 Mar 2025
v0.1.1
Added Netlify deployment support with API functions
Updated .gitignore to include .env and .netlify folders
Disabled dark mode by changing Tailwind configuration
Updated Netlify build command and TypeScript configuration
Mon, 17 Mar 2025
v0.1.0
Implemented custom file input hook
Added audio upload component with improved error handling
Enhanced Tailwind CSS configuration with backdrop filter and typography plugin
Added Prettier configuration
Updated README with project overview and setup instructions
Audio transcription handling with improved error management
Audio upload component layout
TypeScript settings for module interoperability
Sun, 16 Mar 2025
v0.0.1
Initial project setup with Vite, React, and TypeScript
Added environment configuration, PostCSS, and ESLint