Transcriptr | AI Audio Transcription

Changelog

Fri, 31 Oct 2025

v3.0.0

New

Transcription Studio: Brand new interactive workspace for reviewing transcriptions with professional tools
Audio Player Integration: Full-featured audio player with play/pause, seek, skip (±10s), and volume controls
Interactive Transcript: Click any segment to jump to that exact moment in the audio
Real-time Segment Highlighting: Current segment automatically highlights during playback
OpenAI Whisper Model: Upgraded to OpenAI's official Whisper model for better accuracy and timestamp precision
Timestamp-Based Segments: Each transcription now includes precise start/end times for every segment
Enhanced Export Formats: Download transcriptions in TXT, SRT, VTT, and DOCX with proper timestamps
SRT/VTT Generation: Professional subtitle file generation with accurate timestamps from API
Transcript Search: Search functionality with result highlighting in segments
Responsive Studio Layout: Two-column design optimized for both desktop and mobile

Improved

Better Timestamp Accuracy: OpenAI Whisper provides more precise segment boundaries
Streamlined UI: Removed speaker diarization option to simplify the interface
Segment Data Storage: All transcriptions now store segment arrays with timestamps
Modal-Based Studio: Studio opens in a dedicated modal for focused work without losing context
Click-to-Seek: Instant audio navigation by clicking transcript segments
Session Persistence: Segments are saved in session history for later playback

Fixed

Audio Playback: Fixed audio player to properly connect with transcript
Time Synchronization: Resolved timing issues between audio and transcript highlighting
Export Quality: Improved SRT/VTT file format generation with proper timestamps

Tue, 22 Jul 2025

v2.1.0

New

Universal Audio Format Support: Added automatic audio format conversion powered by CloudConvert API
Seamless Format Conversion: M4A, AAC, MP4, WMA, AIFF, and CAF files are now automatically converted to MP3 before transcription
iPhone Recording Support: Native support for M4A files from iPhone Voice Memos and other iOS recording apps
Windows Media Audio Support: Added WMA file support for Windows users
macOS Audio Support: Full compatibility with AIFF and CAF audio formats from macOS applications
CloudConvert API Integration: New /api/convert/cloud endpoint with comprehensive job management and error handling
Intelligent Format Detection: Advanced file format detection utility with 15+ supported audio formats
Real-Time Conversion Transparency: Added detailed API response logging visible in UI details panel during conversion
Conversion Progress Tracking: Live status updates with step-by-step conversion process visibility
Secure Temporary File Handling: Automatic cleanup of conversion files with privacy-focused temporary storage
Enhanced File History: Converted files now properly appear as 'file uploads' in transcription history with original filenames
Smart Conversion Routing: Automatic detection and routing based on file format without user intervention
CloudConvert Health Monitoring: Built-in API health checks and service availability validation
Comprehensive Format Documentation: Updated documentation with detailed format support matrix and conversion explanations
Multi-Format Download System: Complete download dropdown with support for TXT, Markdown, DOCX, and PDF formats
Professional Document Generation: Real DOCX files using the docx library with proper formatting, titles, and paragraph structure
PDF Export with Text Wrapping: PDF generation using jsPDF with automatic text wrapping and professional layout
Enhanced Markdown Export: Structured Markdown files with proper headings and formatting
Arabic Language Detection: Automatic detection of Arabic text using comprehensive Unicode character ranges
Arabic Format Beta Warnings: Clear beta ribbons on DOCX and PDF downloads for Arabic content with explanatory notices
Format Compatibility Notices: User-friendly warnings explaining format limitations for Arabic text with recommendations

Improved

Enhanced User Experience: Conversion process is completely transparent with immediate 'Converting...' UI feedback
Processing Time Optimization: Direct formats (MP3, WAV, FLAC, OGG) process immediately, converted formats add only 30-60 seconds
Automatic Workflow Integration: Conversion seamlessly flows into existing transcription pipeline without user interaction
Mobile Conversion Support: All conversion features fully functional on mobile devices with touch-optimized interfaces
Visual Format Indicators: Color-coded format categories (green for direct support, blue for auto-conversion) in documentation
Enhanced Progress Visualization: Updated TranscriptionProcessing component with conversion-specific status messages
Improved Error Handling: Better error messages and fallback options for conversion failures
Updated Help Documentation: Completely rewritten audio format support section with visual guides and process explanations
File Format Classification: Clear distinction between 'directly supported' and 'auto-converted' formats for user expectations
Enhanced File Validation: Improved file type detection with both extension and MIME type checking
README Documentation: Updated project README with comprehensive CloudConvert setup instructions and format support matrix
Environment Configuration: Enhanced environment variable documentation for CloudConvert API setup
Unified Transcription Results UI: Complete redesign with mobile-inspired card layout replacing complex tabs system
Streamlined Download Interface: Single dropdown button with organized format options and visual icons
Improved Dropdown Visibility: Solid backgrounds with proper shadows and borders for better readability
Enhanced Download Performance: Optimized file generation with proper memory cleanup and blob URL management
Internationalization Awareness: Smart language detection with format-specific recommendations for non-Latin scripts

Fixed

Session History Persistence: Fixed transcription history only keeping most recent entry - now properly accumulates all transcriptions chronologically
Session ID Management: Modified createSession() to always generate unique session IDs instead of reusing cookie values
SSL Certificate Issues: Resolved development environment SSL certificate errors in Replicate API calls
UI State Management: Fixed conversion status not appearing immediately - UI now shows 'Converting...' state instantly
File History Categorization: Resolved converted files appearing as 'URL audio' - now properly categorized as file uploads with original filenames
Conversion Flow Metadata: Enhanced data flow to preserve original file information throughout conversion process
File Input Validation: Updated file input to accept all supported formats including M4A, AAC, and other convertible types
UnsupportedFormatHelp Component: Updated component to handle conversion failures rather than format rejection

Mon, 21 Jul 2025

v2.0.0

New

Added V2 Announcement modal for first-time visitors with celebratory confetti animation
Implemented localStorage-based persistence to show announcement only once per user
Added debug function seenV2(false) for developers to re-enable the announcement modal
Documentation Page: Created comprehensive documentation at /documentation with getting started guide, features overview, troubleshooting, and FAQ
Mobile-First UI System: Developed a complete mobile-optimized interface with hamburger navigation, touch-friendly components, and responsive layouts
Mobile Navigation: Added slide-out hamburger menu with backdrop overlay and organized navigation sections
Mobile-Specific Components: Created MobileHeader, MobileFooter, MobileButton, MobileInput, MobileDialog, and MobileTranscriptionResult components
Touch-Optimized Interactions: Implemented 44px minimum touch targets, active state feedback, and gesture-friendly animations
Responsive Transcription Results: Added mobile-specific transcription display with task-oriented layout, preview mode, and expandable full text view
Mobile-Optimized Changelog: Created accordion-based mobile changelog with version and category sections for better mobile navigation
Safe Area Support: Enhanced mobile experience with proper safe area handling for modern smartphones
Analytics Opt-Out: You can now opt-out of analytics tracking in the settings.
Improved Session History: Added ranged query support for fetching your transcription history.

Improved

Enhanced user onboarding experience with welcoming V2 announcement
Integrated confetti animation with proper z-index layering for visual celebration
Complete Migration to Next.js: The entire application has been migrated from Vite to Next.js, including the move from Netlify Functions to Next.js API Routes and the adoption of native Next.js routing.
Modernized Styling: Adopted CSS variables for theming and migrated to Tailwind CSS v4 syntax for a more maintainable and modern codebase.
Mobile-First Responsive Design: Extended Tailwind configuration with mobile breakpoints, touch detection utilities, and custom spacing for optimal mobile experience
Conditional Mobile/Desktop Rendering: Implemented smart device detection to serve optimized interfaces without affecting existing desktop functionality
Enhanced Touch Feedback: Added scale animations, haptic-style feedback, and improved button states for better mobile interaction
Mobile Layout Optimization: Redesigned headers, footers, and form layouts specifically for mobile devices while preserving desktop experience
Documentation Organization: Migrated documentation from component-based routing to proper Next.js App Router structure
Component Architecture: Cleaned up legacy Vite routing code and consolidated documentation into /app/documentation/page.tsx
Data Architecture: Centralized changelog data in dedicated /data/changelog.ts module for consistency across components
TypeScript Improvements: Fixed ButtonProps export issues and enhanced type safety across UI components
Simplified PDF Generation: Removed the external 'printerz' PDF generation service, streamlining the architecture.
Smoother UI: Replaced layout transitions with more fluid, spring-based animations.
Codebase Health: Performed a major refactoring across the entire application, improving type safety, removing unused code, simplifying components, and enforcing a consistent code style with ESLint and Prettier.

Fixed

Resolved a type error for the fileInputRef prop.
Fixed various linting errors across the application and API.
Corrected the environment variable for the Replicate API.
Enabled proper client-side routing within Next.js.
Fixed TypeScript compilation errors in AnimatedButton component by properly importing ButtonProps interface
Resolved module export issues with UI component type definitions
Updated Footer navigation links to point to correct /documentation route instead of /docs
Cleaned up old Vite LazyRoutes.tsx file that was causing build failures
Fixed mobile component integration compilation errors in TranscriptionForm.tsx
Resolved Framer Motion animation type issues in mobile components
Fixed conditional rendering logic for mobile vs desktop component display

Thu, 26 Jun 2025

v1.4.7

Improved

Refactored generatePdfLocally function to reduce cognitive complexity.

Fixed

Addressed cognitive complexity issue in PDF generation logic.

Thu, 26 Jun 2025

v1.4.6

New

Added basic content to the Documentation page.
Added a "Back to Home" button on the Documentation page.

Thu, 26 Jun 2025

v1.4.4

Improved

Removed FFmpeg dependency and all related audio conversion logic.

Fixed

Fixed build errors caused by FFmpeg dependency issues.

Wed, 23 Apr 2025

v1.4.3

Improved

Enhanced transcription reliability with automatic batch size reduction
Added smart retry logic for handling GPU memory limitations

Fixed

Fixed "CUDA out of memory" errors by automatically reducing batch size and retrying
Resolved transcription failures on larger audio files with dynamic resource allocation

Tue, 23 Apr 2025

v1.4.2

New

Added improved multilingual document generation for better international language support
Implemented true PDF generation with proper multilingual text support

Improved

Enhanced document export to properly handle Arabic, Hebrew, and other non-Latin scripts
Optimized PDF generation with automatic RTL text direction detection
Added automatic fallback to HTML format when PDF generation fails
Implemented proper page layout with headers, footers, and multi-page support

Fixed

Fixed issue with Arabic and other non-Latin text displaying as gibberish in exported documents
Fixed document generation issues when Printerz API is unavailable
Fixed inconsistent file extensions in document downloads

Mon, 21 Apr 2025

v1.4.1

New

Added dialog component for improved user interactions
Implemented session persistence for transcriptions
Added transcription history feature

Improved

Refactored code to reduce cognitive complexity in transcription processing
Enhanced exception handling in Firebase proxy service
Optimized Replicate client by removing unused variables
Enhanced transcription progress tracking with cleaner percentage ranges
Redesigned layout components for better user experience

Fixed

Fixed ignored exceptions in Firebase proxy service
Removed useless variable assignment in Replicate client
Reduced complexity in transcribe function for better maintainability
Resolved typo where meta tag had placeholder information
Fixed decimal values in progress percentages for a cleaner UI experience

Sun, 13 Apr 2025

v1.4.0

New

Implemented audio upload and transcription functionality with URL support
Added SEO and social meta tags
Integrated branding assets (favicon, social image)
Introduced LoadingFallback component for lazy-loaded routes.
Created cleanup service for managing temporary files in Firebase.

Improved

Enhanced polling mechanism to fix timing issues
Refactored polling logic into custom hook useTranscriptionPolling
Replaced MainApp component with MainLayout for better organization.
Refactored UploadAudio component to utilize new FileUploadInput and UrlInput components.
Added cleanup functionality for temporary files in Firebase after transcription.
Implemented lazy loading for transcription-related components in MainLayout.
Enhanced URL validation logic in UploadAudio component.
Updated Firebase upload utility to handle base64 data uploads.
Improved error handling and logging in Replicate API interactions.

Fixed

Resolved issue where polling stopped prematurely due to state update timing

Sat, 12 Apr 2025

v1.3.0

New

Added OGG support
Optimized analytics loading
Enhanced chunking strategy for improved performance

Improved

Performance optimizations across the application

Fri, 11 Apr 2025

v1.0.0

New

Added changelog and feedback components
Added Google site verification HTML file
Implemented dynamic imports for performance optimization
Added Google Analytics integration
Added TranscriptionProcessing and TranscriptionResult components
Added PDF and DOCX generation capabilities

Improved

Enhanced header and footer for changelog access

Wed, 9 Apr 2025

v0.5.0

New

Integrated cookie consent management with analytics tracking
Added ad blocker detection
Added Terms of Service and Privacy Policy components

Improved

Analytics initialization to use new Clarity library

Tue, 8 Apr 2025

v0.4.0

New

Added feedback form and modal for user feedback collection
Added Firebase configuration files
Added HTML template and instructions for template generation
Added .hintrc configuration file
Enhanced PDF generation with Firebase upload

Improved

Updated feedback modals to use specific IDs
Enhanced feedback form handling with device info detection
Updated README with environment variables section

Mon, 7 Apr 2025

v0.3.0

New

Added PDF generation support with pdfMake
Integrated Printerz API for PDF generation
Added customizable title input for PDFs

Improved

Refactored code structure for improved readability

Fixed

Refactored PDF generation to remove unused code

Mon, 31 Mar 2025

v0.2.0

New

Added Firebase integration for file storage
Implemented FFmpeg download script
Added audio conversion functionality
Added CloudConvert function for audio conversion
Implemented cookie consent management

Improved

Enhanced dark mode support with improved text colors
Updated audio format support and user guidance
Refactored download-ffmpeg script to use ESM imports

Fixed

Firebase file upload handling and cleanup process
Implemented alternative FFmpeg download script
Updated Netlify configuration
Added debug environment function and enhanced file conversion error handling

Tue, 18 Mar 2025

v0.1.1

New

Added Netlify deployment support with API functions

Improved

Updated .gitignore to include .env and .netlify folders

Fixed

Disabled dark mode by changing Tailwind configuration
Updated Netlify build command and TypeScript configuration

Mon, 17 Mar 2025

v0.1.0

New

Implemented custom file input hook
Added audio upload component with improved error handling
Enhanced Tailwind CSS configuration with backdrop filter and typography plugin
Added Prettier configuration
Updated README with project overview and setup instructions

Improved

Audio transcription handling with improved error management
Audio upload component layout
TypeScript settings for module interoperability

Sun, 16 Mar 2025

v0.0.1

New

Initial project setup with Vite, React, and TypeScript
Added environment configuration, PostCSS, and ESLint