I built a tool, 'Language-Fixer', to automatically fix audio/subtitle metadata (Sonarr/Radarr/Whisper). I'd love your feedback on the idea.

🎬 Language-Fixer

A powerful Docker-based automation tool for managing audio and subtitle language metadata in media libraries. Integrates seamlessly with Sonarr and Radarr to automatically detect, tag, and organize your movie and TV show collections.

🧠 Born from Frustration: After finding countless tools that almost met the needs of a meticulous media monk, this project was born from a simple idea: create a tool that actually does what you want it to do. No compromises, no “close enough” solutions - just intelligent automation that works exactly as intended.

✨ Features

🎵 Audio Management

Smart Language Detection: Uses Whisper API for automatic audio language identification
Language Tagging: Automatically sets correct language metadata (eng, jpn, deu,…

🎬 Language-Fixer

🧠 Born from Frustration: After finding countless tools that almost met the needs of a meticulous media monk, this project was born from a simple idea: create a tool that actually does what you want it to do. No compromises, no “close enough” solutions - just intelligent automation that works exactly as intended.

✨ Features

🎵 Audio Management

Smart Language Detection: Uses Whisper API for automatic audio language identification
Language Tagging: Automatically sets correct language metadata (eng, jpn, deu, etc.)
Audio Title Formatting: Standardizes track titles (e.g., “Dolby Digital 2.0 (English)”)
Default Track Management: Intelligently sets default audio tracks based on language preferences
Commentary Detection: Preserves director’s commentary and special audio tracks

📺 Subtitle Management

Language-based Filtering: Keep only desired subtitle languages
Default Track Assignment: Automatically set preferred subtitle language as default
Cleanup: Remove unwanted subtitle tracks to save space

🗂️ Container & Stream Management

MP4 → MKV Conversion: Automatic container conversion for better metadata support
Stream Removal: Remove unwanted audio/subtitle/attachment streams
Font Management: Optional font attachment removal
Efficient Processing: Smart decision between full remux vs metadata-only edits

🔄 Integration & Automation

Sonarr Integration: Automatic TV show library scanning and updates
Radarr Integration: Seamless movie library management
Scheduled Scanning: Configurable intervals for library maintenance
Progress Tracking: SQLite database prevents reprocessing of files
Batch Processing: Efficient handling of large libraries

🛡️ Reliability & Performance

Crash-Safe Database: Batch commits prevent data loss during interruptions
Smart Remux Logic: Only performs full remux when structurally necessary
Metadata-Only Edits: Uses mkvpropedit for lightning-fast tag changes (500-1350x faster)
Error Handling: Robust retry logic and failure tracking
Dry Run Mode: Test configuration before making changes

🚀 Quick Start

🔒 Safety First: Language-Fixer defaults to DRY_RUN=true for safety!

First run shows you exactly what would be changed

Review the logs to verify planned changes

Set DRY_RUN=false only after confirming changes are correct

Smart defaults automatically become conservative when DRY_RUN=false

⚙️ Switching to Production Mode:

Run with DRY_RUN=true first (default)

Review the container logs: docker logs language-fixer

Verify the planned changes are correct

Set DRY_RUN=false in your docker-compose.yml or environment

Restart the container: docker compose up -d

🔄 Automatic Updates: Using :latest tag ensures automatic updates!

Language-Fixer checks for new versions at startup

Update with: docker compose pull && docker compose up -d

Always review release notes before updating

Docker Compose (Recommended)

version: '3.8'

services:
language-fixer:
image: luckyone94/language-fixer:latest
container_name: language-fixer
restart: unless-stopped
environment:
# User Configuration
- PUID=1000
- PGID=1000
- TZ=Europe/Berlin

# Database & Logging
- DB_PATH=/config/langfixer.db
- LOG_LEVEL=info

# Schedule & Behavior
- RUN_INTERVAL_SECONDS=43200  # 12 hours
- DRY_RUN=true                # SAFE DEFAULT: Review changes first!
- RUN_CLEANUP=true

# Audio Configuration
- REMOVE_AUDIO=true
- RENAME_AUDIO_TRACKS=true
- KEEP_AUDIO_LANGS=jpn,deu,eng,und
- DEFAULT_AUDIO_LANG=jpn

# Subtitle Configuration
- REMOVE_SUBTITLES=true
- KEEP_SUBTITLE_LANGS=jpn,deu,eng
- DEFAULT_SUBTITLE_LANG=deu

# Optional: Whisper API for unknown language detection
- WHISPER_API_URL=http://your-whisper-server:9000/asr
- WHISPER_TIMEOUT=300

# Optional: Sonarr Integration
- SONARR_URL=http://your-sonarr:8989
- SONARR_API_KEY=your-api-key
- SONARR_PATHS=/media/tv,/media/anime

# Optional: Radarr Integration
- RADARR_URL=http://your-radarr:7878
- RADARR_API_KEY=your-api-key
- RADARR_PATHS=/media/movies

# Advanced Options
- REMOVE_ATTACHMENTS=false
- REMOVE_FONTS=false
- KEEP_COMMENTARY=true
- MAX_FAILURES=3

volumes:
- /path/to/config:/config
- /path/to/movies:/media/movies
- /path/to/tv:/media/tv
- /path/to/anime:/media/anime

Docker Run

docker run -d \
--name language-fixer \
--restart unless-stopped \
-v /path/to/config:/config \
-v /path/to/movies:/media/movies \
-v /path/to/tv:/media/tv \
-e PUID=1000 \
-e PGID=1000 \
-e DRY_RUN=true \
-e KEEP_AUDIO_LANGS=eng,jpn \
-e DEFAULT_AUDIO_LANG=eng \
luckyone94/language-fixer:latest

🤖 Complete Stack with AI Language Detection

For automatic language detection of unknown audio tracks, you can run a local Whisper ASR service alongside Language-Fixer:

version: '3.8'

services:
# AI-Powered Language Detection Service
openai-whisper-asr-webservice:
image: onerahmet/openai-whisper-asr-webservice:latest-gpu
container_name: whisper-asr
restart: unless-stopped
ports:
- '9000:9000'
environment:
- ASR_ENGINE=faster_whisper
- ASR_MODEL=small           # Options: tiny, small, medium, large
- ASR_DEVICE=cuda           # Use 'cpu' if no GPU available
- FASTER_WHISPER_COMPUTE_TYPE=float16
deploy:
resources:
reservations:
devices:
- capabilities: [gpu]
count: 1
driver: nvidia
# For CPU-only systems, remove the deploy section and set ASR_DEVICE=cpu

# Main Language-Fixer Service
language-fixer:
image: luckyone94/language-fixer:latest
container_name: language-fixer
restart: unless-stopped
depends_on:
- openai-whisper-asr-webservice
environment:
# User Configuration
- PUID=1000
- PGID=1000
- TZ=Europe/Berlin

# Core Settings
- DRY_RUN=true              # Start safely!
- KEEP_AUDIO_LANGS=jpn,deu,eng,und
- DEFAULT_AUDIO_LANG=jpn
- KEEP_SUBTITLE_LANGS=jpn,deu,eng
- DEFAULT_SUBTITLE_LANG=deu

# AI Language Detection Integration
- WHISPER_API_URL=http://openai-whisper-asr-webservice:9000/asr
- WHISPER_TIMEOUT=300

volumes:
- /path/to/config:/config
- /path/to/movies:/media/movies
- /path/to/tv:/media/tv

networks:
default:
name: media-stack

📚 Whisper Service Details:

Repository: onerahmet/openai-whisper-asr-webservice
GPU Support: NVIDIA GPU recommended for better performance
CPU Fallback: Remove deploy section and set ASR_DEVICE=cpu for CPU-only systems
Model Options:
tiny - Fastest, least accurate (~1GB VRAM)
small - Balanced performance (~2GB VRAM) [Recommended]
medium - Better accuracy (~5GB VRAM)
large - Best accuracy (~10GB VRAM)
Performance: Processes 30-60 seconds of audio in 2-10 seconds (GPU) vs 30-120 seconds (CPU)

💻 CPU-Only Alternative

If you don’t have an NVIDIA GPU, use the CPU version:

openai-whisper-asr-webservice:
image: onerahmet/openai-whisper-asr-webservice:latest  # Note: no '-gpu' suffix
container_name: whisper-asr
restart: unless-stopped
ports:
- '9000:9000'
environment:
- ASR_ENGINE=faster_whisper
- ASR_MODEL=tiny            # Use 'tiny' for CPU for better performance
- ASR_DEVICE=cpu
# Remove the entire 'deploy' section for CPU-only

🎯 When to Use Whisper Integration

The AI language detection is particularly useful for:

Raw/Untitled Media: Files with und (undefined) language tags
Mixed Collections: Libraries from various sources with inconsistent tagging
International Content: Anime, foreign films, or multilingual media
Batch Processing: Automatically tag hundreds of files without manual review

Without Whisper: Files tagged as und are kept as-is (if und is in KEEP_AUDIO_LANGS) With Whisper: Files tagged as und are analyzed and retagged with detected language

⚙️ Configuration

📋 Startup Display: Language-Fixer shows a detailed configuration summary for 30 seconds at startup, displaying all active settings, defaults used, and safety warnings. This gives you time to review and cancel if needed.

🔧 Core Settings

Variable	Default	Description
`PUID`	568	User ID for file permissions
`PGID`	568	Group ID for file permissions
`TZ`	Europe/Berlin	Timezone for logging
`DB_PATH`	/config/langfixer.db	SQLite database location
`LOG_LEVEL`	info	Logging level (debug, info, warning, error)
`RUN_INTERVAL_SECONDS`	43200	Scan interval in seconds (12h default)
`DRY_RUN`	true	Safe mode - no file changes (NEW DEFAULT!)

🎵 Audio Settings

Variable	Default	Description
`REMOVE_AUDIO`	Smart*	Remove unwanted audio tracks
`RENAME_AUDIO_TRACKS`	true	Standardize audio track titles
`KEEP_AUDIO_LANGS`	jpn,deu,eng,und	Audio languages to preserve
`DEFAULT_AUDIO_LANG`	jpn	Preferred default audio language
`KEEP_COMMENTARY`	true	Keep director’s commentary

*Smart Default: true when DRY_RUN=true, false when DRY_RUN=false (safety!)

📺 Subtitle Settings

Variable	Default	Description
`REMOVE_SUBTITLES`	Smart*	Remove unwanted subtitle tracks
`KEEP_SUBTITLE_LANGS`	jpn,deu,eng	Subtitle languages to preserve
`DEFAULT_SUBTITLE_LANG`	deu	Preferred default subtitle language

*Smart Default: true when DRY_RUN=true, false when DRY_RUN=false (safety!)

🔗 Integration Settings

Variable	Default	Description
`SONARR_URL`	-	Sonarr server URL
`SONARR_API_KEY`	-	Sonarr API key
`SONARR_PATHS`	/media/tv	Paths monitored by Sonarr
`RADARR_URL`	-	Radarr server URL
`RADARR_API_KEY`	-	Radarr API key
`RADARR_PATHS`	/media/movies	Paths monitored by Radarr

🤖 AI Language Detection

Variable	Default	Description
`WHISPER_API_URL`	-	OpenAI Whisper API endpoint (see Complete Stack)
`WHISPER_TIMEOUT`	300	Whisper API timeout (seconds)

🔧 Advanced Options

Variable	Default	Description
`MAX_FAILURES`	3	Skip files after X failures
`BATCH_COMMIT_SIZE`	10	Database commits every X files
`FFMPEG_TIMEOUT`	1800	FFmpeg processing timeout (seconds)
`MKVPROPEDIT_TIMEOUT`	300	mkvpropedit timeout (seconds)
`FFMPEG_SAMPLE_TIMEOUT`	60	Audio sampling timeout (seconds)
`LOG_STATS_ON_COMPLETION`	true	Log detailed statistics after scan

📖 How It Works

1. 🔍 File Discovery

Scans configured paths for .mkv and .mp4 files
Skips already processed files (tracked in SQLite database)
Respects failure limits to avoid infinite retry loops

2. 🧠 Stream Analysis

Uses ffprobe to analyze video/audio/subtitle streams
Identifies current language tags and track properties
Detects commentary tracks and special content

3. 🎯 Language Detection

For untagged audio (und): Uses Whisper API if configured
Samples 3 segments from the file for accurate detection
Applies majority voting for final language determination

4. ⚡ Smart Processing Decision

Metadata-Only Changes: Uses mkvpropedit (seconds)
Language tag corrections
Audio title standardization
Default flag management
Full Remux: Uses ffmpeg (minutes) only when necessary
Stream removal
MP4 → MKV conversion
Structural changes

5. 💾 Progress Tracking

Batch commits every 10 files prevent data loss
Failed files are tracked with retry limits
Statistics collection for reporting

6. 🔄 Integration Updates

Notifies Sonarr/Radarr of processed files
Triggers library rescans for updated content
Maintains sync with media server databases

📊 Performance

Language-Fixer delivers exceptional performance through intelligent processing decisions:

⚡ Smart Processing Engine

Operation Type	Processing Time	Resource Usage	Use Case
Metadata Changes	2-5 seconds	<1% CPU, <1MB I/O	Language tags, audio titles, default flags
Stream Removal	5-15 minutes	Moderate CPU	Remove unwanted audio/subtitle tracks
Container Conversion	10-30 minutes	High CPU	MP4 → MKV, structural changes

🎯 Processing Logic

mkvpropedit: Used for metadata-only changes (99% of operations)
ffmpeg remux: Only when structural changes are required
Automatic Detection: Smart decision based on required modifications
Zero Waste: No temporary files for metadata operations

📈 Typical Performance

Large Library (1000+ files): 2-4 hours for complete processing
10GB Movie File: 2-5 seconds for language/title updates
Memory Usage: <100MB consistent footprint
Disk I/O: Minimal impact on system performance

🔍 Monitoring & Troubleshooting

Key Log Messages

# Safety timer at startup
⏳ Zeige Konfiguration für 30 Sekunden...

# Successful file processing
✅ Erfolgreich verarbeitet: movie.mkv
🚫 Überspringe (bereits verarbeitet): movie.mkv

# Processing method indicators
⚡ Führe mkvpropedit durch...    # Fast metadata edit
⚙️ Führe Remux (ffmpeg) durch... # Full remux required

# Batch commits for data safety
💾 Batch-Commit nach 10 Dateien...

Database Troubleshooting

Check database status and processed files:

docker exec language-fixer python3 debug_database.py

Common Issues

Files being reprocessed every run:

Ensure /config volume is persistent and writable
Verify DRY_RUN=false for actual processing
Check container logs for database errors

Slow processing times:

Review processing method in logs (mkvpropedit vs ffmpeg)
Test configuration with DRY_RUN=true first
Check if Whisper API is responding within timeout

Container startup issues:

Verify user permissions (PUID/PGID)
Ensure media paths are correctly mounted
Check environment variable syntax

🤝 Contributing

We welcome contributions! Please feel free to:

Submit bug reports and feature requests via GitHub Issues
Create pull requests with improvements or bug fixes
Share your configuration examples and use cases
Contribute to documentation and translations

Development Setup

git clone https://github.com/Randomname653/language-fixer.git
cd language-fixer
# Edit language_fixer.py
# Test with DRY_RUN=true first

📄 License

This project is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License - see the LICENSE file for details.

Non-Commercial Use Only: This software may not be used for commercial purposes or sold. All other rights (use, modification, distribution) are granted under the CC BY-NC-SA 4.0 license.

🙏 Acknowledgments

FFmpeg for media processing capabilities
OpenAI Whisper for AI-powered language detection
Sonarr/Radarr for media library management integration
MKVToolNix for efficient metadata editing
The countless “almost-perfect” tools that inspired the creation of something better

⭐ If this tool finally gives you the media organization you’ve been searching for, please consider starring the repository!

🎬 Language-Fixer

✨ Features

🎵 Audio Management

🎬 Language-Fixer

✨ Features

🎵 Audio Management

📺 Subtitle Management

🗂️ Container & Stream Management

🔄 Integration & Automation

🛡️ Reliability & Performance

🚀 Quick Start

Docker Compose (Recommended)

Docker Run

🤖 Complete Stack with AI Language Detection

💻 CPU-Only Alternative

🎯 When to Use Whisper Integration

⚙️ Configuration

🔧 Core Settings

🎵 Audio Settings

📺 Subtitle Settings

🔗 Integration Settings

🤖 AI Language Detection

🔧 Advanced Options

📖 How It Works

1. 🔍 File Discovery

2. 🧠 Stream Analysis

3. 🎯 Language Detection

4. ⚡ Smart Processing Decision

5. 💾 Progress Tracking

6. 🔄 Integration Updates

📊 Performance

⚡ Smart Processing Engine

🎯 Processing Logic

📈 Typical Performance

🔍 Monitoring & Troubleshooting

Key Log Messages

Database Troubleshooting

Common Issues

🤝 Contributing

Development Setup

📄 License

🙏 Acknowledgments

Similar Posts