ttsforge Documentation
ttsforge is a command-line tool for converting EPUB files to audiobooks using Kokoro ONNX TTS (Text-to-Speech).
User Guide
Features
EPUB to Audiobook Conversion: Convert EPUB files to M4B, MP3, WAV, FLAC, or OPUS formats
50+ High-Quality Voices: Support for 9 languages with multiple voice options
SSMD Editing: Edit intermediate SSMD files to fine-tune pronunciation and pacing
Resumable Conversions: Long audiobook conversions can be interrupted and resumed
Phoneme Pre-tokenization: Pre-process text to phonemes for faster batch conversions
Configurable Filename Templates: Customize output filenames with book metadata
Voice Blending: Mix multiple voices for custom narration styles
GPU Acceleration: Optional GPU support for faster processing
Chapter Selection: Convert specific chapters or chapter ranges
Metadata Support: Automatic language detection and metadata embedding
Streaming Read: Real-time playback with the
readcommand (optional audio extra)
Quick Example
# Install ttsforge
pip install ttsforge
# Convert an EPUB to audiobook (M4B format with chapters)
ttsforge convert book.epub
# Convert with a specific voice
ttsforge convert book.epub -v am_adam
# Convert specific chapters
ttsforge convert book.epub --chapters 1-5
# List available voices
ttsforge voices
Supported Languages
ttsforge supports 9 languages with native TTS voices:
American English (a) - 20 voices
British English (b) - 8 voices
Spanish (e) - 3 voices
French (f) - 1 voice
Hindi (h) - 4 voices
Italian (i) - 2 voices
Japanese (j) - 5 voices
Brazilian Portuguese (p) - 3 voices
Mandarin Chinese (z) - 8 voices
Requirements
Python 3.10 or later
ffmpeg (required for MP3/FLAC/OPUS/M4B output and chapter merging)
espeak-ng (for phonemization)
~330MB disk space for ONNX models (downloaded automatically)
sounddevice (optional, for playback features)
License
ttsforge is released under the MIT License.