ttsforge Documentation

ttsforge is a command-line tool for converting EPUB files to audiobooks using Kokoro ONNX TTS (Text-to-Speech).

User Guide

API Reference

API Reference

Features

EPUB to Audiobook Conversion: Convert EPUB files to M4B, MP3, WAV, FLAC, or OPUS formats
50+ High-Quality Voices: Support for 9 languages with multiple voice options
SSMD Editing: Edit intermediate SSMD files to fine-tune pronunciation and pacing
Resumable Conversions: Long audiobook conversions can be interrupted and resumed
Phoneme Pre-tokenization: Pre-process text to phonemes for faster batch conversions
Configurable Filename Templates: Customize output filenames with book metadata
Voice Blending: Mix multiple voices for custom narration styles
GPU Acceleration: Optional GPU support for faster processing
Chapter Selection: Convert specific chapters or chapter ranges
Metadata Support: Automatic language detection and metadata embedding
Streaming Read: Real-time playback with the read command (optional audio extra)

Quick Example

# Install ttsforge
pip install ttsforge

# Convert an EPUB to audiobook (M4B format with chapters)
ttsforge convert book.epub

# Convert with a specific voice
ttsforge convert book.epub -v am_adam

# Convert specific chapters
ttsforge convert book.epub --chapters 1-5

# List available voices
ttsforge voices

Supported Languages

ttsforge supports 9 languages with native TTS voices:

American English (a) - 20 voices
British English (b) - 8 voices
Spanish (e) - 3 voices
French (f) - 1 voice
Hindi (h) - 4 voices
Italian (i) - 2 voices
Japanese (j) - 5 voices
Brazilian Portuguese (p) - 3 voices
Mandarin Chinese (z) - 8 voices

Requirements

Python 3.10 or later
ffmpeg (required for MP3/FLAC/OPUS/M4B output and chapter merging)
espeak-ng (for phonemization)
~330MB disk space for ONNX models (downloaded automatically)
sounddevice (optional, for playback features)

License

ttsforge is released under the MIT License.