MLX-Audio¶
MLX-Audio is an audio framework built on Apple MLX for Apple Silicon Macs. It supports VoxCPM as one of its TTS backends, and gives you a CLI, an OpenAI-compatible FastAPI server, and a separate Next.js web UI.
Repo: Blaizzy/mlx-audio
Note
This is a good fit if you want to run VoxCPM on an Apple Silicon Mac through the MLX stack, especially when you also want a local API server or a browser UI.
VoxCPM 1.0 (0.5B) |
✅ Supported |
VoxCPM 1.5 |
✅ Supported |
VoxCPM 2 |
⚠️ This page focuses on the current VoxCPM 1.x path on MLX-Audio |
Prerequisites¶
Apple Silicon Mac (M1 or later)
Python 3.10+
mlxandmlx-audioinstalled
Installation¶
pip install mlx-audio
Or from source:
git clone https://github.com/Blaizzy/mlx-audio.git
cd mlx-audio
pip install -e .
CLI Usage¶
Generate speech directly from the command line:
mlx_audio.tts.generate --text "Hello from MLX-Audio!" --model voxcpm
The --model flag accepts voxcpm (maps to VoxCPM 1.0) or voxcpm1.5.
API Server¶
MLX-Audio includes an OpenAI-compatible FastAPI server:
mlx_audio.server --model voxcpm
This starts a local server that accepts /v1/audio/speech requests, compatible with the OpenAI TTS API format. You can point any OpenAI-compatible client at it.
Web UI¶
A separate Next.js web UI is included under mlx_audio/ui/. See the MLX-Audio README for setup instructions.
Limitations¶
VoxCPM 2 support is not yet available in MLX-Audio. The current backend covers VoxCPM 1.0 and 1.5.
Performance depends on your Apple Silicon chip. M1 Pro / M1 Max and later provide the best experience.
The MLX-Audio project is community-maintained and not officially affiliated with OpenBMB.