Part of the MLX Speech Models collection: speech AI models for Apple Silicon via MLX, covering ASR, TTS, VAD, diarization, and speaker embedding.
How to use aufklarer/Qwen3-ASR-0.6B-MLX-4bit with MLX:
# Download the model from the Hub
# (the quotes keep zsh, the default macOS shell, from expanding the brackets)
pip install "huggingface_hub[hf_xet]"
huggingface-cli download --local-dir Qwen3-ASR-0.6B-MLX-4bit aufklarer/Qwen3-ASR-0.6B-MLX-4bit
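The same download can also be scripted from Python. The sketch below is one way to do it, assuming the `huggingface_hub` package is installed; `default_local_dir` and `download_model` are hypothetical helper names introduced here for illustration and are not part of this repository.

```python
from pathlib import Path

REPO_ID = "aufklarer/Qwen3-ASR-0.6B-MLX-4bit"


def default_local_dir(repo_id: str) -> Path:
    """Directory named after the repo, matching the --local-dir used above."""
    return Path(repo_id.split("/")[-1])


def download_model(repo_id: str = REPO_ID) -> Path:
    """Fetch every file in the repo and return the local directory."""
    # Imported lazily so default_local_dir works without the package installed.
    from huggingface_hub import snapshot_download

    local_dir = default_local_dir(repo_id)
    snapshot_download(repo_id=repo_id, local_dir=local_dir)
    return local_dir
```

Calling `download_model()` needs network access and writes the files into `Qwen3-ASR-0.6B-MLX-4bit` under the current working directory.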
MLX 4-bit quantized conversion of Qwen/Qwen3-ASR-0.6B for Apple Silicon inference.
Used by the Qwen3ASR module in speech-swift:
let model = try await Qwen3ASRModel.fromPretrained()
let text = model.transcribe(audio: samples, sampleRate: 16000)
Command line:
audio transcribe audio.wav
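The Swift call above passes `sampleRate: 16000`, so it can help to confirm what a WAV file actually contains before transcribing it. The sketch below reads the header with Python's standard `wave` module. Whether the library or the CLI resamples other rates, and whether mono input is required, is not stated here, so the mono check is an assumption; `wav_info` and `is_16k_mono` are hypothetical helper names.

```python
import wave


def wav_info(path: str) -> tuple[int, int]:
    """Return (sample_rate_hz, channel_count) read from a WAV file header."""
    with wave.open(path, "rb") as wav:
        return wav.getframerate(), wav.getnchannels()


def is_16k_mono(path: str) -> bool:
    """True if the file is 16 kHz with a single channel (assumed target format)."""
    rate, channels = wav_info(path)
    return rate == 16000 and channels == 1
```

For example, `is_16k_mono("audio.wav")` returns `False` for a 44.1 kHz stereo recording. Note that the `wave` module only reads uncompressed PCM files and raises `wave.Error` for other encodings.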
Quantization: 4-bit
Base model: Qwen/Qwen3-ASR-0.6B