Automatic Speech Recognition
Safetensors
MLX
mlx-audio
whisper
speech-to-text
speech-to-speech
speech
speech generation
stt
audio
hf-asr-leaderboar
Instructions to use mlx-community/whisper-large-v3-asr-fp16 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use mlx-community/whisper-large-v3-asr-fp16 with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir whisper-large-v3-asr-fp16 mlx-community/whisper-large-v3-asr-fp16
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
mlx-community/whisper-large-v3-asr-fp16
This model was converted to MLX format from openai/whisper-large-v3 using mlx-audio version 0.3.0.
Refer to the original model card for more details on the model.
Use with mlx-audio
pip install -U mlx-audio
CLI Example:
python -m mlx_audio.stt.generate --model mlx-community/whisper-large-v3-asr-fp16 --audio "audio.wav"
Python Example:
from mlx_audio.stt.utils import load_model
from mlx_audio.stt.generate import generate_transcription
model = load_model("mlx-community/whisper-large-v3-asr-fp16")
transcription = generate_transcription(
model=model,
audio_path="path_to_audio.wav",
output_path="path_to_output.txt",
format="txt",
verbose=True,
)
print(transcription.text)
- Downloads last month
- 51
Model size
2B params
Tensor type
F16
·
Hardware compatibility
Log In to add your hardware
Quantized