| Crates.io | aud2txt |
| lib.rs | aud2txt |
| version | 0.5.0 |
| created_at | 2025-05-10 22:12:18.411739+00 |
| updated_at | 2025-05-10 22:12:18.411739+00 |
| description | Audio to text tool using ggerganov's whisper.cpp |
| homepage | |
| repository | https://github.com/xandkar/aud2txt |
| max_upload_size | |
| id | 1668870 |
| size | 73,725 |
Audio to text tool, using ggerganov's whisper.cpp via whisper-rs and FFmpeg.
ffmpeg command is availablecargo install aud2txtaud2txt <INPUT_FILE>
where <INPUT_FILE> is any media file readable by ffmpeg.
Also see the demo script.
Usage: aud2txt [OPTIONS] <INPUT_FILE>
Arguments:
<INPUT_FILE> Input audio file
Options:
-l, --log <LOG_LEVEL> [default: error]
-m, --model-file <MODEL_FILE>
-N, --no-normalize Disable audio normalization before conversion to text
-o, --output-file <OUTPUT_FILE> Output text file
-h, --help Print help
If --model-file argument is omitted, aud2txt will try to download and use
the default model from: https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.en.bin
If --no-normalize flag is passed, the normalization step will be skiped,
removing the runtime dependency on ffmpeg.