Crates.io | aud2txt |
lib.rs | aud2txt |
version | 0.5.0 |
created_at | 2025-05-10 22:12:18.411739+00 |
updated_at | 2025-05-10 22:12:18.411739+00 |
description | Audio to text tool using ggerganov's whisper.cpp |
homepage | |
repository | https://github.com/xandkar/aud2txt |
max_upload_size | |
id | 1668870 |
size | 73,725 |
Audio to text tool, using ggerganov's whisper.cpp via whisper-rs and FFmpeg.
ffmpeg
command is availablecargo install aud2txt
aud2txt <INPUT_FILE>
where <INPUT_FILE>
is any media file readable by ffmpeg
.
Also see the demo script.
Usage: aud2txt [OPTIONS] <INPUT_FILE>
Arguments:
<INPUT_FILE> Input audio file
Options:
-l, --log <LOG_LEVEL> [default: error]
-m, --model-file <MODEL_FILE>
-N, --no-normalize Disable audio normalization before conversion to text
-o, --output-file <OUTPUT_FILE> Output text file
-h, --help Print help
If --model-file
argument is omitted, aud2txt
will try to download and use
the default model from: https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.en.bin
If --no-normalize
flag is passed, the normalization step will be skiped,
removing the runtime dependency on ffmpeg
.