aud2txt

Crates.io	aud2txt
lib.rs	aud2txt
version	0.5.0
created_at	2025-05-10 22:12:18.411739+00
updated_at	2025-05-10 22:12:18.411739+00
description	Audio to text tool using ggerganov's whisper.cpp
homepage
repository	https://github.com/xandkar/aud2txt
max_upload_size
id	1668870
size	73,725

Siraaj Khandkar (xandkar)

documentation

README

aud2txt

Audio to text tool, using ggerganov's whisper.cpp via whisper-rs and FFmpeg.

install

install FFmpeg (via your package manager or directly)
ensure ffmpeg command is available
cargo install aud2txt

usage

TL;DR

aud2txt <INPUT_FILE>

where <INPUT_FILE> is any media file readable by ffmpeg.

Also see the demo script.

options

Usage: aud2txt [OPTIONS] <INPUT_FILE>

Arguments:
  <INPUT_FILE>  Input audio file

Options:
  -l, --log <LOG_LEVEL>            [default: error]
  -m, --model-file <MODEL_FILE>
  -N, --no-normalize               Disable audio normalization before conversion to text
  -o, --output-file <OUTPUT_FILE>  Output text file
  -h, --help                       Print help

If --model-file argument is omitted, aud2txt will try to download and use the default model from: https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.en.bin

If --no-normalize flag is passed, the normalization step will be skiped, removing the runtime dependency on ffmpeg.

Commit count: 21

aud2txt

documentation

README

aud2txt

install

usage

TL;DR

options

cargo fmt