whisperd

Crates.iowhisperd
lib.rswhisperd
version0.1.12
sourcesrc
created_at2023-10-26 23:18:27.979619
updated_at2023-10-27 00:02:31.349546
descriptionHTTP server for the OpenAI Whisper speech-to-text model.
homepagehttps://github.com/tiero/whisperd
repositoryhttps://github.com/tiero/whisperd
max_upload_size
id1015399
size103,317
Marco Argentieri (tiero)

documentation

README

whisperd 🎙️

Crates.io Documentation GitHub release

A simple HTTP server written in Rust for the OpenAI Whisper speech-to-text model.

Features ✨

  • 🎧 Transcribe audio files
  • 🔄 OpenAI API compatibility
  • 🌈 Models
    • tiny.en
    • tiny
    • base.en
    • base
    • small.en
    • small
    • medium.en
    • medium
    • large
    • large-v1
  • 🌎 Languages
    • 🇬🇧 English (en)

    • 🇨🇳 Chinese (zh)

    • 🇩🇪 German (de)

    • 🇪🇸 Spanish (es)

    • 🇷🇺 Russian (ru)

Quickstart 🚀

  1. Clone this repository:
git clone https://github.com/tiero/whisperd.git
  1. Navigate to the repository and build:
cd whisperd
cargo build --release
  1. Run the server:
./target/release/whisperd serve --model_path path_to_whisper_model

Now, the server is running at http://localhost:8000 and ready to transcribe!

Usage 🛠️

CLI Commands

  • Start the transcription server:
whisperd serve --port 5000 --model_path <path_to_model> 
  • Transcribe a given audio file (this downloads the model automatically from HuggingFace):
whisperd transcribe --audio <path_to_audio>

For more advanced options, use:

whisperd --help

Contribution 🤝

Pull requests and issues are welcome!

License 📜

This project is licensed under the MIT License - see the LICENSE file for details.

Commit count: 29

cargo fmt