| Crates.io | arabic_pdf_to_text |
| lib.rs | arabic_pdf_to_text |
| version | 0.1.0 |
| created_at | 2025-07-17 15:03:40.840476+00 |
| updated_at | 2025-07-17 15:03:40.840476+00 |
| description | A CLI tool to convert Arabic PDFs to text using Google's Gemini API |
| homepage | https://github.com/RustSandbox/arabic_pdf_to_text |
| repository | https://github.com/RustSandbox/arabic_pdf_to_text |
| max_upload_size | |
| id | 1757643 |
| size | 113,726 |
A Rust CLI tool that converts Arabic PDFs to text using Google's Gemini API.
The tool successfully:
cargo install arabic_pdf_to_text
# Clone the repository
git clone https://github.com/RustSandbox/arabic_pdf_to_text.git
cd arabic_pdf_to_text
# Build the project
cargo build --release
# The binary will be at target/release/arabic_pdf_to_text
Copy .env.example to .env:
cp .env.example .env
Add your Gemini API key:
export GEMINI_API_KEY="your-api-key"
# Process a PDF
./arabic_pdf_to_text "path/to/arabic.pdf" -o output.txt
# With custom chunk size
./arabic_pdf_to_text "path/to/arabic.pdf" --chunk-size 524288 -o output.txt
# See all options
./arabic_pdf_to_text --help
For production use, consider:
cargo build --release
cargo test
We welcome contributions! Please see our Contributing Guide for details.
This project is licensed under the MIT License - see the LICENSE file for details.
While this software is free for any use including commercial, if you use it in a commercial product or service, we kindly request (but do not require) that you include the following attribution:
This product includes software developed by the arabic_pdf_to_text project
(https://github.com/RustSandbox/arabic_pdf_to_text)