Crates.io | gpt4ocr |
lib.rs | gpt4ocr |
version | 0.3.2 |
source | src |
created_at | 2024-08-06 08:40:45.098375 |
updated_at | 2024-08-06 08:55:39.240404 |
description | Extract structured text from PDFs using OpenAI's GPT4o. |
homepage | |
repository | |
max_upload_size | |
id | 1327046 |
size | 4,953,123 |
A simple OCR tool that uses GPT-4o to perform OCR on pdf files. Requires a .env file with the following variables:
OPENAI_API_KEY=your_openai_api_key
Alternatively, you can pass in the OPENAI_API_KEY as an environment variable to the extract_json_from_pdf
function.
Runs on linux. Needs poppler-utils to be installed. To install it on Ubuntu, run
sudo apt install poppler-utils
sudo apt install libssl-dev
json" and "
" to get the JSON data. This handled by the library currently.