pdf_process

Crates.iopdf_process
lib.rspdf_process
version0.2.0
sourcesrc
created_at2024-08-05 09:47:27.800615
updated_at2024-08-20 01:01:23.140731
descriptionLibrary for rendering and extracting metadata/text from PDF files using poppler
homepagehttps://github.com/jacobtread/pdf_process
repositoryhttps://github.com/jacobtread/pdf_process
max_upload_size
id1325774
size107,793
Jacob (jacobtread)

documentation

README

PDF Process

Library for processing PDF files in Rust, wraps the CLI utilities provided by Poppler specifically pdftotext (Text extraction), pdftocairo (Image rendering), pdfinfo (Extracting basic details)

Provides functionality for:

  • Extracting PDF text contents
  • Rendering PDF files to images (PNG/JPEG/TIFF)
  • Basic PDF Details (Encryption, Page Count, Subject, Title, Creator, Author, etc..)

Prerequisites

Library developed against a Linux host. Windows is not supported

Requires Plopper be installed on your system and the utilities on your PATH. Lots of distributions will come with this pre-installed. You can check if its installed by using pdfinfo -v which should produce an output similar to:

pdfinfo version 24.02.0
Copyright 2005-2024 The Poppler Developers - http://poppler.freedesktop.org
Copyright 1996-2011, 2022 Glyph & Cog, LLC,

Otherwise you can install it with one of the commands below:

Fedora:

sudo dnf install poppler-utils

Adjust the command above for your specific Linux distribution

Installation

Install with cargo:

cargo add pdf_process

Or add the following to the [dependencies] section of your Cargo.toml:

pdf_process = "0.1.0"

Tested

Tested against:

  • pdftotext version 24.02.0
  • pdftocairo version 24.02.0
  • pdfinfo version 24.02.0
Commit count: 0

cargo fmt