## Textract Rust library for extracting text from various file types. supported file extension. txt odf ods odt pptx xlsx pdf ## Installation and usage; Use cargo to install textract. ``` // there is a pdf file at ./tmp.pdf let content = textract::extract("tmp.pdf","pdf").unwrap; // content contains raw text in pdf. do whatever you want. ``` main.rs contains usage of textract library. ### commandline The command line as simple. ``` textract tmp.pdf pdf ``` ## Roadmap. This lib is in beta stage with few file types support. but texract supports will keep increasing the file types support. since this project is part of ![achoz](https://github.com/kcubeterm/achoz) * supports of compressed file and tar archives * use lib magic to guess file types. * All types of documents files.