| Crates.io | xml-xls-parser |
| lib.rs | xml-xls-parser |
| version | 0.1.0 |
| created_at | 2020-04-13 15:49:49.996347+00 |
| updated_at | 2020-04-13 15:49:49.996347+00 |
| description | Parse XLS files as XML |
| homepage | |
| repository | https://github.com/blakehawkins/xml-xls-parser |
| max_upload_size | |
| id | 229735 |
| size | 3,722,260 |
I encountered some XLS files that fail to be parsed by a number of tools (xlrd, pandas, openpyxl, calamine).
The files appear to be in XML format with the following properties:
Workbook, Worksheet, Table, Row, Cell, Data, Styles, Style, NumberFormat, Font, Alignment
It is unclear what makes the files unreadable by XLS and XLSX parsers.
This project reads XLS files (XML-formatted documents) that consist only of the above properties and emits a best-effort TSV.
$ cp /path/to/file.xls input.xls
$ cargo run > out.tsv
$ less -S out.tsv
The implementation is just a serde specification of the document structure, using serde-xml-rs.
Expect to modify the code if your source document contains anything other than the properties defined above.
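As a rough illustration of the "best-effort TSV" step, here is a minimal sketch that uses only the standard library. It is not the crate's actual code: the `Row` type and the `sanitize` and `rows_to_tsv` functions are hypothetical names. It also assumes that "best-effort" means missing cells become empty fields and that tabs or newlines inside a cell are replaced with spaces.

```rust
/// One spreadsheet row: each cell either holds text or is missing.
/// (Hypothetical type for illustration; not taken from the crate.)
type Row = Vec<Option<String>>;

/// Replace characters that would break the TSV layout with a space.
/// (Assumed behaviour; the crate may handle these differently.)
fn sanitize(field: &str) -> String {
    field
        .chars()
        .map(|c| if c == '\t' || c == '\n' || c == '\r' { ' ' } else { c })
        .collect()
}

/// Render rows as TSV: one line per row, fields joined by tabs,
/// missing cells written as empty fields.
fn rows_to_tsv(rows: &[Row]) -> String {
    let mut out = String::new();
    for row in rows {
        let fields: Vec<String> = row
            .iter()
            .map(|cell| cell.as_deref().map(sanitize).unwrap_or_default())
            .collect();
        out.push_str(&fields.join("\t"));
        out.push('\n');
    }
    out
}

fn main() {
    // Made-up sample data: a header row and one row with a missing cell.
    let rows: Vec<Row> = vec![
        vec![Some("name".to_string()), Some("amount".to_string())],
        vec![Some("a\tb".to_string()), None],
    ];
    print!("{}", rows_to_tsv(&rows));
}
```

Keeping the TSV rendering separate from the XML deserialization means the serde types can be changed for a different source document without touching the output code.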