dotext

Simple Document File Text Extraction Library for Rust

by Robin Syihab

Version: 0.1.1, License: MIT

Document File Text Extractor

A simple Rust library for extracting readable text from specific document formats, such as Word documents (docx). Only a few formats are supported at present; more are planned.

Supported Documents

  • Microsoft Word (docx)
  • Microsoft Excel (xlsx)
  • Microsoft PowerPoint (pptx)
  • OpenOffice Writer (odt)
  • OpenOffice Spreadsheet (ods)
  • OpenDocument Presentation (odp)
  • PDF
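
As a hedged illustration of the list above, the sketch below maps a file extension to one of the listed formats. The `DocFormat` enum and the `detect_format` function are hypothetical names invented for this example and are not part of dotext; only the standard library is used.

```rust
use std::path::Path;

/// Hypothetical enum mirroring the supported-format list (not a dotext type).
#[derive(Debug, PartialEq, Eq, Clone, Copy)]
enum DocFormat {
    Docx,
    Xlsx,
    Pptx,
    Odt,
    Ods,
    Odp,
    Pdf,
}

/// Guess the document format from the file extension, ignoring ASCII case.
/// Returns `None` when there is no extension or it is not in the list.
fn detect_format<P: AsRef<Path>>(path: P) -> Option<DocFormat> {
    let ext = path.as_ref().extension()?.to_str()?.to_ascii_lowercase();
    match ext.as_str() {
        "docx" => Some(DocFormat::Docx),
        "xlsx" => Some(DocFormat::Xlsx),
        "pptx" => Some(DocFormat::Pptx),
        "odt" => Some(DocFormat::Odt),
        "ods" => Some(DocFormat::Ods),
        "odp" => Some(DocFormat::Odp),
        "pdf" => Some(DocFormat::Pdf),
        _ => None,
    }
}

fn main() {
    // Illustrative file names only; no files are opened.
    for name in ["report.docx", "table.ODS", "notes.txt"].iter() {
        match detect_format(name) {
            Some(format) => println!("{}: {:?}", name, format),
            None => println!("{}: unsupported", name),
        }
    }
}
```

A check like this could run before choosing which extractor to open, so that unsupported files are rejected early.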

Usage

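use dotext::*;      // assumed to bring Docx and its open() into scope
use std::io::Read;  // required for read_to_string

fn main() {
    // Open the sample document; panic with the underlying error if it fails.
    let mut file = Docx::open("samples/sample.docx").unwrap();
    let mut isi = String::new(); // "isi" is Indonesian for "content"
    file.read_to_string(&mut isi).expect("cannot read document text");
    println!("CONTENT:");
    println!("----------BEGIN----------");
    println!("{}", isi);
    println!("----------EOF----------");
}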
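
Because the usage snippet reads the document with `read_to_string`, which comes from `std::io::Read`, the same step can be wrapped in a helper that hands errors back to the caller. The following is a minimal sketch under stated assumptions: `read_all_text` is a hypothetical helper name, and an in-memory `Cursor` stands in for an opened document so the example runs without dotext or a sample file.

```rust
use std::io::{self, Cursor, Read};

/// Hypothetical helper: read everything from any `Read` source into a String,
/// returning the I/O error to the caller.
fn read_all_text<R: Read>(mut source: R) -> io::Result<String> {
    let mut text = String::new();
    source.read_to_string(&mut text)?;
    Ok(text)
}

fn main() -> io::Result<()> {
    // A Cursor over bytes stands in for an opened document here.
    let fake_doc = Cursor::new(b"Hello from a document".to_vec());
    let text = read_all_text(fake_doc)?;
    println!("CONTENT:");
    println!("{}", text);
    Ok(())
}
```

With dotext, the value returned by `Docx::open(...)` would presumably be passed in place of the `Cursor`, since the usage snippet calls `read_to_string` on it directly.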

Test

$ cargo test

Or run the example:

$ cargo run --example readdocx data/sample.docx
