DocBleach

An open-source Content Disarm & Reconstruction (CDR) tool that sanitizes Office, PDF and RTF documents.


DocBleach is Content Disarm and Reconstruction software that removes misbehaving dynamic content from Office files, along with anything else in a document that could threaten the safety of your computer.


How-tos


Building DocBleach

To build DocBleach, use Maven:

mvn clean package

The final jar is stored in cli/target/docbleach.jar.


Running DocBleach

You can use DocBleach either through the Web Interface or from the command line:

java -jar docbleach.jar -in unsafe_document.doc -out safe_doc.doc

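The input file may be a relative or absolute path, a URI (for example, an http:// link), or a dash (-), which by the usual command-line convention stands for standard input. The output file may be a relative or absolute path, or a dash (-) for standard output.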
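
The dash form makes it possible to use DocBleach in a pipeline. The sketch below wraps the command shown above in two small shell functions. It uses only the -in and -out options from this README; the function names, the DOCBLEACH_JAR variable and the directory layout are hypothetical, and it assumes that a dash means standard input or standard output, as is conventional.

```shell
# Assumption: path to the jar produced by the build step; override as needed.
DOCBLEACH_JAR="${DOCBLEACH_JAR:-cli/target/docbleach.jar}"

# Sanitize a document read from stdin and write the result to stdout.
# Assumes "-" stands for standard input/output (hypothetical helper).
bleach_stream() {
  java -jar "$DOCBLEACH_JAR" -in - -out -
}

# Sanitize every regular file in directory $1 into directory $2,
# keeping the original file names (hypothetical helper).
bleach_dir() {
  src="$1"
  dst="$2"
  mkdir -p "$dst" || return 1
  for f in "$src"/*; do
    [ -f "$f" ] || continue   # skip subdirectories and unmatched globs
    java -jar "$DOCBLEACH_JAR" -in "$f" -out "$dst/$(basename "$f")"
  done
}

# Usage:
#   bleach_stream < unsafe_document.doc > safe_doc.doc
#   bleach_dir ./incoming ./sanitized
```

Writing to a separate output directory keeps the unsanitized originals untouched, which is convenient when comparing a document before and after sanitization.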


Advanced Usage

See [Wiki][wiki] for more information.


Get the Sources

git clone https://github.com/docbleach/DocBleach.git
cd DocBleach
mvn install

Run the Tests

The tests use JUnit 5, which integrates with Maven. To run them, use mvn test.


License

See [LICENSE][license].


Project Status

Don't expect the code base to change every day, but feel free to contribute: new ideas are more than welcome, and threats evolve - so should we.

Some things would be awesome, though:

  • Adding a way to configure bleaches
  • Writing tests!
  • Writing more content to show and explain how the sanitization process works, and why it works
  • Adding more stats!




> Visit DocBleach Website <