Overview is a document mining application originally built for investigative journalists. It’s also used for legal work, training machine learning models, and research of all types. It’s a visualization and analysis tool designed for sets of documents, from dozens to millions of pages of material.
Overview imports documents in many formats and languages, and exports to many formats as well. It includes built-in OCR, a sophisticated search engine, document annotation, word clouds, entity detection, and topic-based document clustering, along with tagging and metadata support. If you need custom analysis, you can write your own plugins using the API.
Overview Services Inc. provides paid support, custom feature development, and enterprise licensing. Contact us for details.