FreeEed Demo

FreeEed For Windows
FreeEed For Linux
https://github.com/elephantscale/elastic-labs/blob/master/integrations/1-FreeEed.md

Text summarization

To create an automatic summary of every document, select the checkbox on the “AI Advisor” tab of the settings. Here is an example. Input: “House Speaker Nancy Pelosi and Democratic leaders have greenlighted a plan to craft legislation that would prohibit members of Congress from trading stock, after months of resistance to a ban […]

Tesseract install

To enable OCR, you need to install Tesseract. OCR is not supported on Windows, only on macOS and Linux.
Linux: sudo apt-get install tesseract-ocr
macOS: brew install tesseract
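After installing, it can help to confirm that the tesseract binary is actually on your PATH. The following is a minimal sketch, not part of FreeEed itself; the function names have_cmd and tesseract_status are made up for this illustration.

```shell
#!/bin/sh
# Return 0 if the named command is available on PATH, non-zero otherwise.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

# Print a one-line status about the Tesseract install (never exits the shell).
tesseract_status() {
    if have_cmd tesseract; then
        echo "tesseract found: $(command -v tesseract)"
    else
        echo "tesseract not found; install it with apt-get or brew"
    fi
}

tesseract_status
```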

PII detection

FreeEed can detect PII using Inabia or Amazon.
Example
Input:
Hello 713-777-7777 Name: John Doe, johndoe@gmail.com. Lorem Ipsum is simply dummy text of the printing and typesetting industry. 1301 McKinney St #2400, Houston, TX 77010
Output:
Phone: 713-777-7777
Name: John Doe
Email: johndoe@gmail.com
Address: 1301 McKinney St #2400, Houston, TX 77010
How to tell FreeEed […]
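To get a feel for what such extraction means, here is a rough sketch that pulls the phone number and email address out of a shortened version of the example input using plain regular expressions. This is not how Inabia or Amazon detect PII; it is a swapped-in, much simpler technique, and it cannot find names or addresses. The function names are hypothetical, and the -o option of grep is assumed (available in GNU and BSD grep).

```shell
#!/bin/sh
# Illustrative only: simple regexes, NOT the Inabia or Amazon detection logic.
SAMPLE='Hello 713-777-7777 Name: John Doe, johndoe@gmail.com. Lorem Ipsum is simply dummy text.'

# Print US-style phone numbers (NNN-NNN-NNNN), one per line.
find_phones() {
    printf '%s\n' "$1" | grep -oE '[0-9]{3}-[0-9]{3}-[0-9]{4}'
}

# Print email-like tokens, one per line.
find_emails() {
    printf '%s\n' "$1" | grep -oE '[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}'
}

echo "Phone: $(find_phones "$SAMPLE")"
echo "Email: $(find_emails "$SAMPLE")"
```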

Lotus Notes Processing

Currently FreeEed supports two options for processing Lotus databases.
1.) Windows and Notes
- You should run under Windows.
- You should have Lotus Designer 8.5.3 installed (as free software).
- Add the path to the Lotus Notes directory to your system path.
- You should then be ready to go; just process your NSF files.
2.) […]

FreeEed Installation

Install Java. Amazon Corretto works great! Don’t forget to set JAVA_HOME and JRE_HOME; again, follow the Corretto instructions.
On Mac, install readpst: brew install libpst
Unzip your download and run “start_all.bat” on Windows or “start_all.sh” on Mac or Linux.
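Before running the start script on Mac or Linux, you may want to confirm that both variables are set. The snippet below is a small sketch for that check; the function name missing_java_vars is invented here and is not part of FreeEed.

```shell
#!/bin/sh
# Report which of JAVA_HOME and JRE_HOME are unset or empty.
# Returns 0 when both are set, 1 otherwise.
missing_java_vars() {
    missing=""
    for name in JAVA_HOME JRE_HOME; do
        # Read the variable whose name is stored in $name.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            missing="$missing $name"
        fi
    done
    if [ -n "$missing" ]; then
        echo "not set:$missing"
        return 1
    fi
    echo "JAVA_HOME and JRE_HOME are set"
}

# "|| true" keeps the script going even when something is missing.
missing_java_vars || true
```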

FreeEed capabilities

- Works in Windows, Mac, Linux, VirtualBox, Amazon AWS cloud
- Ability to process over 1,400 file types, including MS Office and PST files (Tika formats and more)
- OCR
- Culling
- Deduplication, for documents and emails
- Document review (through the companion project FreeEedUI)
- Imaging, that is, conversion of documents to PDF

Cluster processing details

The most important point, for now, is that you execute this command (or a similar one with changed parameters) on the cluster:
cd FreeEed; ./run_hadoop_s3.sh 0002.project.s3 /freeeed/output 1
FreeEedUI does this by logging in with SSH and issuing the command. If you want to run it manually on the cluster, that will also work. Just make sure you […]
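As a rough illustration of issuing that command over SSH by hand, the sketch below builds the remote command string and prints the ssh invocation rather than running it. The user name, host name, and the helper remote_cmd are hypothetical placeholders; the three arguments are passed through exactly as in the example, without assuming what each one means.

```shell
#!/bin/sh
# Build the remote command string from the three arguments in the example.
# The arguments are passed through as-is; their meaning is not assumed here.
remote_cmd() {
    printf 'cd FreeEed; ./run_hadoop_s3.sh %s %s %s\n' "$1" "$2" "$3"
}

# CLUSTER_USER and CLUSTER_HOST are hypothetical placeholders.
CLUSTER_USER="${CLUSTER_USER:-hadoop}"
CLUSTER_HOST="${CLUSTER_HOST:-cluster.example.com}"

# Print the ssh invocation instead of running it, so nothing executes by accident.
echo "ssh $CLUSTER_USER@$CLUSTER_HOST '$(remote_cmd 0002.project.s3 /freeeed/output 1)'"
```

Once the printed line looks right for your cluster, you can run it directly in a terminal.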

Changelog

All notable changes to this project will be documented in this file.
[8.0.4]
- Load the result of eDiscovery production
- Load the DAT file with TEXT files directory
- Latest Tika (V1.24.1)
- Ingesting HTML files
[8.0.3] – 2020-07-10
- Stabilized
- Started a review of the software