TextExtractor extracts plain text from hundreds of different file types, storing the text extracted in suitably named text files.

TextExtractor 1.10 works in six different modes :-

Instant Mode - Just select any file and extract the text from it.
Batch Mode - Select a group of files and extract the text from all of them in one go.
Polling Mode - Watch a folder location, processing new files as they appear there.
Hierarchical Mode - Extract Text from files in a directory hierarchy.
File List Mode - Extract Text from files in a list.
File Viewer - Select individual files from a file tree to see their textual content.

Features

  • Reads PDF,DOC,DOX,XLS.XLSX,ODT,RTF and many other file types.
  • Also reads DLLs, EXE, COM and binary files.
  • Outputs plain text files, one for each file processed.
  • Extract text instantly, in batch mode, or poll a folder and process files as they appear there.
  • Fast, accurate text extraction.
  • Process multiple file types at the same time.
  • Process whole directory hierarchies.
  • View the text in individual files selected from a directory tree.
  • Make a single list of files, where ever they are located and extract text from them.

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow TextExtractor

TextExtractor Web Site

Other Useful Business Software
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of TextExtractor!

Additional Project Details

Operating Systems

Windows

Intended Audience

End Users/Desktop

User Interface

Java Swing

Registered

2022-11-16