Text extraction#

This module provides a framework for extracting text content from various file formats, such as PDFs and Office documents. The extracted content can be used programmatically or made available directly on your File objects.

Installation#

bash

composer require silverstripe/textextraction

GitHub repository#

https://github.com/silverstripe/silverstripe-textextraction

Text extraction#

Installation#

GitHub repository#

Configuration

Usage

Apache Solr

Tika