Document to plain text

Convert your document file into text.

Description

This component provides a convenient way to convert your document files into plain text. By using this feature, you can easily extract the textual content from complex file formats, making it accessible for further processing, analysis or integration into other applications. This can be particularly useful for tasks like data mining, text analysis and content management where having the text in a simple editable format is essential.

The Document to plain text component has the identifier of dtt-X, where X represents the instance number of the Document to plain text component.

Component settings

Parameter NameDescription

Data

The document file that will be converted to plain text.

Advanced configurations

OptionsDescription

Enable caching

This option determines whether the results of the component are cached. This means that on the next run of the Flow, Diaflow will utilize the previous computed component output, as long as the inputs have not changed.

Caching time

Only applicable if the "Enable Caching" option has been enabled. This parameter controls how long Diaflow will wait before automatically clearing the cache.

Use case

Here is a simple use case of the Document to plain text component, where the Document to plain text component is being used to generate text with an input PDF document.

Last updated