Tag Results

Items tagged with "text cleaning" (3)

Note: some items may not be visible to you, due to viewing permissions.


Workflows (3)

Workflow Clean plain text (ASCII) (1)

Thumb
This workflow will remove any XML-invalid and non-ASCII characters (e.g. for sending to the ASCII-only Termine service) from any text supplied to the input port. This is a workflow component, designed to be used as a nested workflow inside a larger text mining or text processing workflow.

Created: 2010-02-18 | Last updated: 2011-12-13

Credits: User James Eales

Workflow Clean plain text (1)

Thumb
This workflow will remove any XML-invalid characters (these characters often appear in the output of PDF to text software) from any text supplied to the input port. This is a workflow component, designed to be used as a nested workflow inside a larger text mining or text processing workflow.  

Created: 2010-02-18 | Last updated: 2011-12-13

Credits: User James Eales

Workflow One sentence per line (1)

Thumb
This workflow accepts a plain text input and provides a single text document per input containing one sentence per line.  Newline characters are removed from the original input. The OpenNLP sentence splitter is used to split the text, this is provided by University of Manchester Web Services.

Created: 2011-05-06 | Last updated: 2011-12-13

Credits: User James Eales

What is this?

Linked Data

Non-Information Resource URI: http://www.myexperiment.org/tags/1782


Alternative Formats

HTML
RDF
XML