Workflows
Showing 2 results.
This workflow extracts the plain-text content of PDF files supplied to the input port. You can connect the Load PDF from directory workflow to this workflow's input. We recommend sending the output of this workflow to the Clean plain text workflow, because the PDF-to-text process can introduce characters that are invalid in XML, so the text cannot be sent to most services as plain text. Another way around this problem is to encode the text as Base64 using the handy loc...
Created: 2010-02-19 | Last updated: 2011-12-13
Credits: James Eales
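The two remedies mentioned in the PDF extraction description above (removing XML-invalid characters, or Base64-encoding the text) can be illustrated with a minimal Python sketch. This is not the implementation of the Clean plain text workflow; the function names `clean_plain_text` and `encode_base64` are hypothetical, and the character filter assumes the XML 1.0 definition of a valid character.

```python
import base64
import re

# Matches any character outside the XML 1.0 "Char" production:
# tab, LF, CR, U+0020-U+D7FF, U+E000-U+FFFD, U+10000-U+10FFFF.
_XML_INVALID = re.compile(
    r"[^\x09\x0A\x0D\x20-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def clean_plain_text(text: str) -> str:
    """Hypothetical helper: drop characters that XML 1.0 does not allow."""
    return _XML_INVALID.sub("", text)


def encode_base64(text: str) -> str:
    """Hypothetical helper: wrap the text as Base64 so no character is lost."""
    return base64.b64encode(text.encode("utf-8")).decode("ascii")


# PDF extraction often leaves form feeds and other control characters behind.
raw = "Page 1\x0c of text\x00 with\x0b debris"
cleaned = clean_plain_text(raw)   # control characters removed
encoded = encode_base64(raw)      # original text preserved, safely transportable
```

Cleaning is lossy but keeps the text readable for downstream services; Base64 keeps every character but requires the receiving service to decode it.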
This workflow attempts to split text into sentences, returning a list of sentences at the output port. The sentence splitting service uses the OpenNLP sentence detector and has been trained on English text. This workflow can be used to provide input to the Termine with c-value threshold workflow.
This is a workflow component, designed to be used as a nested workflow inside a larger text mining or text processing workflow.
Created: 2010-02-19 | Last updated: 2011-12-13
Credits: James Eales
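To show the shape of the sentence splitting workflow's input and output (one block of text in, a list of sentences out), here is a rough Python sketch. It is a naive regular-expression stand-in, not the OpenNLP sentence detector the service uses, and the name `split_sentences` is hypothetical.

```python
import re

# Naive boundary rule (an assumption for illustration): sentence-ending
# punctuation, then whitespace, then an uppercase letter.
_BOUNDARY = re.compile(r"(?<=[.!?])\s+(?=[A-Z])")


def split_sentences(text: str) -> list:
    """Hypothetical helper: return the sentences found in text as a list."""
    return [part.strip() for part in _BOUNDARY.split(text) if part.strip()]


sentences = split_sentences(
    "Text mining finds terms. It needs sentences first! Does this work?"
)
```

A rule like this splits wrongly after abbreviations such as "Dr. Smith", which is one reason a trained detector is preferable for real text.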