Termine with c-value threshold
This workflow accepts a list of sentences from a single document and returns the terms found by the TerMine web service. It also allows you to set a threshold c-value score so that only terms with a user-controlled probability (of being a real term) are returned as an output.
To get sentences to supply to this workflow you can use the sentence splitting workflow. The TerMine service (used in this workflow) only accepts text in ASCII encoding, so you should also use the Clean plain text (ASCII) workflow before splitting sentences.
This is a workflow component, designed to be used as a nested workflow inside a larger text mining or text processing workflow.
Unfortunately there are some restrictions on IP access to the TerMine web service at the NaCTeM. These can be viewed here. If you are at a UK higher eductation institution then there should be no problems, others have to request access through this page.
Preview
Run
Run this Workflow in the Taverna Workbench...
Option 1:
Copy and paste this link into File > 'Open workflow location...'
http://myexperiment.org/workflows/1060/download?version=1
[ More Info ]
Workflow Components
Reviews (0)
Other workflows that use similar services (4)
Only the first 2 workflows that use similar services are shown. View all workflows that use these services.
Terms from collection of PDF files (2)
Created: 2010-02-19 | Last updated: 2011-12-13
Credits: James Eales
Comments (0)
No comments yet
Log in to make a comment