Workflows

Search filter terms
Filter by type
Filter by tag
Filter by user
Filter by licence
Filter by group
Filter by wsdl
Filter by curation
Results per page:
Sort by:
Showing 30 results. Use the filters on the left and the search box below to refine the results.
User: Marco Roos Licence: by-sa

Workflow Discover_entities (2)

Thumb
This workflow contains the 'Named Entity Recognize' web service from the AIDA toolbox, created by Sophia Katrenko. It can be used to discover entities of a certain type (determined by 'learned_model') in documents provided in a lucene output format. Known issues: The output of NErecognize contains concepts with / characters, breaking the xml. For post-processing its results it is better to use string manipulation than xml manipulations. The output is per document, which means entities will ...

Created: 2007-12-10 | Last updated: 2007-12-10

Credits: User Marco Roos User Sophia katrenko Network-member AID

Workflow TestIteratorStrategy_withCloning (2)

Thumb
This workflow implements a strategy for this problem: > I would like to perform an iteration including a dot product between > a list and a list of lists; example: > Input: > > [1] (1) > [A,B,C] (2) > [[a,b],[c,d],[e,f]] (3) > > Desired output: > > [1Aa, 1Ab, 1Bc, 1Bd, 1Ce, 1Cf] In this implementation a java beanshell is used to clone the items in list 2 as many times per item as there are items in the sublists of list 3. The iteration stra...

Created: 2007-11-29 | Last updated: 2007-11-29

Credits: User Marco Roos

Workflow BioAID_Discover_proteins_from_text_plus_sy... (1)

Thumb
This workflow discovers proteins from plain text and adds synonyms using Martijn Schuemie's proteins synonym service. Proteins are discovered with the AIDA 'Named Entity Recognize' web service by Sophia Katrenko (service based on LingPipe), from which output it filters out proteins. The Named Recognizer services uses the pre-learned genomics model, named 'MedLine', to find genomics concepts in plain text.

Created: 2007-11-15

Credits: User Marco Roos User Martijn Schuemie Network-member AID

Workflow Demo_DiseaseDiscovery_byHumanUniprot_scaffold (1)

Thumb
This workflow finds disease relevant to the query string via the following steps: A user query: a list of terms or boolean query - look at the Apache Lucene project for all details. E.g.: (EZH2 OR "Enhancer of Zeste" +(mutation chromatin) -clinical); consider adding 'ProteinSynonymsToQuery' in front of the input if your query is a protein. Retrieve documents: finds 'maximumNumberOfHits' relevant documents (abstract+title) based on query (the AIDA service inside is based on Apache's Lucene)...

Created: 2007-12-10

Credits: User Marco Roos Network-member AID

Workflow Retrieve_documents_MR1 (1)

Thumb
This workflow applies the search web service from the AIDA toolbox. Comments: This search service is based on lucene defaults; it may be necessary to optimize the querystring to adopt the behaviour to what is most relevant in a particular domain (e.g. for medline prioritizing based on publication date is useful). Lucene favours shorter sentences, which may be bad for subsequent information extraction.

Created: 2007-12-10

Credits: User Marco Roos User Edgar Network-member AID

Workflow Lucene_bioquery_optimizer_MR1 (1)

Thumb
This workflow does four things: it retrieves documents relevant for the query string it discovers entities in those documents, these are considered relevant entities it filters proteins from those entities (on the tag protein_molecule) it removes all terms from the list produced by 3 (query terms temporarily considered proteins) ToDo Replace step 4 by the following procedure: 1. remove the query terms from the output of NER (probably by a regexp matching on what is inside the tag, ...

Created: 2007-12-10

Credits: User Marco Roos Network-member AID

Workflow TestIteratorStrategy_withNesting (1)

Thumb
Implementation of the iteration workaround by Tom Oin conform the Q&A below. The nested workflow 'NestedProcessor' is called that to conform to Tom's explanation. For an alternative solution using a java beanshell to clone list items see 'TestIteratorStrategy_withCloning. This workflow implements the following Q&A: Marco Roos wrote: > Dear Taverna user, > > Issue 1: Complex iteration > > I would like to perform an iteration including a dot product between > a list and a list of li...

Created: 2007-11-29

Credits: User Marco Roos User Tomoinn

Workflow TestIterator (1)

Thumb
Workflow to experiment with list iteration strategies. Look at metadata of nested workflow 'Concatenate' to see the current iteration strategy.

Created: 2007-11-28

Credits: User Marco Roos

Workflow DatabaseID to ConceptID (7)

Thumb
Purpose: This workflow maps input Identifiers, common database identifiers, to the Concept Identifiers from the EMC ontology. Result: Concept Identifiers from the EMC ontology. Comments: Database: one of CAS, DRUG, etc. The supported databases are listed below (database, description, example). CAS, Chemical Abstracts Service registry number, 64-17-5. DRUG, Drug Bank, DB00316. AF, Affymetrix, 200007_at. CHEB, ChEBI, 16236. CHID, ChemIDplus, 0000050000. EG, Entrez-Gene, 3064. GO, Gene Ontol...

Created: 2012-06-25 | Last updated: 2014-07-14

Credits: User Marco Roos User Martijn Schuemie User Reinout van Schouwen Network-member BioSemantics

Workflow HPO-UMLS-ConceptID mapping (1)

Thumb
Generate HPO-Concept profiles via HPO-UMLS mappings. The result is a list of Concept IDs corresponding to Concept profiles for UMLS concepts that approximate HPO concepts. The output is a table of UMLS-ID, HPO- ID, COncept-ID rows.

Created: 2014-10-20

Credits: User Marco Roos Network-member BioSemantics

Results per page:
Sort by: