Marco Roos' Workflows

Search filter terms
Filter by type
Filter by tag
Filter by licence
Filter by group
Filter by wsdl
Filter by curation
Results per page:
Sort by:
Showing 32 results. Use the filters on the left and the search box below to refine the results.

Workflow CloneItemsInList (1)

Thumb
Utility workflow that clones an item copy_number times. You can use this to work around standard iteration strategies, e.g. in combination with the CountListItems workflow. Workflow examples: TestIterationStrategy_withClones. For an alternative approach see TestIterationStrategy_withNesting. Example I/O: input: A copy_number: 3 result: [A,A,A] input: [A,B,C] copy_number: 3 result: [[A,A,A][B,B,B][C,C,C]] input: [A,B,C] copy_number: [3,2] result: [[[A,A,A],[A,A]][[B,B,B],[B,B]],[[C,C,C],...

Created: 2007-11-29

Credits: User Marco Roos

Workflow Extract_proteins (2)

Thumb
This workflow filters protein_molecule-labeled terms from an input string(list). The result is a tagged list of proteins (disregarding false positives in the input). Internal information: This workflow is a copy of 'filter_protein_molecule_MR3' used for the NBIC poster (now in Archive).

Created: 2007-12-10 | Last updated: 2007-12-10

Credits: User Marco Roos

Workflow Discover_entities (2)

Thumb
This workflow contains the 'Named Entity Recognize' web service from the AIDA toolbox, created by Sophia Katrenko. It can be used to discover entities of a certain type (determined by 'learned_model') in documents provided in a lucene output format. Known issues: The output of NErecognize contains concepts with / characters, breaking the xml. For post-processing its results it is better to use string manipulation than xml manipulations. The output is per document, which means entities will ...

Created: 2007-12-10 | Last updated: 2007-12-10

Credits: User Marco Roos User Sophia katrenko Network-member AID

Workflow TestIteratorStrategy_withCloning (2)

Thumb
This workflow implements a strategy for this problem: > I would like to perform an iteration including a dot product between > a list and a list of lists; example: > Input: > > [1] (1) > [A,B,C] (2) > [[a,b],[c,d],[e,f]] (3) > > Desired output: > > [1Aa, 1Ab, 1Bc, 1Bd, 1Ce, 1Cf] In this implementation a java beanshell is used to clone the items in list 2 as many times per item as there are items in the sublists of list 3. The iteration stra...

Created: 2007-11-29 | Last updated: 2007-11-29

Credits: User Marco Roos

Workflow BioAID_Discover_proteins_from_text_plus_sy... (1)

Thumb
This workflow discovers proteins from plain text and adds synonyms using Martijn Schuemie's proteins synonym service. Proteins are discovered with the AIDA 'Named Entity Recognize' web service by Sophia Katrenko (service based on LingPipe), from which output it filters out proteins. The Named Recognizer services uses the pre-learned genomics model, named 'MedLine', to find genomics concepts in plain text.

Created: 2007-11-15

Credits: User Marco Roos User Martijn Schuemie Network-member AID

Workflow Demo_DiseaseDiscovery_byHumanUniprot_scaffold (1)

Thumb
This workflow finds disease relevant to the query string via the following steps: A user query: a list of terms or boolean query - look at the Apache Lucene project for all details. E.g.: (EZH2 OR "Enhancer of Zeste" +(mutation chromatin) -clinical); consider adding 'ProteinSynonymsToQuery' in front of the input if your query is a protein. Retrieve documents: finds 'maximumNumberOfHits' relevant documents (abstract+title) based on query (the AIDA service inside is based on Apache's Lucene)...

Created: 2007-12-10

Credits: User Marco Roos Network-member AID

Workflow Retrieve_documents_MR1 (1)

Thumb
This workflow applies the search web service from the AIDA toolbox. Comments: This search service is based on lucene defaults; it may be necessary to optimize the querystring to adopt the behaviour to what is most relevant in a particular domain (e.g. for medline prioritizing based on publication date is useful). Lucene favours shorter sentences, which may be bad for subsequent information extraction.

Created: 2007-12-10

Credits: User Marco Roos User Edgar Network-member AID

Workflow Lucene_bioquery_optimizer_MR1 (1)

Thumb
This workflow does four things: it retrieves documents relevant for the query string it discovers entities in those documents, these are considered relevant entities it filters proteins from those entities (on the tag protein_molecule) it removes all terms from the list produced by 3 (query terms temporarily considered proteins) ToDo Replace step 4 by the following procedure: 1. remove the query terms from the output of NER (probably by a regexp matching on what is inside the tag, ...

Created: 2007-12-10

Credits: User Marco Roos Network-member AID

Workflow TestIteratorStrategy_withNesting (1)

Thumb
Implementation of the iteration workaround by Tom Oin conform the Q&A below. The nested workflow 'NestedProcessor' is called that to conform to Tom's explanation. For an alternative solution using a java beanshell to clone list items see 'TestIteratorStrategy_withCloning. This workflow implements the following Q&A: Marco Roos wrote: > Dear Taverna user, > > Issue 1: Complex iteration > > I would like to perform an iteration including a dot product between > a list and a list of li...

Created: 2007-11-29

Credits: User Marco Roos User Tomoinn

Workflow TestIterator (1)

Thumb
Workflow to experiment with list iteration strategies. Look at metadata of nested workflow 'Concatenate' to see the current iteration strategy.

Created: 2007-11-28

Credits: User Marco Roos

Results per page:
Sort by: