DAS sequence retrieval and parsing with JDAS
Created: 2011-05-30 03:12:23
Last updated: 2011-05-30 03:14:03
Retrieve protein or genome sequences using the Distributed Annotation System (DAS) and create your own text output by modifying the JDAS component. To use this workflow with JDAS, copy the file http://www.ebi.ac.uk/~maven/m2repo/uk/ac/ebi/das/jdas/1.0.3/jdas-1.0.3.jar to the lib folder inside the Taverna application. This JAR file is a dependency needed to parse DAS output.
Workflow Components
Authors (0)
Dependencies (0)
Inputs (4)
Name |
Description |
IDs |
You can enter more than one ID as comma-separated values. The type of ID depends on the DAS coordinate system (http://www.dasregistry.org/help_coordsys.jsp) of the source you chose. For a DAS source of type "Protein Sequence" and authority "UniProt", use UniProt (TrEMBL and SwissProt) accessions, for instance "P37173". For a DAS source of type "Chromosome", authority "GRCh_37" and organism "Homo sapiens", use a chromosome ID, for instance "1".
|
DasSource |
Enter the URL of a DAS reference source in the format "http://www.server.com/das/source/". A public list of DAS reference sources that provide sequence information is available in the DAS registry (http://www.dasregistry.org/listSources.jsp). You can also create your own DAS source with easyDAS (http://www.ebi.ac.uk/panda-srv/easydas/).
|
start |
The start and stop attributes indicate the range of the segment to query. Both are optional in this workflow. Leave the value empty to get the whole sequence.
|
stop |
The start and stop attributes indicate the range of the segment to query. Both are optional in this workflow. Leave the value empty to get the whole sequence.
|
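The four inputs above map onto a DAS sequence request of the form `sequence?segment=ID:start,stop`, as described in the DAS 1.6 specification. The following is a minimal sketch of that mapping, not the code of the nested DAS_sequence_retrieval workflow, whose source is not shown here. The class and method names (`DasSequenceQuery`, `segment`, `sequenceUrl`) are hypothetical, and the sketch assumes that the range is dropped when either start or stop is empty.

```java
// Hypothetical helper: builds a DAS "sequence" request URL from the workflow inputs.
final class DasSequenceQuery {

    /** True when a workflow input is absent or blank, meaning "no range limit". */
    static boolean isBlank(String value) {
        return value == null || value.trim().isEmpty();
    }

    /** Returns "id" for the whole sequence, or "id:start,stop" for a range. */
    static String segment(String id, String start, String stop) {
        String trimmedId = id.trim();
        // Assumption: a range is only sent when both ends are given.
        if (isBlank(start) || isBlank(stop)) {
            return trimmedId;
        }
        return trimmedId + ":" + start.trim() + "," + stop.trim();
    }

    /** Appends the sequence command to a source URL such as http://www.server.com/das/source/ */
    static String sequenceUrl(String dasSource, String id, String start, String stop) {
        String base = dasSource.endsWith("/") ? dasSource : dasSource + "/";
        return base + "sequence?segment=" + segment(id, start, stop);
    }

    public static void main(String[] args) {
        // Whole protein sequence, then a chromosome range.
        System.out.println(sequenceUrl("http://www.server.com/das/source/", "P37173", "", ""));
        System.out.println(sequenceUrl("http://www.server.com/das/source/", "1", "1000", "2000"));
    }
}
```

The sketch does not URL-encode the ID, which is fine for plain accessions and chromosome names but would need attention for IDs containing reserved characters.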
Processors (4)
Name |
Type |
Description |
DAS_sequence_retrieval |
workflow |
|
JDAS |
beanshell |
Script
import uk.ac.ebi.das.jdas.schema.sequence.DASSEQUENCE;
import uk.ac.ebi.das.jdas.schema.sequence.SEQUENCE;
import uk.ac.ebi.das.jdas.adapters.sequence.DasSequenceAdapter;
import javax.xml.bind.JAXBContext;
import javax.xml.bind.Unmarshaller;
import java.io.ByteArrayInputStream;
/* Use JAXB and JDAS to parse the DAS sequence XML */
String sequenceText = "";
ByteArrayInputStream input = new ByteArrayInputStream(sequenceXML.getBytes());
JAXBContext jc = JAXBContext.newInstance("uk.ac.ebi.das.jdas.schema.sequence");
Unmarshaller unmarshaller = jc.createUnmarshaller();
DasSequenceAdapter dasSequence = new DasSequenceAdapter((DASSEQUENCE) unmarshaller.unmarshal(input));
/* Only the first SEQUENCE element of the response is used */
SEQUENCE sequence = dasSequence.getSequence().get(0);
/* Choose what to print and how to print it */
String newline = "\n";
sequenceText += "id: " + sequence.getId() + newline;
sequenceText += "label: " + sequence.getLabel() + newline;
/* Plain concatenation prints "null" instead of failing if start or stop is missing */
sequenceText += "start: " + sequence.getStart() + newline;
sequenceText += "stop: " + sequence.getStop() + newline;
sequenceText += "content: " + sequence.getContent() + newline;
sequenceText += "version: " + sequence.getVersion() + newline;
|
split_by_regex |
localworker |
Script
List split = new ArrayList();
if (!string.equals("")) {
    /* Split on "," unless a regex input is supplied */
    String regexString = ",";
    if (regex != void) {
        regexString = regex;
    }
    String[] result = string.split(regexString);
    for (int i = 0; i < result.length; i++) {
        /* Remove spaces so that "P37173, P12345" yields clean IDs */
        split.add(result[i].replaceAll(" ",""));
    }
}
|
regex_value |
stringconstant |
Value: "," |
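If you want to experiment with the output format of the JDAS processor outside Taverna and without the JDAS jar, the same text can be produced with the XML parser that ships with the JDK. This is a sketch of an alternative, not the JDAS API: it assumes the DAS response has the shape `<DASSEQUENCE><SEQUENCE id=… start=… stop=… version=… label=…>residues</SEQUENCE></DASSEQUENCE>` described in the DAS 1.6 specification, and the class name `DasSequenceText` is hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.xml.sax.InputSource;

// Hypothetical stand-in for the JDAS processor, using only JDK classes.
final class DasSequenceText {

    /** Formats the first SEQUENCE element of a DAS sequence response as plain text. */
    static String format(String sequenceXML) {
        try {
            DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
            // Resolve any external DTD to an empty document so parsing needs no network access.
            builder.setEntityResolver((publicId, systemId) -> new InputSource(new StringReader("")));
            Document doc = builder.parse(
                    new ByteArrayInputStream(sequenceXML.getBytes(StandardCharsets.UTF_8)));

            Element sequence = (Element) doc.getElementsByTagName("SEQUENCE").item(0);
            if (sequence == null) {
                throw new IllegalArgumentException("No SEQUENCE element in DAS response");
            }

            // Same field order as the JDAS script; missing attributes come back as "".
            String newline = "\n";
            StringBuilder text = new StringBuilder();
            text.append("id: ").append(sequence.getAttribute("id")).append(newline);
            text.append("label: ").append(sequence.getAttribute("label")).append(newline);
            text.append("start: ").append(sequence.getAttribute("start")).append(newline);
            text.append("stop: ").append(sequence.getAttribute("stop")).append(newline);
            text.append("content: ").append(sequence.getTextContent().trim()).append(newline);
            text.append("version: ").append(sequence.getAttribute("version")).append(newline);
            return text.toString();
        } catch (IllegalArgumentException e) {
            throw e;
        } catch (Exception e) {
            // Wrap parser and I/O failures so callers do not need checked exceptions.
            throw new IllegalArgumentException("Could not parse DAS sequence XML", e);
        }
    }

    public static void main(String[] args) {
        // Made-up sample response for illustration only.
        String sample = "<DASSEQUENCE><SEQUENCE id=\"P37173\" start=\"1\" stop=\"5\" "
                + "version=\"v1\" label=\"example\">MGRGL</SEQUENCE></DASSEQUENCE>";
        System.out.print(format(sample));
    }
}
```

Unlike the JDAS script, this version reads attributes as strings, so an absent attribute prints as an empty value rather than "null".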
Beanshells (2)
Name | Description | Inputs | Outputs |
JDAS | | sequenceXML | sequenceText |
create_range | | start, stop | range |
Outputs (3)
Name |
Description |
Sequence_Custom_Format |
This output is produced with the JDAS library (http://code.google.com/p/jdas/).
To use this workflow with JDAS, copy the file http://www.ebi.ac.uk/~maven/m2repo/uk/ac/ebi/das/jdas/1.0.3/jdas-1.0.3.jar to the lib folder inside the Taverna application. This JAR file is a dependency needed to parse DAS output.
|
Sequence_DAS_format |
The raw DAS response. See the specification for details of the DAS sequence format: http://www.biodas.org/documents/spec-1.6.html#sequence
|
Service_status |
The HTTP response status code of the request. For a list of codes see http://en.wikipedia.org/wiki/List_of_HTTP_status_codes
|
Datalinks (10)
Source | Sink |
DasSource | DAS_sequence_retrieval:DasSource |
start | DAS_sequence_retrieval:start |
stop | DAS_sequence_retrieval:stop |
split_by_regex:split | DAS_sequence_retrieval:ID |
DAS_sequence_retrieval:Sequence_DAS_format | JDAS:sequenceXML |
IDs | split_by_regex:string |
regex_value:value | split_by_regex:regex |
JDAS:sequenceText | Sequence_Custom_Format |
DAS_sequence_retrieval:Sequence_DAS_format | Sequence_DAS_format |
DAS_sequence_retrieval:Service_status | Service_status |
Version 1 (of 1)