Nucleotide_InterProScan
Run InterProScan using a nucleotide sequence as input.
The InterProScan tool (http://www.ebi.ac.uk/Tools/InterProScan/) searches a protein sequence against a selection of protein domain, feature and family signature databases, and integrates the results giving potential assignments to InterPro entries and Gene Ontology terms. Since InterProScan is a protein search tool to use it with a nucleotide sequence, the sequence must be translated into a protein sequence. There are a number of ways of doing this, depending on the properties of the nucleotide sequence, in this case a simple open reading frame (ORF) model is used to obtain the candidate translations. These translations are filtered for length (>80aa) and a search against UniProtKB (http://www.uniprot.org/) is performed to ensure that only sequences which have some relationship with known protein space, on which the signatures used are based, are passed to InterProScan. Once the set of translations has been filtered the remaining sequences as passed on to InterProScan for analysis.
Note: the coordinates in the InterProScan output are in protein coordinates relative to the input translated sequence, to map these on to the input nucleotide sequence see the fasta header of the corresponding translated ORF where the nucleotide coordinates are shown.
This implementation uses:
- EBI's WSDbfetch web service (http://www.ebi.ac.uk/Tools/webservices/services/dbfetch) to retreive enties specified by database identifer.
- EMBOSS seqret tool (http://emboss.sourceforge.net/apps/release/5.0/emboss/apps/getorf.html) via Soaplab (http://www.ebi.ac.uk/Tools/webservices/soaplab/overview) to ensure input sequences are in an appropriate format (i.e. fasta format).
- EMBOSS getorf tool (http://emboss.sourceforge.net/apps/release/5.0/emboss/apps/getorf.html) via Soaplab (http://www.ebi.ac.uk/Tools/webservices/soaplab/overview) to find the ORFs, perform the translation and filter the translations for length.
- EBI's WSNCBIBlast web service (http://www.ebi.ac.uk/Tools/webservices/services/ncbiblast) to perform the filtering BLAST search against UniProtKB.
- EBI's WSInterProScan web service (http://www.ebi.ac.uk/Tools/webservices/services/interproscan) to access InterProScan for the final search.
and is based on the proceedure described for nucleotide InterProScan searches described on the WSInterProScan web pages (see http://www.ebi.ac.uk/Tools/webservices/services/interproscan).
Preview
Run
Run this Workflow in the Taverna Workbench...
Option 1:
Copy and paste this link into File > 'Open workflow location...'
http://myexperiment.org/workflows/229/download?version=3
[ More Info ]
Workflow Components
Workflow Type
Version 3 (of 4)
Log in to add Tags
Shared with Groups (0)
None
Statistics
Reviews (0)
Other workflows that use similar services (36)
Only the first 2 workflows that use similar services are shown. View all workflows that use these services.
Protein_transmembrane_prediction (2)
Created: 2008-10-26 | Last updated: 2011-04-01
Credits: Hamish McWilliam
Attributions: EBI_InterProScan_tmhmm_signalp EBI_Phobius tmap_single_sequence
Created: 2008-10-26 | Last updated: 2012-08-22
Credits: Hamish McWilliam
Attributions: EBI_WU-BLAST EBI_dbfetch_fetchBatch Fasta_string_to_fasta_list EBI_InterProScan
Comments (0)
No comments yet
Log in to make a comment