Tutorial 1 Workflow
Created: 2014-04-16 14:55:56
Last updated: 2014-04-16 15:07:17
Fetch and combine transcriptomics and proteomics data from NSC11 cells in
GEO and PRIDE respectively. Apply spectral counting on the proteomics data.
The workflow demonstrates how to combine data from different domains (here microarray-
based transcriptomics and mass spectrometry-based proteomics) and do something
useful with the data. The workflow also demonstrates spreadsheet import and XPath
parsing of XML data.
This workflow is for tutorial/educational use only. The cells were not grown
under the same conditions, nor should raw spectral counts be used for protein
quantitation.
Preview
Run
Run this Workflow in the Taverna Workbench...
Workflow Components
Authors (1)
Titles (1)
Descriptions (1)
Fetch and combine transcriptomics and proteomics data from NSC11 cells in
GEO and PRIDE respectively. Apply spectral counting on the proteomics data.
The workflow demonstrates how to combine data from different domains (here microarray-
based transcriptomics and mass spectrometry-based proteomics) and do something
useful with the data. The workflow also demonstrates spreadsheet import and XPath
parsing of XML data.
This workflow is for tutorial/educational use only. The cells were not grown
under the same conditions, nor should raw spectral counts be used for protein
quantitation. |
Dependencies (0)
Inputs (1)
Name |
Description |
UniProt_Accession2GeneID |
|
Processors (11)
Name |
Type |
Description |
Line_Break |
stringconstant |
Value\r\n|\n |
Split_string_into_string_list_by_regular_expression |
localworker |
ScriptList split = new ArrayList();
if (!string.equals("")) {
String regexString = ",";
if (regex != void) {
regexString = regex;
}
String[] result = string.split(regexString);
for (int i = 0; i < result.length; i++) {
split.add(result[i]);
}
}
|
XPath_Service |
xpath |
Xpath Expression/default:msms_pipeline_analysis/default:msms_run_summary/default:spectrum_query/default:search_result/default:search_hit/@protein |
Spectral_Counting |
beanshell |
Scriptout1 = new ArrayList();
counts = new HashMap();
for (i=0; i |
Merge_Gene_Expression_and_Spectral_Counts |
beanshell |
ScriptString delims = "[ \t|]+";
String[] tokens;
out1 = new ArrayList();
counts = new Hashtable();
for (i=0; i |
Gene_Expression_with_UniProt_Accessions |
beanshell |
ScriptString delims = "[ \t]+";
String[] tokens;
out1 = new ArrayList();
dict = new Hashtable();
for (i=0; i1) dict{tokens[1] + "_at"} = tokens[0];
}
for (i=0; i1) out1.add( tokens[0] + "\t" + dict{tokens[0]} + "\t" + tokens[1]);
} |
Transcriptomics_Data |
stringconstant |
An NSC11 dataset in GEO Valuehttp://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?view=data&acc=GSM1098404&id=13672&db=GeoDb_blob95 |
Proteomics_Data |
stringconstant |
An NSC11 PRIDE dataset in pepXML Valuehttp://www.ebi.ac.uk/pride/archive/files/21260108 |
Get_Web_Page_from_URL |
localworker |
Scriptif ((url == void) || (url == null)) {
throw new RuntimeException("The url must be specified");
}
URL inputURL = null;
if (base != void) {
inputURL = new URL(new URL(base), url);
}
else {
inputURL = new URL(url);
}
URLConnection con = inputURL.openConnection();
InputStream in = con.getInputStream();
StringBuffer result = new StringBuffer();
BufferedReader reader;
String encoding = con.getContentEncoding();
if (encoding == null) {
reader = new BufferedReader(new InputStreamReader(in));
} else {
reader = new BufferedReader(new InputStreamReader(in, encoding));
}
String line = null;
String NEWLINE = System.getProperty("line.separator");
while ((line = reader.readLine()) != null) {
result.append(line);
result.append(NEWLINE);
}
reader.close();
contents = result.toString();
|
Import_into_R_and_Plot |
rshell |
Scriptcombined_data<-in1;
A<-matrix(unlist(strsplit(combined_data,"\t")), nrow = length(combined_data), ncol = 4, byrow = TRUE);
genes<-as.numeric(A[,3]);
proteins<-as.numeric(A[,4]);
png(filename="comparison.png", height=400, width=400, bg="white");
plot(genes, proteins, xlab="gene expression", ylab="protein spectral count");
dev.off();
R Serverlocalhost:6311 |
SpreadsheetImport |
spreadsheet |
|
Beanshells (3)
Name |
Description |
Inputs |
Outputs |
Spectral_Counting |
|
in1
|
out1
|
Merge_Gene_Expression_and_Spectral_Counts |
|
in1
in2
|
out1
|
Gene_Expression_with_UniProt_Accessions |
|
in1
in2
|
out1
|
Outputs (3)
Name |
Description |
proteins |
|
genes |
|
comparison |
|
Datalinks (14)
Source |
Sink |
Line_Break:value |
Split_string_into_string_list_by_regular_expression:regex |
UniProt_Accession2GeneID |
Split_string_into_string_list_by_regular_expression:string |
Get_Web_Page_from_URL:contents |
XPath_Service:xml_text |
XPath_Service:nodelist |
Spectral_Counting:in1 |
Gene_Expression_with_UniProt_Accessions:out1 |
Merge_Gene_Expression_and_Spectral_Counts:in1 |
Spectral_Counting:out1 |
Merge_Gene_Expression_and_Spectral_Counts:in2 |
Split_string_into_string_list_by_regular_expression:split |
Gene_Expression_with_UniProt_Accessions:in1 |
SpreadsheetImport:A |
Gene_Expression_with_UniProt_Accessions:in2 |
Proteomics_Data:value |
Get_Web_Page_from_URL:url |
Merge_Gene_Expression_and_Spectral_Counts:out1 |
Import_into_R_and_Plot:in1 |
Transcriptomics_Data:value |
SpreadsheetImport:fileurl |
Import_into_R_and_Plot:proteins |
proteins |
Import_into_R_and_Plot:genes |
genes |
Import_into_R_and_Plot:comparison |
comparison |
Uploader
License
All versions of this Workflow are
licensed under:
Version 2 (latest)
(of 2)
Credits (1)
(People/Groups)
Attributions (0)
(Workflows/Files)
None
Shared with Groups (1)
Featured In Packs (0)
None
Log in to add to one of your Packs
Attributed By (1)
(Workflows/Files)
Favourited By (0)
No one
Statistics
Other workflows that use similar services
(0)
There are no workflows in myExperiment that use similar services to this Workflow.
Comments (0)
No comments yet
Log in to make a comment