Name |
Type |
Description |
Read_Text_File |
localworker |
This shell reads the X!Tandem/PeptideProphet results to feed the XPath extractors below. ScriptBufferedReader getReader (String fileUrl, String encoding) throws IOException {
InputStreamReader reader;
try {
if (encoding == null) {
reader = new FileReader(fileUrl);
} else {
reader = new InputStreamReader(new FileInputStream(fileUrl),encoding);
}
}
catch (FileNotFoundException e) {
// try a real URL instead
URL url = new URL(fileUrl);
if (encoding == null) {
reader = new InputStreamReader (url.openStream());
} else {
reader = new InputStreamReader (url.openStream(), encoding);
}
}
return new BufferedReader(reader);
}
StringBuffer sb = new StringBuffer(4000);
if (encoding == void) {
encoding = null;
}
BufferedReader in = getReader(fileurl, encoding);
String str;
String lineEnding = System.getProperty("line.separator");
while ((str = in.readLine()) != null) {
sb.append(str);
sb.append(lineEnding);
}
in.close();
filecontents = sb.toString();
|
Extract_Peptides |
xpath |
This XPath extracts the amino acid sequences of the identified peptides. Xpath Expression/default:msms_pipeline_analysis/default:msms_run_summary/default:spectrum_query/default:search_result/default:search_hit/@peptide |
Extract_Scan_Numbers |
xpath |
This XPath extracts the corresponding scan numbers for the identified peptides. Xpath Expression/default:msms_pipeline_analysis/default:msms_run_summary/default:spectrum_query/@start_scan |
Join_and_Insert_Tabs |
beanshell |
This Beanshell formats the data for RTCalc. ScriptString temp_training = new String();
String temp_validation_flat = new String();
for (i=0; i |
RTCalc_Train |
externaltool |
This component trains the RTCalc predictor. |
RTCalc_Predict |
externaltool |
This tool applies the RTCalc predictor. |
Compare_Z_Scores_and_Probabilities |
rshell |
This Rshell script calculates and plots RTCalc retention time Z-scores for all peptides in the pepXML file.
There is no probability cutoff for the training of RTCalc, so for datasets with many false peptide
identifications (or peptide-spectrum matches of low probability), a selection should be in the XPaths, in
the Join_and_Insert_Tabs component or in an additional component at any point before the
RTCalc_Train call. Scriptfile_split=unlist(strsplit(measured, split="\\n")) # split data into lines
peptides=lapply(strsplit(file_split,"\\t"), function(x) x[1])# create string vector using every second element
filtered_peptide_list<-"peptide\tprobability";
measured_RT=as.numeric(lapply(strsplit(file_split,"\\t"), function(x) x[2]))# create string vector using every second element
file_split=unlist(strsplit(predicted, split="\\n")) # split data into lines
predicted_RT=as.numeric(lapply(strsplit(file_split,"\\t"), function(x) x[2]))# create string vector using every second element
file_split=unlist(strsplit(probabilities, split="\\n")) # split data into lines
probabilities_P=as.numeric(lapply(strsplit(file_split,"\\t"), function(x) x[1])) # create string vector using every first element
rmsd2<-sqrt(sum(((measured_RT-predicted_RT)^2)/length(measured_RT)))
Z_score<-(measured_RT-predicted_RT)/rmsd2;
png("Z_scores_vs_probabilities.png", width=700, height=700); # make PNG plot
plot(log10(1-probabilities_P), Z_score, pch=20, xlab="log10(1-probability)", ylab="RTCalc Z-score");
for(i in 1:length(probabilities_P)) {
if (abs(Z_score[i])>max_abs_Z_score) points(log10(1-probabilities_P[i]), Z_score[i], pch=4, col="red")
if (abs(Z_score[i])<=max_abs_Z_score) {
new_line<-paste(peptides[i], probabilities_P[i], sep = "\t");
filtered_peptide_list<-paste(filtered_peptide_list, new_line ,sep = "\n");
}
}
temp<-dev.off();
R Serverlocalhost:6311 |
Extract_Probabilities |
xpath |
This XPath extracts the corresponding PeptideProphet probabilities for the identified peptides. Xpath Expression/default:msms_pipeline_analysis/default:msms_run_summary/default:spectrum_query/default:search_result/default:search_hit/default:analysis_result/default:peptideprophet_result/@probability |
Make_Table |
beanshell |
This Beanshell formats the probabilities for the Rshell below. Script//can be removed, connecting the nodelist to a numerical vector input
String temp_probabilities = new String();
for (i=0; i |
Comments (0)
No comments yet
Log in to make a comment