Search CABRI human and animal cell lines catalogues by cell line name and retrieve full cell line descriptions
Created: 2010-01-15 16:26:41
Last updated: 2010-01-15 16:26:59
This workflow takes the cell line name and the catalogue(s) name(s) as input and retrieve the full cell line description(s) by first retrieving the cell lines' unique IDs associated with the input (done via a call to the getCellLineIdsByName web service) and then using IDs for retrieving the full cell lines descriptions (done via a call to the getCellLinesByIds web service).
Preview
Run
Run this Workflow in the Taverna Workbench...
Workflow Components
Inputs (2)
Name |
Description |
libs |
This input includes the name(s) of the CABRI human and animal cell lines catalogues involved in the query.
Multiple values can be specified, in a unique string field, each name in a distinct text line (thus, names
must be divided by a '\n' character)
As of Feb 15, 2005, possible values are:
- 'iclc' (i.e., the Interlab Cell Line Collection, www.iclc.it)
- 'ecacc_cell' (i.e., the European Collection of Cell Cultures, www.ecacc.org.uk)
- 'dsmz_mutz' (i.e., the collection of human and animal cell cultures of the DSMZ, www.dsmz.de)
Catalogues can be added (or, rarely, removed) without notice. See www.cabri.org for further information. |
name |
The input must specify the name of the required cell line(s). Due to the indexing rules in the CABRI network
service (see www.cabri.org), only one word can be used in the search and no spaces are allowed in the cell
line name.
<br>
Moreover:
<br> cell lines names cannot include the following characters: '/','-' and '*',
<br> cell lines names are case insensitive.
<br>
Example of valid cell lines names are:
<br>vero
<br>hela
<br>a172
<br>calu6 |
Processors (8)
Name |
Type |
Description |
Text_lines_separator |
stringconstant |
This string ('\n') specify that cell lines IDs are separated by a end of line character.
It is used as a regex separator string to move more cell lines IDs from one text string into a list of strings. |
Substitute_blanks_with_SP |
beanshell |
This is a trivial beanshell script that substitute all occurrences of blank spaces in a string with the substring '_SP_' .
It is used with the aim of adapting a string including blank spaces to an SRS search made through a remote access to
the wgetz application. |
Regex_for_catalogue_name_extraction |
stringconstant |
This regular expression specifies the format of a cell line id in the CABRI network service.
This regular expression - ([A-Z_]*:)([ A-Z0-9-]*) - specifies that:
- the first part of the ID is the catalogue name and is made up of one or more uppercase letters and underscore characters
folowed by a semicolon (':')
- the second part of the ID is the code of the resource and can includes letter, numbers, spaces and dashes ('-'). |
Group_for_catalogue_name_extraction |
stringconstant |
This data specifies that the catalogue code is the second part of the ID (wrt the regular expression specified by the
'Regex_for_catalogue_name_extraction' string. |
Separate_cell_line_ids |
local |
This processor implements the following local string elaboration.
IDs which are returned by the getCellLineIdsByName call are included in a plain text string.
They must be transformed in a list of strings.
This task is implemented by using a Split_string_into_string_list_by_regular_expression local processor. |
Extract_ids_by_removing_catalogues_names |
local |
This processor implements the following local string elaboration.
IDs which are returned by the call to getCellLineIdsByName include catalogue's name and cell line code in the catalogue.
The catalogue's code must be removed before the utilization of Ids for further processing.
This task is implemented by using a Filter_list_of_strings_extracting_match_to_a_regex local processor. |
getCellLinesById |
soaplabwsdl |
Get cell line descriptions by Id from CABRI human and animal cell lines catalogues (see www.cabri.org) |
getCellLineIdsByName |
soaplabwsdl |
Get cell lines' IDs by name from CABRI human and animal cell lines catalogues (see www.cabri.org) |
Beanshells (1)
Name |
Description |
Inputs |
Outputs |
Substitute_blanks_with_SP |
This is a trivial beanshell script that substitute all occurrences of blank spaces in a string with the substring '_SP_' .
It is used with the aim of adapting a string including blank spaces to an SRS search made through a remote access to
the wgetz application. |
input_str
|
output_str
|
Outputs (1)
Name |
Description |
cell_line_des |
The cell line description is a flat file record from the CABRI human and animal cell lines catalogues.
<br>
Within CABRI (see www.cabri.org), a standardization effort has been carried out in order to achieve common descriptions
for biological resources from different collections (and thus different databases). Three data sets have been defined
for each biological material (cell lines, bacteria strains, plasmids, etc...).
<br>
The Minimum Data Set (MDS) includes those information that are needed to univoquely identify the resource.
The Recommeded Data Set (RDS) includes those information that are needed in order to achieve a good scientific description
of the characteristics and properties of the resource.
The Full Data Set (FDS) includes all the remaining information that can also be collection specific.
<br>
The human and animal cell lines data sets include the following information:
SET Field name
MDS Accession_number
MDS Cell_line_name
MDS Brief_description
MDS Description
MDS Depositor
MDS Bibliographic_references
MDS Morphology
MDS Culture_conditions
MDS Viruses
MDS Properties
MDS Release_conditions
MDS Hazard
RDS Passage_number
RDS Species_validation
FDS Tumorigenicity (ICLC)
FDS Karyology (ICLC)
FDS Freezing_medium (ICLC)
FDS Sterility (ICLC)
FDS Validation_assays (ICLC)
FDS Further_bibliography (ICLC)
FDS Comments (ICLC)
FDS Storage (DSMZ_MUTZ)
FDS Doubling_time (DSMZ_MUTZ)
FDS Mycoplasma (DSMZ_MUTZ)
FDS Fingerprint (DSMZ_MUTZ)
FDS Cytogenetics (DSMZ_MUTZ)
FDS Karyotype (ECACC_CELL)
FDS Comments (ECACC_CELL)
FDS Research_council_deposit (ECACC_CELL)
FDS BIOMED_1 (ECACC_CELL)
<br>
The following is an example of a cell lines description:
Accession_number ICLC ATL95005
Cell_line_name Vero
Brief_description Species: monkey, African green; Tissue: kidney
Description Species: monkey, African green adult; Tissue: kidney
Depositor Obtained from ECACC.
Bibliographic_reference Nippon Rinsho 1963;21:1209
Morphology fibroblast, grown as monolayer
Culture_conditions continuous culture; DMEM + 10% FBS + 2mM L-Glutamine; split confluent cultures 1:5-1:20 using trypsin/EDTA; seed at 1-3x104 cells/cm2. 37C, 5% CO2.
Viruses Search for viruses was not performed
Properties indicator line for mycoplasma testing; virology; virus titration; virus replication; plaque assays; bacterial cytotoxicity.; Susceptible to: poliovirus 3, echovirus, reovirus, rubella, herpes simplex, arbovirus, adenovirus 12, influenza, paramyxovirus, Getah arbovirus, orthomixovirus, African swine fever virus, bluetongue virus, bovine leucosis, orbivirus, porcine epidemic diarrhea virus
Release_conditions Cell line available for distribution. For non-commercial investigative use only
Hazard -
Species_validation Validated by isoenzymes: confirmed as monkey with AST, MD, NP, PepB
Karyology modal number 58
Freezing_medium Culture medium + 50% FBS + 10% DMSO
Sterility mycoplasma negative, HOECHST and PCR
Further_bibliography Kitasato Arc Exp Med 1964;37:27-42 [PMID: 5833688] <BR> Nature 1967;216:271 <BR> Proc Soc Exp Biol Med 1967;125:602 [PMID: 4961492] |
Links (11)
Source |
Sink |
libs |
getCellLineIdsByName:libs |
name |
getCellLineIdsByName:name |
libs |
getCellLinesById:libs |
getCellLinesById:result |
cell_line_des |
getCellLineIdsByName:result |
Separate_cell_line_ids:string |
Separate_cell_line_ids:split |
Extract_ids_by_removing_catalogues_names:stringlist |
Regex_for_catalogue_name_extraction:value |
Extract_ids_by_removing_catalogues_names:regex |
Text_lines_separator:value |
Separate_cell_line_ids:regex |
Group_for_catalogue_name_extraction:value |
Extract_ids_by_removing_catalogues_names:group |
Extract_ids_by_removing_catalogues_names:filteredlist |
Substitute_blanks_with_SP:input_str |
Substitute_blanks_with_SP:output_str |
getCellLinesById:id |
Uploader
License
All versions of this Workflow are
licensed under:
Version 1
(of 1)
Credits (1)
(People/Groups)
Attributions (0)
(Workflows/Files)
None
Shared with Groups (0)
None
Featured In Packs (0)
None
Log in to add to one of your Packs
Attributed By (0)
(Workflows/Files)
None
Favourited By (0)
No one
Statistics
Other workflows that use similar services
(0)
There are no workflows in myExperiment that use similar services to this Workflow.
Comments (0)
No comments yet
Log in to make a comment