Tag Results

Items tagged with "text" (36)

Note: some items may not be visible to you, due to viewing permissions.


Files (10)
Uploader

Blob Pathway Cosine Scores from Day7 and Tir1 QTL

Created: 2009-08-10 15:55:24

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This excel file contains a list of all pathways found to be differentially expressed at day 7 post infection in the trypanosomiasis resistance phenotype, which contain genes in the Tir1 QTL. The pathways in this file have been ranked according to the scores obtained after calculating a cosine vector value against the trypanosomiasis resistance phenotype. The higher the score, the more closely linked to a phentype a given pathway is. This allows each pathway to be ranked giving biologists a ...

File type: Excel workbook

Comments: 0 | Viewed: 87 times | Downloaded: 0 times

Tags:

Uploader

Blob Gene Cosine Scores

Created: 2009-08-10 16:00:45

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This excel file contains a list of genes linked to the resistance to African trypanosomiasis in the mouse. Genes from the Tir1 QTL were used in a search through PubMed. The results were then correlated to the trypanosomiasis resistance phenotype. The higher the score (and ranking) the more related to the phenotype the gene is likely to be. This is based on the co-occurrence of terms within the gene and phentoype corpora.

File type: Excel workbook

Comments: 0 | Viewed: 98 times | Downloaded: 0 times

Tags:

Uploader

Blob Phenotype Abstracts for Trypanosomiasis Resistance

Created: 2009-08-11 12:45:24

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This file contains a list of published abstracts from MEDLINE, that are related to the African Trypanosomiasis resistance phentoype in the mouse. The term used in the PubMed search was: trypanosom* AND (tolerance OR resistance) . The workflow limited the date of the search using PubMed between 31/12/2005 to 01/01/2009, and was restricted to 500 abstracts.

File type: Plain text

Comments: 0 | Viewed: 58 times | Downloaded: 0 times

Tags:

Uploader

Blob Phenotype Concept Profile - Terms

Created: 2009-08-11 13:05:07 | Last updated: 2009-08-11 13:06:51

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This file contains a list of all terms extracted from the phenotype corpus, relating to African trypanosomiasis resistance in the mouse model. These terms were extracted using the following service: http://gopubmed4.biotec.tu-dresden.de/GoPubMedTermGenerationService/services/GoPubMedTermGeneration?wsdl These terms represent the concept profile for the phenotype.

File type: Plain text

Comments: 0 | Viewed: 179 times | Downloaded: 0 times

Tags:

Uploader

Blob Phenotype Term Counts (in Phenotype Corpus)

Created: 2009-08-11 13:34:42 | Last updated: 2009-08-11 13:58:28

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This file contains a count of each phenotype term extracted from corpus of phenotype abstracts. Each value represents the number of articles in the phenotype corpus the term appears. The use of this file is to calculate a cosine vector score for correlating a given concept (e.g. pathway or gene) with a phenotype.

File type: Plain text

Comments: 0 | Viewed: 76 times | Downloaded: 0 times

Tags:

Uploader

Blob Pathway Abstracts for Day7 Microarray Tir1 QTL

Created: 2009-08-11 14:08:41 | Last updated: 2009-08-11 14:15:58

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This file contains all the abstracts for pathways found to be differentially expressed at day 7 post infection and intersect the Tir1 QTL region, from the African Trypanosomiasis project. Each pathway is listed as ">> [Pathway Name]", together with a PubMed identifier, date, and abstract for each article. Each pathway has been restricted to 500 abstracts, and is given in the date range 31/12/2007 to 01/01/2009. Note, some pathways do not have any abstracts available due to th...

File type: Plain text

Comments: 0 | Viewed: 80 times | Downloaded: 0 times

Tags:

Uploader

Blob Pathway Term Enrichment Scores

Created: 2009-08-11 14:23:49

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This file contains a list of each pathway identified from day 7 post infection and linked to the Tir1 QTL. With each pathway is a list of terms that are common to both pathway and phenotype corpora. These terms were ranked accoring to their enrichement scores. The higher the score, the more significant the term is in relation to correlating the pathway with the African trypanosomiasis resistance phenotype.

File type: Plain text

Comments: 0 | Viewed: 110 times | Downloaded: 0 times

Tags:

Uploader

Blob Ondex and Taverna Tutorial

Created: 2009-10-22 13:50:53 | Last updated: 2009-10-22 13:51:51

Credits: User George

License: Creative Commons Attribution-Share Alike 3.0 Unported License

Biological Data Integration Using Ondex and Taverna: A Tutorial 25/26th November 2009 The University of Manchester The Ondex SABR project (http://ondex.org/sabr.html) invite you to a two-day tutorial that aims to show participants how to use Ondex and Taverna to perform common biological data collection, integration and visualisation tasks.

File type: Word document

Comments: 0 | Viewed: 93 times | Downloaded: 60 times

Tags:

Uploader

Blob Bilateral Perisylvian Polymicrogyria (Epilepsy)

Created: 2010-12-07 16:34:31 | Last updated: 2010-12-07 16:34:37

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This zip file contains the results of running a QTL workflow for Bilateral Perisylvian Polymicrogyria in human (homo sapiens). Provided are a list of candidate QTL genes (QTg) and their corresponding KEGG pathways. Each gene and pathway have been subsequently run through a series of text mining workflows to determine the significance of each may play in relation to Bilateral Perisylvian Polymicrogyria AND/OR Epilepsy. Further to this, I have also collected the SNPs (single nucleotide...

File type: application/x-zip-compressed

Comments: 0 | Viewed: 68 times | Downloaded: 0 times

Tags:

Uploader

Blob Ondex Workflows

Created: 2011-09-01 17:17:57 | Last updated: 2011-09-01 17:18:05

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This zip file contains a large number of Taverna 2 workflows that utilise the Ondex Web Service, for manipulating Ondex graphs.

File type: ZIP archive

Comments: 0 | Viewed: 59 times | Downloaded: 0 times

Tags:

Packs (3)
Creator

Pack Pathway to Phenotype using Text Mining


Created: 2009-08-10 13:01:47 | Last updated: 2009-08-11 14:51:31

This pack contains a list of workflows and result files obtained from the analysis of candidate pathways believed to play a role in resistance to African Trypanosomiasis in the mouse model organism.

12 items in this pack

Comments: 0 | Viewed: 291 times | Downloaded: 81 times

Tags:

Creator

Pack Text Mining Workflows


Created: 2010-12-08 11:55:03 | Last updated: 2011-02-01 11:33:11

This pack contains workflows to navigate from candidate Quantitative Trait genes and pathways to a given phenotype.

5 items in this pack

Comments: 0 | Viewed: 193 times | Downloaded: 62 times

Tags:

Creator

Pack Trichuriasis induced Colitis


Created: 2011-02-16 12:49:21 | Last updated: 2011-02-16 15:26:36

This pack contains the workflows and data relating to Trichuriasis induced colitis.

5 items in this pack

Comments: 0 | Viewed: 81 times | Downloaded: 38 times

Tags:

Workflows (23)

Workflow fetch_fasta (1)

Thumb
This work flow is designed to take an EMBL file containing the genomic data for an identified bacterium. From this information the workflow can determine whether or not that this strain is an MRSA type of bug. This can be determined based on the MecA profile of the given strain. Blast is utilised to find a relationship with given proteins and that of know S. aureus strains. This phylogenic output is generated from a ClustalW algorithm that plots a phylogenic tree. The output is prese...

Created: 2009-03-20 | Last updated: 2009-03-20

Credits: User Jumblejumble

Workflow DOI Record Generator (1)

Thumb
This workflow generates DOI record files for deposit, using data set metadata for the FLOSSmole project. It reads in an input file generated from a SQL query from an eprints database, and transforms the parts of the source file as necessary to create a comprehensive DOI deposit record. It also generates DOIs for the data sets. These metadata are inserted into an XML record template (based on the std-doi.xsd schema) and the individual resources are aggregated into a single file.

Created: 2009-04-29

Credits: User Andrea Wiggins

Attributions: Workflow Data Set Metadata Generator

Workflow DOI Files (1)

Thumb
This workflow generates additional files required for handling DOI creation: the DOI URL mapping required for the DOI deposit, and a set of sql update statements to insert the DOIs into an eprints database. Note that it is extremely important for this workflow to use the same CSV file as was used with the DOI record generator, as well as the same seed number.

Created: 2009-06-05

Credits: User Andrea Wiggins

Attributions: Workflow DOI Record Generator

Workflow Cosine vector space (1)

Thumb
This workflow calculates the cosine vector space between two sets of corpora. The workflow then removes any null values from the output. The result is a cosine vector score between 0 and 1, showing the significance of any links between one concept (e.g. pathway) to another (e.g. phenotype). A score of 0 means there is no or an undetermined correlation between the two concepts. A score approaching 1 represents positive correlation.

Created: 2009-08-10 | Last updated: 2009-08-10

Credits: User Paul Fisher

Workflow Extract Scientific Terms (1)

Thumb
This workflow takes in a document containg text and removes any non-ascii characters. The cleaned text is then sent to a service in Dresden, to extract all scientific terms. These terms represent a concept profile for the input concpet. Any null values are also removed.

Created: 2009-08-10 | Last updated: 2009-08-10

Credits: User Paul Fisher

Workflow Rank Phenotype Terms (1)

Thumb
This workflow counts the number of articles in the pubmed database in which each term occurs, and identifies the total number of articles in the entire PubMed database. It also identified the total number of articles within pubmed so that a term enrichment score may be calculated. The workflow also takes in a document containing abstracts that are related to a particular phenotype. Scientiifc terms are then extracted from this text and given a weighting according to the number of terms that ...

Created: 2009-08-10

Credits: User Paul Fisher

Workflow Table Parser (1)

Thumb
This workflow parsers a table (specified by the user), into an Ondex Graph on the web server.

Created: 2009-08-19

Credits: User Paul Fisher

Workflow Split text/string into its lines and filte... (2)

Thumb
When retrieving a URL or soemthing alike, one can often identify the region of interest as a single line. Besides the expected output, also some interim values, like the lines split are forwarded, to allow some straight-forward cascading of filters with reduced redundancy.

Created: 2009-08-19

Credits: User Steffen Möller

Workflow Gene to Pubmed (3)

Thumb
This workflow takes in a list of gene names and searches the PubMed database for corresponding articles. Any matches to the genes are then retrieved (abstracts only). These abstracts are then returned to the user.

Created: 2010-07-05 | Last updated: 2011-01-26

Credits: User Paul Fisher

Workflow Phenotype to pubmed (3)

Thumb
This workflow takes in a phenotype search term, and searches for abstracts in the PubMed database. These are passed to the eSearch function and searched for in PubMed. Those abstracts found are returned to the user

Created: 2010-07-05 | Last updated: 2011-01-11

Credits: User Paul Fisher

Workflow Cosine vector space (2)

Thumb
This workflow calculates the cosine vector space between two sets of corpora. The workflow then removes any null values from the output. this is some extra text vbeing added

Created: 2010-12-08 | Last updated: 2011-01-11

Credits: User Paul Fisher

Attributions: Workflow Cosine vector space

Workflow Rank Phenotype Terms (2)

Thumb
This workflow counts the number of articles in the pubmed database in which each term occurs, and identifies the total number of articles in the entire PubMed database. It also identified the total number of articles within pubmed so that a term enrichment score may be calculated. The workflow also takes in a document containing abstracts that are related to a particular phenotype. Scientiifc terms are then extracted from this text and given a weighting according to the number of terms that ...

Created: 2010-12-08 | Last updated: 2011-01-11

Credits: User Paul Fisher

Attributions: Workflow Rank Phenotype Terms

Workflow Pathway to Pubmed (2)

Thumb
This workflow takes in a list of KEGG pathway descriptions and searches the PubMed database for corresponding articles. Any matches to the pathways are then retrieved (abstracts only). These abstracts are then returned to the user.

Created: 2010-12-08 | Last updated: 2011-01-11

Credits: User Paul Fisher

Workflow Extract Scientific Terms (2)

Thumb
This workflow takes in a document containg text and removes and non-ascii characters. The cleaned text is then sent to a service in dresden to extract all scientific terms. These terms represent a profile for the input document. Any null values are also removed.

Created: 2010-12-08 | Last updated: 2011-01-11

Credits: User Paul Fisher

Workflow Rank Phenotype Terms (1)

Thumb
This workflow counts the number of articles in the pubmed database in which each term occurs, and identifies the total number of articles in the entire PubMed database. It also identified the total number of articles within pubmed so that a term enrichment score may be calculated. The workflow also takes in a document containing abstracts that are related to a particular phenotype. Scientiifc terms are then extracted from this text and given a weighting according to the number of terms that ...

Created: 2011-02-01 | Last updated: 2011-02-01

Credits: User Paul Fisher

Attributions: Workflow Cosine vector space Workflow Rank Phenotype Terms

Workflow Remove Non-ASCII (1)

Thumb
THis workflow removes any non-ascii characters from a segment of text. Any characters that are found are removed. Letters either side f the non-ASCII are concatenated - this may cause the loss of word meaning

Created: 2011-02-03 | Last updated: 2011-02-03

Credits: User Paul Fisher

Workflow Gene to Pubmed (4)

Thumb
This workflow takes in a list of gene names and searches the PubMed database for corresponding articles. Any matches to the genes are then retrieved (abstracts only). These abstracts are then returned to the user.

Created: 2011-02-08 | Last updated: 2011-02-10

Credits: User Paul Fisher

Attributions: Workflow Cosine vector space Workflow Extract Scientific Terms Workflow Rank Phenotype Terms Workflow Cosine vector space Workflow Rank Phenotype Terms Workflow Pathway to Pubmed Workflow Extract Scientific Terms

Workflow Content based recommender (1)

Thumb
This process is a special case of the item to item similarity matrix based recommender where the item to item similarity is calculated as cosine similarity over TF-IDF word vectors obtained from the textual analysis over all the available textual data. The inputs to the process are context defined macros: %{id} defines an item ID for which we would like to obtain recommendation and %{recommender_no} defines the required number of recommendations. The process internally uses an example set of...

Created: 2011-03-15 | Last updated: 2011-03-15

Workflow Example for external tools with gzip and g... (1)

Thumb
This workflow is self-contained. It has a fixed URL from which a text file is downloaded (output as Original_file). That file is gzipped (output as Compressed_File) and then gunzipped again (output as Decompressed_File). The workflow wraps external tools from taverna.nordugrid.org and needs a beta version of Taverna 2.3 or later.

Created: 2011-05-18 | Last updated: 2011-05-18

Credits: User Steffen Möller User Alan Williams

Workflow Read file from S3 bucket (1)

Thumb
This workflow simply loads a text file that is stored in an AWS S3 bucket. It is provided as an example of how to do this, rather than be a complete, reusable solution. You need to have s3fs installed and configured with your AWS credentials to use this workflow (see http://code.google.com/p/s3fs/).

Created: 2012-08-23

Credits: User Robert Haines

Workflow Write file to S3 bucket (1)

Thumb
This workflow simply writes a text file to an AWS S3 bucket. It is provided as an example of how to do this, rather than be a complete, reusable solution. You need to have s3fs installed and configured with your AWS credentials to use this workflow (see http://code.google.com/p/s3fs/).

Created: 2012-08-23

Credits: User Robert Haines

Attributions: Workflow Read file from S3 bucket

Workflow STUDY OF QUANTIFICATION OF IMPURITIES AND ... (1)

  Bulk drug during its production process, after its scale up, it is necessary to analyse for the presence of any impurities or related substances in it. This is to ensure the impurities and related substances are within their limits as per ICH Guidelines. Required brief study        The primary objective of the study is to develop HPLC method and validate it for the detection and quantification of impurities and related substances in the manufactu...

Created: 2012-09-16

Credits: User Drkrishnasarmapathy

Workflow Text-mining using OSCAR to obtain a list o... (1)

Thumb
This service extracts chemical names from text and obtains identifiers for these names. It outputs a HTML string that can be opened in a browser providing a table of information and links to ChemSpider.Known issues - Character limit ~3000 - Unable to produce InChIs or CSID for some names - Error sometimes encountered when a trivial and systematic name for the same compound are used - Some issues with identifiers being recognised but not able to be processedrequires access ...

Created: 2013-04-18

Credits: User Michael Smith User Mark Borkum

Attributions: Workflow InChIToCSID

What is this?

Linked Data

Non-Information Resource URI: http://www.myexperiment.org/tags/497


Alternative Formats

HTML
RDF
XML