Indigo-pains

Created: 2011-06-07 00:39:29      Last updated: 2015-12-01 00:45:02

If you like this workflow, please reference our paper doi:10.1002/minf.201100076, and check the related workflows Indigo-pains-recursive, and RDKit-pains.

*** Update 20151130 - using KNIME 3 nodes and the 'RDKit' version of PAINS queries ***


Implementation of the PAINS filters[1] using the Indigo (1.1.1300.201511201230) nodes in KNIME (3.0.1). Original PAINS filters were published in SLN format. his workflow contains the SMARTS form of the filters published by Greg Landrum as part of the RDKit library[2], which are based on the original conversion by the Guha group[3]. Also distributed with a 10k reference structure set from WEHI[1] which is used by default as input if no other file is chosen.

The updated PAINS filters now remove 825 structures from the reference set. The original SLN filters remove 861.

For a faster workflow that uses a recursive loop, see Indigo-pains-recursive http://www.myexperiment.org/workflows/3683.html

This workflow continues to use only those nodes that were available at the time in 2010.


The Indigo nodes require installation of the 'community nodes' in KNIME. See https://tech.knime.org/community and http://lifescience.opensource.epam.com/indigo/knime.html

Based on simple sub-structure workflow by James Davidson
http://tech.knime.org/forum/rdkit/substructure-search-with-rdkit
Modified to use SD/SMILES file as input, or manual SMILES entry, or database search.

1. J. B. Baell and G. A. Holloway, Journal of Medicinal Chemistry, 2010, 53, 2719-2749, http://dx.doi.org/10.1021/jm901137j

2. See discussion http://sourceforge.net/p/rdkit/mailman/message/34405703/ and the file in the repository https://github.com/rdkit/rdkit/tree/master/Data/Pains

3. Rajarshi Guha. "PAINS Substructure Filters as SMARTS. 2010-11-14."
http://blog.rguha.net/?p=850. Accessed: 2010-11-14. (Archived by
WebCite® at http://www.webcitation.org/5uF4wHEpB)

Information Preview

Medium

Information Run

Not available


Information Workflow Components

Not available

Information Workflow Type

KNIME

Information Uploader

Information License

All versions of this Workflow are licensed under:

Information Version 2 (latest) (of 2)

View version:

Information Credits (1)

(People/Groups)

Information Attributions (1)

(Workflows/Files)

Information Tags (7)

Log in to add Tags

Information Shared with Groups (1)

Information Featured In Packs (0)

None

Log in to add to one of your Packs

Information Attributed By (2)

(Workflows/Files)

Information Favourited By (0)

No one

Information Statistics

 

Citations (0)

None


Version History

In chronological order:



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (2)

Log in to make a comment

  • Friday 22 July 2011 14:25:16 (UTC)

    If you are having difficulty running large data sets, edit your knime.ini file and change the -Xmx value to increase the memory available to KNIME. A 300k SMILES set can be processed easily with -Xmx1500m.

  • Tuesday 01 September 2015 04:36:43 (UTC)

    The updated PAINS filters from the RDKit developer are resulting in improved results:

    http://rdkit.blogspot.com/2015/08/curating-pains-filters.html

    Just waiting on the nodes to be updated in KNIME before updating the workflows here, but you can grab the filters from the above link if you can't wait.




Workflow Other workflows that use similar services (0)

There are no workflows in myExperiment that use similar services to this Workflow.