Indigo-pains
If you like this workflow, please reference our paper doi:10.1002/minf.201100076, and check the related workflows Indigo-pains-recursive, and RDKit-pains.
*** Update 20151130 - using KNIME 3 nodes and the 'RDKit' version of PAINS queries ***
Implementation of the PAINS filters[1] using the Indigo (1.1.1300.201511201230) nodes in KNIME (3.0.1). Original PAINS filters were published in SLN format. his workflow contains the SMARTS form of the filters published by Greg Landrum as part of the RDKit library[2], which are based on the original conversion by the Guha group[3]. Also distributed with a 10k reference structure set from WEHI[1] which is used by default as input if no other file is chosen.
The updated PAINS filters now remove 825 structures from the reference set. The original SLN filters remove 861.
For a faster workflow that uses a recursive loop, see Indigo-pains-recursive http://www.myexperiment.org/workflows/3683.html
This workflow continues to use only those nodes that were available at the time in 2010.
The Indigo nodes require installation of the 'community nodes' in KNIME. See https://tech.knime.org/community and http://lifescience.opensource.epam.com/indigo/knime.html
Based on simple sub-structure workflow by James Davidson
http://tech.knime.org/forum/rdkit/substructure-search-with-rdkit
Modified to use SD/SMILES file as input, or manual SMILES entry, or database search.
1. J. B. Baell and G. A. Holloway, Journal of Medicinal Chemistry, 2010, 53, 2719-2749, http://dx.doi.org/10.1021/jm901137j
2. See discussion http://sourceforge.net/p/rdkit/mailman/message/34405703/ and the file in the repository https://github.com/rdkit/rdkit/tree/master/Data/Pains
3. Rajarshi Guha. "PAINS Substructure Filters as SMARTS. 2010-11-14."
http://blog.rguha.net/?p=850. Accessed: 2010-11-14. (Archived by
WebCite® at http://www.webcitation.org/5uF4wHEpB)
Preview
Run
Not available
Workflow Components
Not available
Reviews (0)
Other workflows that use similar services (0)
There are no workflows in myExperiment that use similar services to this Workflow.
Comments (2)
Log in to make a comment
If you are having difficulty running large data sets, edit your knime.ini file and change the -Xmx value to increase the memory available to KNIME. A 300k SMILES set can be processed easily with -Xmx1500m.
The updated PAINS filters from the RDKit developer are resulting in improved results:
http://rdkit.blogspot.com/2015/08/curating-pains-filters.html
Just waiting on the nodes to be updated in KNIME before updating the workflows here, but you can grab the filters from the above link if you can't wait.