Re: [Rdkit-discuss] Explicit H in substructure searches
Open-Source Cheminformatics and Machine Learning
Brought to you by:
glandrum
|
From: Paolo T. <pao...@gm...> - 2019-11-05 22:48:44
|
Hi Markus, I tried to put together a comprehensible explanation in his gist: https://gist.github.com/ptosco/1088937ce332bd66c999a2a5fbc855b3 Please also refer to the following threads on the mailing list: https://sourceforge.net/p/rdkit/mailman/message/29679834/ https://sourceforge.net/p/rdkit/mailman/message/36696340/ and to this blog post by Roger Sayle: https://nextmovesoftware.com/blog/2013/02/27/explicit-and-implicit-hydrogens-taking-liberties-with-valence/ for further clarifications. Cheers, p. On 05/11/2019 19:52, Markus Heller wrote: > > Hi, > > I’m trying to understand how to properly use explicit hydrogens in > substructure searches. Below is an example. I would like to find all > molecules that contain my query with hydrogens at the nitrogens, and I > thought I was on the right track … Why does the first query with the > explicit H not match m1? > > Thanks > > Markus > > <code> > > from rdkit import Chem > > from rdkit.Chem.Draw import IPythonConsole > > from rdkit.Chem import rdDepictor > > rdDepictor.SetPreferCoordGen(True) > > IPythonConsole.ipython_useSVG = True > > m1 = Chem.MolFromSmiles('c1cn[nH]c1N') > > m2 = Chem.MolFromSmiles('CNc1ccn[nH]1') > > m3 = Chem.MolFromSmiles('Nc1ccnn(C)1') > > # do not remove explicit H > > params = Chem.SmilesParserParams() > > params.removeHs=False > > query = Chem.MolFromSmiles('c1cn[nH]c(N([H])([H]))1', params) > > # first should be True, but all are False > > m1.HasSubstructMatch(query) > > m2.HasSubstructMatch(query) > > m3.HasSubstructMatch(query) > > # rebuild query with explicit H removed, not what I want > > query = Chem.MolFromSmiles('c1cn[nH]c(N([H])([H]))1') > > m1.HasSubstructMatch(query) > > m2.HasSubstructMatch(query) > > m3.HasSubstructMatch(query) > > </code> > > -- > > *Markus Heller, PhD* > > Senior Scientist > > Direct:604.827.1122 Main:604.827.1147 > > A027228F > > 2405 WesbrookMall,4th Floor,Vancouver,BCV6T 1Z3 > > Thisemail and any attachments theretomay > containconfidentialmaterialforthesoleuse of theintended > recipient.Anyreview,copying,or distribution of thisemail (or any > attachments thereto)by others is strictly prohibited. If youare not > theintended recipient,please contact thesenderimmediately and > permanently delete theoriginal and any copiesof thisemail and any > attachments thereto. > > > > _______________________________________________ > Rdkit-discuss mailing list > Rdk...@li... > https://lists.sourceforge.net/lists/listinfo/rdkit-discuss |