From: Peter Murray-R. <pm...@ca...> - 2004-03-11 19:40:58
|
[crossposted, so please be considerate in replies] Greetings to the OpenSource chemistry community, We have recently developed a method for publishing molecules on an RSS feed, CMLRSS and hope this may be of interest. CMLRSS consists of a standard RSS news feed with CML embedded in those items which contain chemistry. When viewed with a CML-aware client such as Jmol or JChempaint with the rssviewer plugin the molecule appears in appropriate form (2D or 3D). This is a simple method to provide an Open feed of molecules of all types. Details are at http://wwmm.ch.cam.ac.uk/moin/CmlRss We would first like to thank the OpenSource chemistry community for contributing much to this. The viewers (Jmol/JChempaint) are in Java, and many of the molecules have been converted using OpenBabel and/or CDK. The WorldWideMolecularMatrix and CMLRSS are being presented at the ACS meeting in Anaheim http://oasys2.confex.com/acs/227nm/techprogram/P725453.HTM. We append details below. We invite members of the OpenSource chemistry community and others to contribute feeds of Open molecules. Tools to create feeds are available on our site and we intend to add CMLRSS output to CDK, OpenBabel, etc. (If you can write CML then it's easy to add the <item> information for CMLRSS). If you have molecules, with or without ancillary information, you wish to make available see http://wwmm.ch.cam.ac.uk/moin/CmlRss. We already have proof of concept for molecules, properties, spectra (no viewer yet) and crystallography. The feeds do not have to be large - but the entries should be interesting or useful in some way. Note that 2D, 3D, 5D (2D+3D) and 8D (2D, 3D +fractional) coordinate systems are all allowable. The system is designed to allow aggregation of molecules (e.g. in a Xindice database) and to be re-used. For example the output of one feed could be input to another service such as JOELib which computes properties and descriptors. This is still under construction (V0.2) PeterMR Announcement =========== As part of a session on "Research Collaboratories, Virtual laboratories and Grid Computing" at the forthcoming ACS meeting (http://oasys2.confex.com/acs/227nm/techprogram/ ) we are presenting on "Semantic Grid computing - the WorldWideMolecularMatrix" Part of this will focus on the theme of RSS which I posted about last year. We have now developed a chemical component to this, now christened CMLRSS, and it is the purpose of this email to invite people if they wish, to join in to this "virtual laboratory", and hence "participate" in the ACS session. The procedure is as follows a) For an introduction to the concept, see eg http://www.ch.ic.ac.uk/rzepa/cmlrss_distrib/ or http://wwmm.ch.cam.ac.uk/moin/CmlRss b)We have prepared a distribution kit available either from http://www.ch.ic.ac.uk/rzepa/cmlrss_distrib/ or from http://wwmm.ch.cam.ac.uk/moin/RssClient We emphasize that whilst we believe this to function correctly, it has not yet been comprehensively tested on all platforms, and some components are still at the "late alpha", or "early beta" stage. Needless to say, we welcome comments and reports of bugs! c) You can view the existing CMLRSS grid by installing either Jmol (a 3D molecule viewer) or JChempaint (a 2D editor) as provided, along with the CMLRSS plugin in the kit (these are Java programs so you will have to have Java installed). By running this, you will open around 8 existing RSS "Channels" and hence subscribe to chemical feeds from these sites. You can if you wish aggregate these feeds into one, then filter them down by (e.g. element). You will notice several themes in the existing channels, including an aggregation of the long running "Molecules of the Month" sites, several database sites (NMR, X-Ray, ChemStock) and the World-Wide Molecular Matrix. d) You are also welcome to follow the instructions in the above distribution to create your own CMLRSS channel (I might add that several of the sites mentioned above were our beta testers for this kit; it took only one or two days to get up and running). If you do so, please let us know so that we can add you to the list (one of the many "todo" items is to organise the addition of further channels in a better way). I should end with the observation that as far as we know, this is the furthest that the RSS concept has been pushed into a science field, and another step towards the "semantic web" espoused by Sir Tim Berners-Lee. We would of course love to hear about your experiences with this system. -- Henry Rzepa. +44 (020) 7594 5774 (Voice); +44 (0870) 132 3747 (eFax) http://www.ch.ic.ac.uk/rzepa/ Dept. Chemistry, Imperial College London, SW7 2AZ, UK. Peter Murray-Rust Unilever Centre for Molecular Informatics Chemistry Department, Cambridge University Lensfield Road, CAMBRIDGE, CB2 1EW, UK Tel: +44-1223-763069 |
From: Joerg K. W. <we...@in...> - 2004-03-12 09:58:07
|
Hi all, very fine ! > [crossposted, so please be considerate in replies] So, do we need a cross-platform mailing list ?:-} Can a RSS channel used for such things ? > We already have proof of concept for molecules, properties, spectra (no > viewer yet) and crystallography. For a Java based visualization i would recommend the complex SGT: http://www.epic.noaa.gov/java/sgt/ I've already used it for the primitive visualization of the JCAMP-DX format in JOELib (only developer interface for uncompressed formats !). No JCAMP-CS support, which could be something for OpenBabel/JOELib. > For example the output of one feed > could be input to another service such as JOELib which computes > properties and descriptors. This is still under construction (V0.2) Great ! > the purpose of this email to invite people if they wish, to join in to > this "virtual laboratory", and hence "participate" in the ACS session. I've not the money to join :-( But i'm going to the 1. Chemoinformatics Conference in Sheffield, U.K., 21-23 April ... 2. Analytica Conference Munich, Germany, 12-14 May ... ... if anyone of you will be there i would be happy to discuss any chemoinformatics topics in more detail. Regards, Joerg -- Dipl. Chem. Joerg K. Wegner Center of Bioinformatics Tuebingen (ZBIT) Department of Computer Architecture Univ. Tuebingen, Sand 1, D-72076 Tuebingen, Germany Phone: (+49/0) 7071 29 78970 Fax: (+49/0) 7071 29 5091 E-Mail: mailto:we...@in... WWW: http://www-ra.informatik.uni-tuebingen.de -- Never mistake motion for action. (E. Hemingway) Never mistake action for meaningful action. (Hugo Kubinyi,2004) |
From: Peter Murray-R. <pm...@ca...> - 2004-03-12 10:28:51
|
At 10:39 12/03/2004 +0100, Joerg K. Wegner wrote: >Hi all, > >very fine ! > >>[crossposted, so please be considerate in replies] >So, do we need a cross-platform mailing list ?:-} It could be that we need a chemical developers blog. >Can a RSS channel used for such things ? Yes. In general RSS is read-only so it would be coupled to a mailing list of some sort (Henry?) >>We already have proof of concept for molecules, properties, spectra (no >>viewer yet) and crystallography. >For a Java based visualization i would recommend the complex SGT: >http://www.epic.noaa.gov/java/sgt/ > >I've already used it for the primitive visualization of the JCAMP-DX >format in JOELib (only developer interface for uncompressed formats !). No >JCAMP-CS support, which could be something for OpenBabel/JOELib. Christoph, Stefan and I are working on a CMLSpect viewer. We already have JCAMP2CML. We want to avoid the posting of JCAMP as it breaks the XML model. However it would be possible to convert CMLSpect to JCAMP >>For example the output of one feed could be input to another service such >>as JOELib which computes properties and descriptors. This is still under >>construction (V0.2) >Great ! ... mainly at the design stage:-) The main technical question is how one merges new properties into an old molecule. This is a middleware problem. I think it's most like to be relevant to servers at present. "Add your CML molecule and get some more properties. We already do this for MOPAC. I think that JOELib could easily be factored in - best if someone already had a running protocol. Thus if you had a clogp module already running as something like: JOEMol j = new JOEMOl(cmlMolecule); double logp = j.getLogP(); then it would be easy to incorporate. I haven't had time to look at how to code JOELib and it is easiest if we have high-level routines at this level, especially cml2JOE and joe2CML. I have done this for CDK by passing XML strings. I know this is inefficient but ultimately we need to work at this API level. I therefore have just two calls: cdkMolecule = getCDKMolecule(cmlMolecule) cdkMolecule.doSomCalculation() newCmlMolecule = getCMLMolecule(cdkMolecule) merge(cmlMolecule, newCMLMolecule) and we could do the same for JOELIB. >>the purpose of this email to invite people if they wish, to join in to >>this "virtual laboratory", and hence "participate" in the ACS session. >I've not the money to join :-( But i'm going to the >1. Chemoinformatics Conference in Sheffield, U.K., 21-23 April ... >2. Analytica Conference Munich, Germany, 12-14 May ... >... if anyone of you will be there i would be happy to discuss any >chemoinformatics topics in more detail. YY will be presenting WWMM at Munich P. >Regards, Joerg > >-- >Dipl. Chem. Joerg K. Wegner >Center of Bioinformatics Tuebingen (ZBIT) >Department of Computer Architecture >Univ. Tuebingen, Sand 1, D-72076 Tuebingen, Germany >Phone: (+49/0) 7071 29 78970 >Fax: (+49/0) 7071 29 5091 >E-Mail: mailto:we...@in... >WWW: http://www-ra.informatik.uni-tuebingen.de >-- >Never mistake motion for action. > (E. Hemingway) > >Never mistake action for meaningful action. > (Hugo Kubinyi,2004) Peter Murray-Rust Unilever Centre for Molecular Informatics Chemistry Department, Cambridge University Lensfield Road, CAMBRIDGE, CB2 1EW, UK Tel: +44-1223-763069 |
From: Egon W. <eg...@sc...> - 2004-03-13 13:13:47
|
On Friday 12 March 2004 11:07, Peter Murray-Rust wrote: > At 10:39 12/03/2004 +0100, Joerg K. Wegner wrote: > >very fine ! > > > >>[crossposted, so please be considerate in replies] > > > >So, do we need a cross-platform mailing list ?:-} > > It could be that we need a chemical developers blog. > > >Can a RSS channel used for such things ? > > Yes. In general RSS is read-only so it would be coupled to a mailing list > of some sort (Henry?) I'm not that clear with the blog concept: a website of someone keeping a diary?, or a wiki like communal website? Anyway, I agree that it is time to organize all chemistry open source initiative and come up with solutions which many projects deal with, e.g. atom typing and interoperability... > >>We already have proof of concept for molecules, properties, spectra (no > >>viewer yet) and crystallography. > > > >For a Java based visualization i would recommend the complex SGT: > >http://www.epic.noaa.gov/java/sgt/ > > > >I've already used it for the primitive visualization of the JCAMP-DX > >format in JOELib (only developer interface for uncompressed formats !). No > >JCAMP-CS support, which could be something for OpenBabel/JOELib. > > Christoph, Stefan and I are working on a CMLSpect viewer. We already have > JCAMP2CML. We want to avoid the posting of JCAMP as it breaks the XML > model. However it would be possible to convert CMLSpect to JCAMP > > >>For example the output of one feed could be input to another service such > >>as JOELib which computes properties and descriptors. This is still under > >>construction (V0.2) > > > >Great ! > > ... mainly at the design stage:-) The main technical question is how one > merges new properties into an old molecule. This is a middleware problem. I > think it's most like to be relevant to servers at present. "Add your CML > molecule and get some more properties. We already do this for MOPAC. I > think that JOELib could easily be factored in - best if someone already had > a running protocol. Thus if you had a clogp module already running as > something like: > > JOEMol j = new JOEMOl(cmlMolecule); > double logp = j.getLogP(); > > then it would be easy to incorporate. I haven't had time to look at how to > code JOELib and it is easiest if we have high-level routines at this level, > especially cml2JOE and joe2CML. I have done this for CDK by passing XML > strings. I know this is inefficient but ultimately we need to work at this > API level. I therefore have just two calls: > > cdkMolecule = getCDKMolecule(cmlMolecule) > cdkMolecule.doSomCalculation() > newCmlMolecule = getCMLMolecule(cdkMolecule) > merge(cmlMolecule, newCMLMolecule) > > and we could do the same for JOELIB. CDK has libio to JOELib... so without the need to write anything (except the getJOELibMolecule() wrapper), you can already do: joelibMol = getJOELibMolecule(getCDKMolecule(cmlMolecule)) joelibMol.doSomeCalc() newCMLMol = getCMLMolecule(getCDKMolecule(joelibMol)) merge(cmlMolecule, newCMLMolecule) BTW, Joerg, it would be very nice if you could have a CMLRSS set up that takes all current other feeds, calculates properties and puts that only as CMLRSS again. Egon |
From: Peter Murray-R. <pm...@ca...> - 2004-03-13 15:48:33
|
> > > > Yes. In general RSS is read-only so it would be coupled to a mailing list > > of some sort (Henry?) > >I'm not that clear with the blog concept: a website of someone keeping a >diary?, or a wiki like communal website? Yes and yes. It's a writable website with standard HTTP tools and often open to all. RSS is the common way of aggregating them. I haven't looked recently but I expect that Slashdot would be regarded as a blog. >Anyway, I agree that it is time to organize all chemistry open source >initiative and come up with solutions which many projects deal with, e.g. >atom typing and interoperability... I think the NeSC meeting will help us a great deal.http://www.nesc.ac.uk/esi/events/394/ > > cdkMolecule = getCDKMolecule(cmlMolecule) > > cdkMolecule.doSomCalculation() > > newCmlMolecule = getCMLMolecule(cdkMolecule) > > merge(cmlMolecule, newCMLMolecule) > > > > and we could do the same for JOELIB. > >CDK has libio to JOELib... so without the need to write anything (except the >getJOELibMolecule() wrapper), you can already do: > >joelibMol = getJOELibMolecule(getCDKMolecule(cmlMolecule)) >joelibMol.doSomeCalc() >newCMLMol = getCMLMolecule(getCDKMolecule(joelibMol)) >merge(cmlMolecule, newCMLMolecule) Excellent. For myself it would be very useful to have these examples as simple standalone example programs (I have to admit I get lost looking for functionality in libraries, including my own!). So it's a excellent opportunity for volunteers to write sample programs. As an example I haven't used JOELib to calculate cLogp simply because I haven't had the time to explore it. >BTW, Joerg, it would be very nice if you could have a CMLRSS set up that >takes >all current other feeds, calculates properties and puts that only as CMLRSS >again. This would be fantastic. Note - at this stage I recommend that all aggregation should be manual. This is to avoid robots out of control which create denial of service, loops etc. We can move to automatic in V0.2. So it might work like this: JoergW reads a list of sites. foreach week foreach site read new molecules (date < lastdate) calculate properties add to JWProperty feed set lastdate At this stage I don't think Joerg should be required to merge molecules although you are welcome to try! This would mean the same property might be calculated twice. A specialist aggregator could deal with that. I am also keen to see bitscreens being distributed. I think we could set a standard here Peter Murray-Rust Unilever Centre for Molecular Informatics Chemistry Department, Cambridge University Lensfield Road, CAMBRIDGE, CB2 1EW, UK Tel: +44-1223-763069 |