Download Latest Version ambit.war (102.3 MB)
Email in envelope

Get an email when there's a new version of AMBIT:Chemical Structure DB&Web Service

Home / Ambit2 / AMBIT applications / ambitcli / ambitcli-3.0.2
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2016-10-20 10.4 kB
ambitcli-3.0.2.jar 2016-04-14 36.3 MB
ambitcli-3.0.2-7433.jar 2016-04-08 36.3 MB
ambitcli-3.0.2-7385.jar 2016-03-16 36.1 MB
ambitcli-3.0.2-7349.jar 2016-03-10 36.1 MB
ambitcli-3.0.2-7354.jar 2016-03-10 36.1 MB
ambitcli-3.0.2-7315.jar 2016-02-27 36.2 MB
Totals: 7 Items   217.2 MB 0

ambitcli-3.0.2.jar is the stable release (build 7472)

See http://ambit.sourceforge.net/ambitcli_standardisation.html

ambitcli-3.0.2-7433.jar is a prerelease of ambit 3.0.2 (build 7433)

  • Dense hashed fingerprint format replaced with sparse one, same as for non-hashed fingerprints
  • Changed to false the default option for storing the transformation info as properties
  • Assign rank property even if tautomerisation is not applicable
  • logging improvements

ambitcli-3.0.2-7385.jar is a prerelease of ambit 3.0.2 (build 7385)

  • Neutralisation SMIRKS added for N oxides
  • logging improvements

ambitcli-3.0.2-7354.jar is a prerelease of ambit 3.0.2 (build 7354)

  • Bug fixes b7346 - Fixed a bug caused by the stereo info left from the tautomer structure cloning.
  • Fingerprint calculation
  • input tag option -d inputtag_smiles
  • ensure RANK column is always present

Usage

a standardize -m post -d <parameters>

"Chemical structure standardization (-i inputfile.sdf -o outputfile.sdf , recognized by extensions .sdf , .csv, .cml , .txt)"
-a standardize -m post
-d smirks=null  // JSON file with SMIRKS transformations    [type:String, mandatory:false]
-d neutralise=false // If true neutralises the molecule via set of predefined SMIRKS    [type:Boolean, mandatory:false]
-d splitfragments=false // If true keeps the largest fragment   [type:Boolean, mandatory:false]
-d implicith=false  // If true converts hydrogens to implicit   [type:Boolean, mandatory:false]
-d generatestereofrom2d=false   // If true uses org.openscience.cdk.stereo.StereoElementFactory to generate the stereochemistry from 2D coordinates [type:Boolean, mandatory:false]
-d isotopes=false   // If true clears isotopes  [type:Boolean, mandatory:false]
-d generate2D=false // Generate 2d coordinates (if no any)  [type:Boolean, mandatory:false]
-d tautomers=false  // If true generates the top ranked tautomer    [type:Boolean, mandatory:false]
-d inchi=true   // Generates InChIs. If -d tautomers=true InChI FixedH=true, otherwise generates standard InChI [type:Boolean, mandatory:false]
-d smiles=true  // Generates SMILES (isomeric, kekule).  Uses CDK SmilesGenerator.isomeric()    [type:Boolean, mandatory:false]
-d smilescanonical=false    // Generates SMILES (canonical).  Uses CDK SmilesGenerator.absolute()   [type:Boolean, mandatory:false]
-d smilesaromatic=false // Generates aromatic SMILES.  Uses CDK SmilesGenerator.aromatic()  [type:Boolean, mandatory:false]
-d page=0   // Start page (first page = 0)  [type:Integer, mandatory:false]
-d pagesize=20000   // Page size (in number of records). Set to -1 to read all records. [type:Integer, mandatory:false]
-d inputtag_smiles=SMILES   // Specifies the name of the column, containing SMILES  [type:String, mandatory:false]
-d tag_inchi=InChI  // Specifies the InChI tag  [type:String, mandatory:false]
-d tag_inchikey=InChIKey    // Specifies the InChIKey tag   [type:String, mandatory:false]
-d tag_smiles=SMILES    // Specifies the SMILES tag [type:String, mandatory:false]
-d tag_rank=RANK    // Specifies the tag to store the tautomer rank (energy based, less is better)  [type:String, mandatory:false]
-d tag_tokeep=  // Specifies which tags to keep, comma delimited list. Everything else will be removed. To keep all the tags, leave this empty. [type:String, mandatory:false]
-d sdftitle=null    // Specifies which field to write in the first SDF line null|inchikey|inchi|smiles|any-existing-field   [type:String, mandatory:false]
-d debugatomtypes=false // Writes only structures with AtomTypes property set. For debug purposes   [type:boolean, mandatory:false]

Generate SMILES, InChI and InChI key, retain only the PUBCHEM_CID from the fields in the input file

java -jar ambitcli-3.0.2-7354.jar -a standardize -m post -a standardize -m post -d page=0 -d pagesize=-1 -d tautomers=false -d tag_tokeep=PUBCHEM_CID -d smilescanonical=false -d smiles=true -d inchi=true -i inputfile -o outputfile

Full standardisation, write SMILES, InChI and InChI key into AMBIT_SMILES, AMBIT_InChI, AMBIT_InChIKey, retain only the PUBCHEM_CID from the fields in the input file

java -jar ambitcli-3.0.2-7354.jar -a standardize -m post -a standardize -m post -d page=0 -d pagesize=-1 -d tag_smiles=AMBIT_SMILES -d tag_inchi=AMBIT_InChI -d tag_inchikey=AMBIT_InChIKey -d tautomers=true -d splitfragments=true -d implicith=true -d smiles=true -d smilescanonical=false -d inchi=true -d neutralise=true -d isotopes=true -d tag_tokeep=PUBCHEM_CID

New command to generate fingerprints

-a fingerprint -m post -d <parameters>

"Fingerprint calculation. Writes multiple files per fingerprint, all files start with prefix given by -o prefix)"
-a fingerprint -m post
-d fpclass=CircularFingerprinter,PubchemFingerprinter,MACCSFingerprinter    // Comma delimited list of class names implementing org.openscience.cdk.fingerprint.IFingerprinter, e.g. KlekotaRothFingerprinter. If not fully qualified will prepend 'org.openscience.cdk.fingerprint.'   [type:String, mandatory:false]
-d page=0   // Start page (first page = 0)  [type:Integer, mandatory:false]
-d pagesize=20000   // Page size (in number of records) [type:Integer, mandatory:false]
-d inputtag_smiles=SMILES   // Specifies the name of the column, containing SMILES  [type:String, mandatory:false]
-d tag_tokeep=InChIKey  // Specifies which tags to keep, comma delimited list. Everything else will be removed. To keep all the tags, leave this empty. [type:String, mandatory:false]
-d write_count=false    // Whether to write the counts of getCountFingerprint() [type:Boolean, mandatory:false]
-d write_raw=false  // Whether to write the raw fingerprint (getRawFingerprint) [type:Boolean, mandatory:false]
-d sdftitle=null    // Specifies which field to write in the first SDF line [type:String, mandatory:false]

Example -d fpclass=CircularFingerprinter,PubchemFingerprinter,MACCSFingerprinter -d page=0 -d pagesize=20000 -d inputtag_smiles=SMILES -d tag_tokeep=InChIKey -d write_count=false -d write_raw=false -d sdftitle=null

ambitcli-3.0.2-7315.jar is a prerelease of ambit 3.0.2 (build 7315)

with new option added to specify which fields to be retained

    -d tag_tokeep=comma_delimited_list

Usage

>java -jar ambitcli-3.0.2-7315.jar -help
INFO   ambitcli-3.0.2-SNAPSHOT build:7315 1456509922527
http://ambit.sourceforge.net/download_ambitcli.html



>java -jar ambitcli-3.0.2-7315.jar -a standardize -m help
INFO   ambitcli-3.0.2-SNAPSHOT build:7315 1456509922527
http://ambit.sourceforge.net/download_ambitcli.html
-a standardize -m post -d <parameters>
"Chemical structure standardization (-i inputfile.sdf -o outputfile.sdf , recognized by extensions .sdf , .csv, .cml , .txt)"
-a standardize -m post
-d smirks=null // JSON file with SMIRKS transformations        [type:String, mandatory:false]
-d neutralise=false    // If true neutralises the molecule via set of predefined SMIRKS        [type:Boolean, mandatory:false]
-d splitfragments=false        // If true keeps the largest fragment   [type:Boolean, mandatory:false]
-d implicith=false     // If true converts hydrogens to implicit       [type:Boolean, mandatory:false]
-d generatestereofrom2d=false  // If true uses org.openscience.cdk.stereo.StereoElementFactory to generate the stereochemistry from 2D coordinates     [type:Boolean, mandatory:false]
-d isotopes=false      // If true clears isotopes      [type:Boolean, mandatory:false]
-d generate2D=false    // Generate 2d coordinates (if no any)  [type:Boolean, mandatory:false]
-d tautomers=true      // If true generates the top ranked tautomer    [type:Boolean, mandatory:false]
-d inchi=true  // Generates InChIs. If -d tautomers=true InChI FixedH=true, otherwise generates standard InChI [type:Boolean, mandatory:false]
-d smiles=false        // Generates SMILES (isomeric, kekule)  Uses CDK SmilesGenerator.isomeric() [type:Boolean, mandatory:false]
-d smilescanonical=true        // Generates SMILES (canonical). Uses CDK SmilesGenerator.absolute()  [type:Boolean, mandatory:false]
-d page=0      // Start page (first page = 0)  [type:Integer, mandatory:false]
-d pagesize=20000      // Page size (in number of records)     [type:Integer, mandatory:false]
-d tag_inchi=InChI     // Specifies the InChI tag      [type:String, mandatory:false]
-d tag_inchikey=InChIKey       // Specifies the InChIKey tag   [type:String, mandatory:false]
-d tag_smiles=SMILES   // Specifies the SMILES tag     [type:String, mandatory:false]
-d tag_rank=RANK       // Specifies the tag to store the tautomer rank (energy based, less is better)  [type:String, mandatory:false]
-d tag_tokeep= // Specifies which tags to keep, comma delimited list. Everything else will be removed. To keep all thetags, leave this empty. [type:String, mandatory:false]
-d sdftitle=null       // Specifies which field to write in the first SDF line null|inchikey|inchi|smiles|any-existing-field   [type:String, mandatory:false]
-d debugatomtypes=false        // Writes only structures with AtomTypes property set. For debug purposes       [type:boolean, mandatory:false]

Examples:

Generate SMILES, InChI and InChI key, retain only the PUBCHEM_CID from the fields in the input file

java -jar ambitcli-3.0.2-7315.jar -a standardize -m post -a standardize -m post -d page=0 -d pagesize=-1 -d tautomers=false -d tag_tokeep=PUBCHEM_CID -d smilescanonical=false -d smiles=true -d inchi=true -i inputfile -o outputfile

Full standardisation, write SMILES, InChI and InChI key into AMBIT_SMILES, AMBIT_InChI, AMBIT_InChIKey, retain only the PUBCHEM_CID from the fields in the input file

java -jar ambitcli-3.0.2-7315.jar -a standardize -m post -a standardize -m post -d page=0 -d pagesize=-1 -d tag_smiles=AMBIT_SMILES -d tag_inchi=AMBIT_InChI -d tag_inchikey=AMBIT_InChIKey -d tautomers=true -d splitfragments=true -d implicith=true -d smiles=true -d smilescanonical=false -d inchi=true -d neutralise=true -d isotopes=true -d tag_tokeep=PUBCHEM_CID
Source: README.md, updated 2016-10-20