
#95 missing reference substance identifiers when importing i5z files

closed-fixed
i5z (1)
5
2016-04-09
2016-03-30
No

In some cases (not always), the Name, CAS and EINECS identifiers are missing for reference substances when the import is done from the command line with ambitsi-3.0.1-SNAPSHOT from the dbsubstance package:

java -jar ambit2-dbsubstance-{version}.jar -i "folder_with_i5z_files" -x einecs -c ambit.properties  -m true -t true

When uploading via the web service /ui/uploadsubstance, the identifiers are imported correctly.

Discussion

  • Nina Jeliazkova

    Nina Jeliazkova - 2016-03-30

    The reason is that the command line import restricts the maximum number of reference substances to one by default:

    -r,--maxReferenceSubstances <value>   Maximum reference substances in
    

    While reading the i5z file, the reference substances are imported first (each with all of its identifiers). If only one reference substance is imported, it might not be the one actually referred to by the substance in the i5z file. The right reference substance is then imported with its UUID only, without CAS, EINECS and name.

    The option limiting the maximum number of reference substances was introduced to cope with IUCLID files containing a huge number of reference substances with the same CAS/EC/name but different UUIDs.
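
The effect of the limit can be illustrated with a minimal sketch. It is not the AMBIT importer code: the class `RefSubstanceLimitSketch`, the helpers `refSubstance` and `resolve`, the map keys and the sample UUIDs are all hypothetical, and it assumes the reference substances are taken in file order until the limit is reached.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical model of the behaviour described above; not AMBIT classes.
final class RefSubstanceLimitSketch {

    // Builds the identifier set of one reference substance found in the i5z file.
    static Map<String, String> refSubstance(String uuid, String name, String cas, String einecs) {
        Map<String, String> ids = new LinkedHashMap<>();
        ids.put("I5UUID", uuid);
        ids.put("Name", name);
        ids.put("CAS", cas);
        ids.put("EINECS", einecs);
        return ids;
    }

    // Imports at most `max` reference substances (assumed: in file order) with all
    // identifiers, then resolves the UUID the substance refers to. A UUID that was
    // not among the imported ones comes back as a stub carrying the UUID only.
    static Map<String, String> resolve(List<Map<String, String>> inFile, String referredUuid, int max) {
        Map<String, Map<String, String>> imported = new LinkedHashMap<>();
        for (Map<String, String> ref : inFile) {
            if (imported.size() >= max) {
                break;
            }
            imported.put(ref.get("I5UUID"), ref);
        }
        Map<String, String> hit = imported.get(referredUuid);
        if (hit != null) {
            return hit;
        }
        Map<String, String> stub = new LinkedHashMap<>();
        stub.put("I5UUID", referredUuid);
        return stub;
    }

    public static void main(String[] args) {
        List<Map<String, String>> inFile = List.of(
                refSubstance("uuid-A", "formaldehyde", "50-00-0", "200-001-8"),
                refSubstance("uuid-B", "benzene", "71-43-2", "200-753-7"));
        // Limit of one: the substance refers to uuid-B, which was never imported.
        System.out.println(resolve(inFile, "uuid-B", 1));
        // No effective limit: uuid-B is resolved with all identifiers.
        System.out.println(resolve(inFile, "uuid-B", Integer.MAX_VALUE));
    }
}
```

With a limit of one, the lookup for the second reference substance returns only its UUID, which matches the symptom reported in this ticket; without the limit, name, CAS and EINECS are present.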

     

    Last edit: Nina Jeliazkova 2016-03-30
  • Nina Jeliazkova

    Nina Jeliazkova - 2016-04-06

    The i5 importer was modified to assign each I5UUID as a separate property, so one structure may have several I5UUIDs coming from different reference substances. How these are matched is controlled by the -x option of ambitsi.jar.
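
A rough sketch of what matching on a chosen key could look like, under stated assumptions: the class `MatchKeySketch`, the enum `MatchBy` and its `UUID` value are hypothetical (only `-x einecs` appears in this ticket), and the real importer may work differently.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of matching reference substances to structures.
final class MatchKeySketch {

    // Hypothetical match modes; only "einecs" is confirmed by the ticket.
    enum MatchBy { UUID, EINECS }

    // Each reference substance is given as {uuid, einecs}. The result maps the
    // match key of a structure to all I5UUIDs attached to that structure.
    static Map<String, List<String>> group(List<String[]> refs, MatchBy by) {
        Map<String, List<String>> structures = new LinkedHashMap<>();
        for (String[] ref : refs) {
            String key = (by == MatchBy.EINECS) ? ref[1] : ref[0];
            structures.computeIfAbsent(key, k -> new ArrayList<>()).add(ref[0]);
        }
        return structures;
    }

    public static void main(String[] args) {
        // Two reference substances with the same EC number but different UUIDs.
        List<String[]> refs = List.of(
                new String[] {"uuid-A", "200-753-7"},
                new String[] {"uuid-B", "200-753-7"});
        System.out.println(group(refs, MatchBy.EINECS));
        System.out.println(group(refs, MatchBy.UUID));
    }
}
```

Matching by EINECS yields a single structure that carries both I5UUIDs, while matching by UUID keeps the two reference substances apart.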

     

    Last edit: Nina Jeliazkova 2016-04-06
  • Nina Jeliazkova

    Nina Jeliazkova - 2016-04-06

    Revision: 7427
    Fixed bug resulting in empty structures when build2d of the normalizer is turned on
    and it is turned on when importing structures from i5z files
    (this was not an issue with cdk 1.4.x)

     
  • Nina Jeliazkova

    Nina Jeliazkova - 2016-04-06

    Revision: 7430
    Remove the default limit of max reference substances (now that we can handle them correctly)

     
  • Nina Jeliazkova

    Nina Jeliazkova - 2016-04-09
    • status: open --> closed-fixed
     
  • Nina Jeliazkova

    Nina Jeliazkova - 2016-04-09

    Fixed; when importing, use the -x einecs option.
    However, the web upload still uses the reference substance UUID for matching substances (which is OK if the i5z files use unique reference substance UUIDs).

     
