In some cases (not always) Name,CAS,EINECS are missing for reference substances, if the import is done via command line ambitsi-3.0.1-SNAPSHOT from dbsubstance package
java -jar ambit2-dbsubstance-{version}.jar -i "folder_with_i5z_files" -x einecs -c ambit.properties -m true -t true
When uploading via the web services /ui/uploadsubstance the identifiers are correctly imported.
The reason is the command line import restricts the max number of reference substances by default to one.
While reading the i5z file, the reference substances are imported first (with all of its identifiers). If only one reference substance is imported , it might not be the one actualy referred by the substance in the i5z file. Then the right ref substance will be imported only with its UUID, but not CAS, EINECS and name.
The max number of reference substance option was introduced to cope with IUCLID files containing huge number of reference substances with same CAS/EC/NAME, but different UUIDs.
Last edit: Nina Jeliazkova 2016-03-30
i5 importer was modified to assign I5UUID as different properties , thus one structure may have several I5UUIDs, coming from different reference substances. How to match these is controlled by option -x of the ambitsi.jar
Last edit: Nina Jeliazkova 2016-04-06
Revision: 7427
Fixed bug resulting in empty structures when build2d of the normalizer is turned on
and it is turned on when importing structures from i5z files
(this was not an issue with cdk 1.4.x)
i5 library : Fix for setting the reference substance of a substance
https://github.com/ideaconsult/i5/commit/4441a74382387a7f5cded0393a4696abbe3be679
Revision: 7430
Remove the default limit of max reference substances (now that we can handle them correctly)
Fixed , when importing use
-x einecsoption.Howevere, the web upload still uses reference substance UUID for matching substances (which is OK if the i5z files are using unique reference substance UUIDs.