The Canto->Chado loader is coping badly with cdc2-33. In Canto, cdc2-33 mostly has type "unknown": there are 8 sessions where the type is "unknown" and one where it is "mutation of single amino acid residue". There are 7 sessions where the description is "unknown" and 2 where it is "A177T". The Canto->Chado loader isn't being smart about reconciling these conflicting values.
Unfortunately, cdc2-33 is ending up with type "unknown" in Chado, but it should be "mutation of single amino acid residue".
Code to check Canto:
./script/canto_curs_map.pl 'for my $al ($curs_schema->resultset("Allele")->search({ name => "cdc2-33"})->all()) { print $al->name(), " ", $al->description(), " ", $al->type, " ", $curs->curs_key(), "\n"; }' | uniq
Results:
cdc2-33 A177T mutation of single amino acid residue 57f3a3b217f0cd70
cdc2-33 unknown unknown c2b25694958a17bc
cdc2-33 unknown unknown d4ae29a798ec94dd
cdc2-33 unknown unknown abf32a065b08bcd5
cdc2-33 unknown unknown 69403b8ceb819841
cdc2-33 unknown unknown 8855a2414459b824
cdc2-33 unknown unknown cfa1ac27dc7acf6c
cdc2-33 unknown unknown 8759012da61b3559
cdc2-33 A177T unknown d676d3ce9ba0d3fe
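One possible rule for reconciling rows like these is to ignore "unknown" whenever any session supplies a real value, and otherwise fall back to "unknown". The sketch below is only an illustration of that idea using the rows from the query output; it is not the loader's actual code, and the names `most_informative` and `merge_allele` are hypothetical. It also assumes that the known values do not conflict with each other, which holds for cdc2-33 but may not hold for other alleles.

```python
from collections import Counter

UNKNOWN = "unknown"

# (description, type, curs_key) rows copied from the Canto query output
ROWS = [
    ("A177T", "mutation of single amino acid residue", "57f3a3b217f0cd70"),
    ("unknown", "unknown", "c2b25694958a17bc"),
    ("unknown", "unknown", "d4ae29a798ec94dd"),
    ("unknown", "unknown", "abf32a065b08bcd5"),
    ("unknown", "unknown", "69403b8ceb819841"),
    ("unknown", "unknown", "8855a2414459b824"),
    ("unknown", "unknown", "cfa1ac27dc7acf6c"),
    ("unknown", "unknown", "8759012da61b3559"),
    ("A177T", "unknown", "d676d3ce9ba0d3fe"),
]


def most_informative(values):
    """Return the most common value that is not "unknown".

    Falls back to "unknown" only when no session gives a real value.
    """
    known = Counter(v for v in values if v != UNKNOWN)
    if not known:
        return UNKNOWN
    return known.most_common(1)[0][0]


def merge_allele(rows):
    """Pick one description and one type for an allele across all sessions."""
    description = most_informative(row[0] for row in rows)
    allele_type = most_informative(row[1] for row in rows)
    return description, allele_type


if __name__ == "__main__":
    description, allele_type = merge_allele(ROWS)
    print("cdc2-33", description, allele_type)
```

With the rows above this picks description "A177T" and type "mutation of single amino acid residue". If two sessions gave different real values, a rule like this would silently pick the more common one, so a real fix would probably want to report such conflicts instead.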
Update this ticket when the loading is fixed:
https://www.ebi.ac.uk/panda/jira/browse/PB-1950
Appears to be OK!
Reopening because Kim hasn't commented to say anything has been done, and the problem is still there on gene pages.
Yep, I haven't fixed this yet.