Nov5 pre-release: entry ids are unreliable

Brought to you by: luthien-merilin

#7 Nov5 pre-release: entry ids are unreliable

Status: open

Owner: nobody

Labels: None

Priority: 5

Updated: 2010-11-07

Created: 2010-11-07

Creator: hisweloke

Private: No

The "id" attribute of "entry" elements in the original XML TEI source was apparently used to index the dictionary.
However, it is not always reliable, as it is an internal reference - it could even have been a raw number or anything else provided it is unique - and it's just per chance that I initially used something looking like the entry text itself.
Ideally we should use the firth "orth" element in the first "form" element of an "entry" element to ensure the correct word is indexed.
See attached screenshot for an example of the issue - This one is not critical, but there could be other cases where this might be more serious (for instance, words which had a typo fixed at some time, but without changing the internal id).

Discussion

hisweloke - 2010-11-07

Anotated screenshot (png)

issue8.png

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

hisweloke - 2010-11-07

summary: Nov5 pre-release: entry ids are unreliabled --> Nov5 pre-release: entry ids are unreliable
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Nov5 pre-release: entry ids are unreliable

Group

Searches

Help

#7 Nov5 pre-release: entry ids are unreliable

Discussion