
#16 Create resource parser chain for lines.db file format

in-progress
None
bug
major
client
Unassigned
2014-06-25
2014-06-25
Kevin Black
No

Create resource parser chain for lines.db file format.

This is the file format in which Syriac text data is stored in the Comprehensive Aramaic Lexicon (CAL) project.

Each line of the file consists of four columns separated by tab characters:

Column 1: a composite of text number and line number (in the sample data, 60550 appears to be the number for the publius text, and the remaining digits appear to identify the line).

Column 2: a zero-based line index.

Column 3: the Syriac tokens of the line, represented in a transliteration scheme (not yet identified).

Column 4: a Boolean (0/1) value indicating whether a reference is available for the line.
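
As a rough illustration of the column layout described above, here is a minimal Python sketch of a line parser. It is not the CCF implementation: the names LinesDbRecord and parse_lines_db_line are hypothetical, and it assumes that the tokens in column 3 are separated by whitespace, which the ticket does not state. Column 1 is kept as an opaque string because its internal structure is only guessed at above.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LinesDbRecord:
    """One line of a lines.db file (hypothetical record type)."""
    composite_id: str      # column 1: composite text/line number, kept opaque
    line_index: int        # column 2: zero-based line index
    tokens: List[str]      # column 3: transliterated Syriac tokens
    has_reference: bool    # column 4: 0/1 flag


def parse_lines_db_line(raw: str) -> LinesDbRecord:
    """Split one tab-separated line into its four columns."""
    fields = raw.rstrip("\r\n").split("\t")
    if len(fields) != 4:
        raise ValueError(f"expected 4 tab-separated columns, got {len(fields)}")
    composite_id, index_text, token_text, flag_text = fields
    if flag_text not in ("0", "1"):
        raise ValueError(f"column 4 must be 0 or 1, got {flag_text!r}")
    return LinesDbRecord(
        composite_id=composite_id,
        line_index=int(index_text),
        # Assumption: tokens within column 3 are whitespace-separated.
        tokens=token_text.split(),
        has_reference=(flag_text == "1"),
    )
```

A line parser and a token parser could share this split: the line parser handles the tab-separated columns, and the token parser would take over column 3, which is also where any handling of in-line emendation annotations would belong.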

Trac #437.

Discussion

  • Kevin Black

    Kevin Black - 2014-06-25

    Commit 331 (CCF) Ticket #437. Added a stub for the CAL lines.db ResourceParserBuilder in ResourceParserBuilders and registered it in the ResourceParserBuilderRegistry for extension type "lines.db".

    Commit 332 (CCF) Ticket #437. Added line and token parsers for the lines.db resource parser chain. The token parser might need to be refined to handle in-line emendation annotations.

     
  • Kevin Black

    Kevin Black - 2014-06-25
    • status: new --> in-progress
    • assigned_to: Kevin Black
     
