Hello!
Decaleon news:
1)
- The standard Vocabulary File of Decaleon will be extended by about 8500 more words.
- The File "VokabularZusatz2.txt" contains the German Draft.
- The new words will be coded in the next 20 months.
2)
- Decaleon plus was modified to import an extra Vocabulary File containing technical terms.
- This File, "VocaExtensions.xml", is also placed in the \bin\Debug\ folder of Decaleon.
- The technical terms including Semantic Groups will be coded in the next 16 months.
- This Vocabulary will be uploaded regularly to SourceForge.
3)
- Group Member Indices were changed to 5 digits to reference up to 99999 words per Part of Speech.
- In the Semantic Groups of the XML Vocabulary Files they have the form LLLXXXXX[#LLL].
- In the Program, corresponding changes were made to import (checking, extensions) and access (dictionary window).
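
The index form LLLXXXXX[#LLL] can be checked with a small parser. The following is only a sketch and not the Decaleon code: it assumes that "L" stands for a letter and "X" for a decimal digit, and the names GroupMemberIndex and parse_group_member_index as well as the sample values are made up for illustration.

```python
import re
from typing import NamedTuple, Optional

# Assumption: "L" = letter, "X" = digit, so an index is three letters,
# five digits, and optionally "#" followed by three more letters.
INDEX_PATTERN = re.compile(r"([A-Za-z]{3})(\d{5})(?:#([A-Za-z]{3}))?")


class GroupMemberIndex(NamedTuple):
    prefix: str            # the leading LLL part
    number: int            # the five-digit XXXXX part, 0..99999
    suffix: Optional[str]  # the optional LLL part after "#", or None


def parse_group_member_index(text: str) -> GroupMemberIndex:
    """Split one index string into its parts or raise ValueError."""
    match = INDEX_PATTERN.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"not a valid group member index: {text!r}")
    prefix, digits, suffix = match.groups()
    return GroupMemberIndex(prefix, int(digits), suffix)


if __name__ == "__main__":
    # Hypothetical sample values, not taken from the Vocabulary Files.
    print(parse_group_member_index("ABC00042"))
    print(parse_group_member_index("ABC99999#XYZ"))
```

An older 4-digit index such as "ABC0042" is rejected by this pattern, which is one way to detect Vocabulary Files that were not yet converted.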
4)
- Design for standard Addition is completed.
- New file VocaAddition.xml is in \bin\Debug\ of Decaleon; German/English finished.
- Other languages and Semantic Groups will follow when the technical Extension design is finished.
5)
- Design for technical Extension is finished.
- 20 Thematic Extensions containing about 14000 entries are done (5 Languages).
- Standard Addition was finished in spring 2016.
6)
- Version 6.0 of Decaleon software is under development.
- A new Addition containing about 23500 entries is in progress.
- About 20 additional Languages and Interslavic will be supported at the A1/A2 levels.
7)
- Version 6.0 received about 8500 additional words and expressions (Addition3.xml) and about 6000 important phrases (VocaCooccurr.xml).
8)
- Version 7.0 will support Bulgarian, Czech, Dutch, Hungarian, Romanian and Turkish up to level B1 (2022/2023).
TEXminer news:
1)
- TEXminer was modified to import an extra Vocabulary File containing technical terms.
- The Language Models for each language now consist of two files:
- "Vocagram.xml" for standard and "VocagramE.xml" for technical Extensions.
- Also the Semantic Group information is split into "Vocasemg.xml" and "VocasemgE.xml"
2)
- The technical term Extension is in progress until mid-2016.
- Mathematics, Physics, Chemistry and Biology are the first samples in the Semantic Group datasets.
- Check with the typical texts "Geometry.txt"/"Microscope.txt"/"Battery.txt"/"Metabolism.txt":
- Semantic Group Hits whose names end with "*" are technical.
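
The "*" convention makes it easy to separate technical from standard hits in a result list. The sketch below is not part of TEXminer: the function name split_hits and the sample hit names are hypothetical, and it assumes the hits are available as plain strings.

```python
from typing import Iterable, List, Tuple


def split_hits(hit_names: Iterable[str]) -> Tuple[List[str], List[str]]:
    """Return (standard, technical) Semantic Group Hit names.

    A name ending with "*" counts as technical; the marker is removed
    from the returned name.
    """
    standard: List[str] = []
    technical: List[str] = []
    for name in hit_names:
        if name.endswith("*"):
            technical.append(name[:-1])
        else:
            standard.append(name)
    return standard, technical


if __name__ == "__main__":
    # Hypothetical hit names for illustration only.
    standard, technical = split_hits(["Household", "Geometry*", "Optics*"])
    print("standard: ", standard)
    print("technical:", technical)
```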
3)
- A bug in the registerSemanticGroup Routine suppressed Semantic Group Hits.
- This is now fixed.
4)
- TEXminer was modified to import an additional Vocabulary File containing standard terms.
- The Language Models for each language now consist of three files:
- "Vocagram.xml" for standard, "VocagramA.xml" for addition and "VocagramE.xml" for technical Extensions.
- Also the Semantic Group information is split into "Vocasemg.xml", "VocasemgA.xml" and "VocasemgE.xml"
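
The file naming scheme above can be written down as a small check that lists which model files are missing. This is a sketch under assumptions, not TEXminer code: it assumes one directory per language, and the names LAYER_SUFFIXES, model_files and missing_files are invented for this example.

```python
from pathlib import Path
from typing import Dict, List

# Layer name -> suffix letter used in the file names listed above.
LAYER_SUFFIXES = {"standard": "", "addition": "A", "extension": "E"}


def model_files(model_dir: str) -> Dict[str, List[Path]]:
    """Map each layer to its Vocagram and Vocasemg file paths.

    Assumption: all six files of one language sit in the same directory.
    """
    base = Path(model_dir)
    return {
        layer: [base / f"Vocagram{suffix}.xml", base / f"Vocasemg{suffix}.xml"]
        for layer, suffix in LAYER_SUFFIXES.items()
    }


def missing_files(model_dir: str) -> List[Path]:
    """List the expected model files that do not exist in model_dir."""
    return [
        path
        for paths in model_files(model_dir).values()
        for path in paths
        if not path.is_file()
    ]


if __name__ == "__main__":
    # "models/German" is a hypothetical location.
    for path in missing_files("models/German"):
        print("missing:", path)
```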
5)
- French, Spanish and Russian Models for Standard Vocabulary Addition have been extended.
- TEXminer now directly supports 19 Languages. Adaptations for other Languages are possible.
6)
- TEXminer will take the words of Decaleon 6.0 new Addition into the Language Models.
- Data Sets are already available in German, English, French, Spanish and Russian; other languages are in progress.
7)
- TEXminer was updated with about 10300 new words (2018/2019).
Greetings
M.Penzkofer
Availability of Languages by Speech Levels (x = full, o = in part, - = none):
Language     A1  A2  B1  B2  C1  C2(EX)  SUM(progress)
English      x   x   x   x   x   o       35000->66000
French       x   x   x   x   x   o       30000->59000
German       x   x   x   x   x   x       59000->69000
Russian      x   x   x   x   x   o       32000->62000
Spanish      x   x   x   x   x   o       30000->59000
Croatian     x   x   x   x   o   -       20000->31000
Danish       x   x   x   x   o   -       19000->30000
Greek        x   x   x   x   o   -       20000->31000
Italian      x   x   x   x   o   -       22000->34000
Polish       x   x   x   x   o   -       20000->33000
Portuguese   x   x   x   x   o   -       20000->30000
Swedish      x   x   x   x   o   -       20000->31000
ESPERANTO    x   x   x   o   o   -       19000->35000
Bulgarian    x   x   x   o   -   -        4500->10500
Czech        x   x   x   o   -   -        4500->10500
Dutch        x   x   x   o   -   -        4500->10500
Hungarian    x   x   x   o   -   -        4500->10500
Romanian     x   x   x   o   -   -        4500->10500
Turkish      x   x   x   o   -   -        4500->10500
Albanian     x   x   o   -   -   -        1400-> 4200
Finnish      x   x   o   -   -   -        1500-> 4500
Norwegian    x   x   o   -   -   -        1500-> 4500
Serbian      x   x   o   -   -   -        1500-> 4500
Slovak       x   x   o   -   -   -        1500-> 4500
Slovene      x   x   o   -   -   -        1500-> 4500
Ukrainian    x   x   o   -   -   -        1500-> 4500
INTERSLAVIC  o   o   o   -   -   -        1000-> 3500
Arabic       x   o   -   -   -   -         600-> 1200
Chinese      x   o   -   -   -   -         400-> 1200
Hindi        x   o   -   -   -   -         400-> 1200
Indonesian   x   o   -   -   -   -         400-> 1200
Japanese     x   o   -   -   -   -         400-> 1200
Korean       x   o   -   -   -   -         400-> 1200
Urdu         x   o   -   -   -   -         400-> 1000
Vietnamese   x   o   -   -   -   -         400-> 1200