JINSECT - N-gram Graph Based Toolkit
The JINSECT toolkit is a Java-based toolkit and library that supports and demonstrates the use of n-gram graphs within Natural Language Processing applications, ranging from summarization and summary evaluation to text classification and indexing.
What does JINSECT stand for?
OK. You got me, it is an acronym: Java INteroperable Semantic Extraction Context-based Toolkit
The idea is that JINSECT is an open toolkit for NLP, that allows analysis of texts taking into account the context. This notion of context is basic in the n-gram graph framework and is applied throughout the applications and code of JINSECT.
I see the site has just opened. What's next?
The next steps will be:
- Indicate related literature.
- Add code snippets for some basic tasks. (Ongoing - Check the Code Snippets page)
- Give a user's manual for the AutoSummENG method (which is implemented in JInsect).
- Give the rationale of the n-gram graphs in a simple-to-understand text.
And that's enough for a start I think...