ExtractThinker is a Document Intelligence library for LLMs
Recognition and resolution of numbers, units, date/time, etc.
Extract schema, statistics and entities from datasets
Chinese XLNet pre-trained model
Modest natural-language processing
Unicode XML TEI text analysis platform
Dataset generation for AI chatbots, NLP tasks