Chinese XLNet pre-trained model
Pretty diff to html javascript library (diff2html)
A library for deep learning end-to-end dialog systems and chatbots
General natural language facilities for node
Gives you the power to build your own Wisdom widget
WikiChat is an improved RAG
ReFT: Representation Finetuning for Language Models
Text mining using tidy tools
Easy-to-use and high-performance NLP and LLM framework
ExtractThinker is a Document Intelligence library for LLMs
Superlinked is a Python framework for AI Engineers
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
Fast and customizable framework for automatic ML model creation
Semantic search and workflows for medical/scientific papers
A Heterogeneous Benchmark for Information Retrieval
The no-nonsense RAG chunking library
The pluggable natural language linter for text and markdown
Data and tools for generating and inspecting OLMo pre-training data
Efficient Retrieval Augmentation and Generation Framework
Recognition and resolution of numbers, units, date/time, etc.
A full spaCy pipeline and models for scientific/biomedical documents
Dealing with all unstructured data, such as reverse image search
Libraries for applying sparsification recipes to neural networks
Data processing for and with foundation models
An easy-to-use LLMs quantization package with user-friendly apis