Public opinion analysis system
Haystack is an open-source NLP framework to interact with your data
A library for deep learning end-to-end dialog systems and chatbots
Industrial-strength Natural Language Processing (NLP)
ExtractThinker is a Document Intelligence library for LLMs
A natural language interface for computers
Semantic search and workflows for medical/scientific papers
The no-nonsense RAG chunking library
Data and tools for generating and inspecting OLMo pre-training data
ReFT: Representation Finetuning for Language Models
Efficient Retrieval Augmentation and Generation Framework
Han Language Processing
Large Language Model Text Generation Inference
A Heterogeneous Benchmark for Information Retrieval
The Classical Language Toolkit
An LLM-powered knowledge curation system that researches topics
Training data (data labeling, annotation, workflow) for all data types
The library to build & auto-optimize LLM applications
Efficient few-shot learning with Sentence Transformers
Trained models & code to predict toxic comments
Hub of ready-to-use datasets for ML models
A Repo For Document AI
A full spaCy pipeline and models for scientific/biomedical documents
Neural Network Compression Framework for enhanced OpenVINO inference
Extract schema, statistics and entities from datasets