Training data (data labeling, annotation, workflow) for all data types
Toolkit for conversational AI
Haystack is an open source NLP framework to interact with your data
Semantic search and workflows for medical/scientific papers
Han Language Processing
An LLM-powered knowledge curation system that researches topics
Large Language Model Text Generation Inference
Industrial-strength Natural Language Processing (NLP)
ExtractThinker is a Document Intelligence library for LLMs
The no-nonsense RAG chunking library
ReFT: Representation Finetuning for Language Models
A curated list of data mining papers about fraud detection
Easy-to-use and powerful NLP library with Awesome model zoo
Hub of ready-to-use datasets for ML models
Data and tools for generating and inspecting OLMo pre-training data
Trained models & code to predict toxic comments
Build AI-powered semantic search applications
The library to build & auto-optimize LLM applications
Efficient few-shot learning with Sentence Transformers
The Classical Language Toolkit
Efficient Retrieval Augmentation and Generation Framework
Libraries for applying sparsification recipes to neural networks
A Heterogeneous Benchmark for Information Retrieval
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models