A Heterogeneous Benchmark for Information Retrieval
The no-nonsense RAG chunking library
The library to build & auto-optimize LLM applications
Making large AI models cheaper, faster and more accessible
A coding-free framework built on PyTorch
A tool for learning vector representations of words and entities
Data and tools for generating and inspecting OLMo pre-training data
Obsei is a low-code, AI-powered automation tool
Efficient Retrieval Augmentation and Generation Framework
A full spaCy pipeline and models for scientific/biomedical documents
Handling all kinds of unstructured data, for tasks such as reverse image search
Libraries for applying sparsification recipes to neural networks
Data processing for and with foundation models
An easy-to-use LLM quantization package with user-friendly APIs
An LLM-powered knowledge curation system that researches topics
A Unified Library for Parameter-Efficient Learning
Large Language Model Text Generation Inference
Efficient few-shot learning with Sentence Transformers
Training data (data labeling, annotation, workflow) for all data types
Data loaders and abstractions for text and NLP
Hub of ready-to-use datasets for ML models
Transformers4Rec is a flexible and efficient library
Easy-to-use and powerful NLP library with an awesome model zoo
Build AI-powered semantic search applications
Recognition and resolution of numbers, units, date/time, etc.