Easy-to-use and high-performance NLP and LLM framework
Fast and customizable framework for automatic ML model creation
Semantic search and workflows for medical/scientific papers
A Heterogeneous Benchmark for Information Retrieval
The no-nonsense RAG chunking library
The library to build & auto-optimize LLM applications
Making large AI models cheaper, faster and more accessible
A tool for learning vector representations of words and entities
Data and tools for generating and inspecting OLMo pre-training data
Obsei is a low code AI powered automation tool
Efficient Retrieval Augmentation and Generation Framework
A full spaCy pipeline and models for scientific/biomedical documents
Dealing with all unstructured data, such as reverse image search
Libraries for applying sparsification recipes to neural networks
Data processing for and with foundation models
An easy-to-use LLMs quantization package with user-friendly apis
An LLM-powered knowledge curation system that researches topics
A Unified Library for Parameter-Efficient Learning
Training data (data labeling, annotation, workflow) for all data types
Large Language Model Text Generation Inference
Data loaders and abstractions for text and NLP
Efficient few-shot learning with Sentence Transformers
Hub of ready-to-use datasets for ML models
A library for deep learning end-to-end dialog systems and chatbots
Stanford CoreNLP, a Java suite of core NLP tools