Data and tools for generating and inspecting OLMo pre-training data
Persian NLP Toolkit
The no-nonsense RAG chunking library
Large Language Model Text Generation Inference
Han Language Processing
Stanford NLP Python library for many human languages
OpenAI-style API for open large language models
ExtractThinker is a Document Intelligence library for LLMs
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
A tool for learning vector representations of words and entities
Obsei is a low-code, AI-powered automation tool
Fast and customizable framework for automatic ML model creation
Efficient Retrieval Augmentation and Generation Framework
A Heterogeneous Benchmark for Information Retrieval
Libraries for applying sparsification recipes to neural networks
A coding-free framework built on PyTorch
Neural Network Compression Framework for enhanced OpenVINO
A Unified Library for Parameter-Efficient Learning
Data loaders and abstractions for text and NLP
Transformers4Rec is a flexible and efficient library
Easy-to-use and powerful NLP library with an awesome model zoo
Training data (data labeling, annotation, workflow) for all data types
Making large AI models cheaper, faster and more accessible
Hub of ready-to-use datasets for ML models
Build AI-powered semantic search applications