WikiChat is an improved RAG
ReFT: Representation Finetuning for Language Models
Large Language Model Text Generation Inference
Han Language Processing
OpenAI-style API for open large language models
Stanford NLP Python library for many human languages
ExtractThinker is a Document Intelligence library for LLMs
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
A tool for learning vector representations of words and entities
Obsei is a low-code, AI-powered automation tool
Fast and customizable framework for automatic ML model creation
Semantic search and workflows for medical/scientific papers
Efficient Retrieval Augmentation and Generation Framework
A Heterogeneous Benchmark for Information Retrieval
A full spaCy pipeline and models for scientific/biomedical documents
Libraries for applying sparsification recipes to neural networks
The library to build & auto-optimize LLM applications
Data processing for and with foundation models
Extract schema, statistics and entities from datasets
A coding-free framework built on PyTorch
Pretrained model hub for Keras 3
Neural Network Compression Framework for enhanced OpenVINO
Efficient few-shot learning with Sentence Transformers
A Unified Library for Parameter-Efficient Learning
Data loaders and abstractions for text and NLP