Train a 26M-parameter GPT from scratch in just 2h
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Data and tools for generating and inspecting OLMo pre-training data
Integrating LLMs into structured NLP pipelines
Model Context Protocol tool support for LangChain
Simple, Pythonic building blocks to evaluate LLM applications
Visual Instruction Tuning: Large Language-and-Vision Assistant
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Pretrained (Language) Models for Probabilistic Time Series Forecasting
A modular graph-based Retrieval-Augmented Generation (RAG) system
Guiding Instruction-based Image Editing via Multimodal Large Language
Code for the paper Language Models are Unsupervised Multitask Learners
Browse the web, directly from Cursor etc.
Witness the aha moment of VLM with less than $3
Evaluation suite designed to assess the performance of LLMs
Machine learning, conversational dialog engine for creating chat bots
gpt-oss-120b and gpt-oss-20b are two open-weight language models
DeepSeek Coder: Let the Code Write Itself
World of apps for benchmarking interactive coding agent
CogView4, CogView3-Plus and CogView3(ECCV 2024)
The official gpt4free repository
An elegent pytorch implement of transformers
Data loaders and abstractions for text and NLP
Obsei is a low code AI powered automation tool