Official inference repo for FLUX.1 models
TokenSpeed is a speed-of-light LLM inference engine
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
A modular graph-based Retrieval-Augmented Generation (RAG) system
Framework and no-code GUI for fine-tuning LLMs
Skywork-R1V is an advanced multimodal AI model series
An Open-source Framework for Data-centric Language Agents
The official Meta Llama 3 GitHub site
Chat with it via text and voice
Chinese and English multimodal conversational language model
Open source libraries and APIs to build custom preprocessing pipelines
Agentic, Reasoning, and Coding (ARC) foundation models
Browse the web, directly from Cursor etc.
Witness the aha moment of VLM with less than $3
Evaluation suite designed to assess the performance of LLMs
Revolutionizing Database Interactions with Private LLM Technology
Official inference framework for 1-bit LLMs
Synthetic data curation for post-training and data extraction
Z80-μLM is a 2-bit quantized language model
The official gpt4free repository
⚡ Building applications with LLMs through composability ⚡
Learning to Reason with Search for LLMs via Reinforcement Learning
Enhances Tesseract OCR output using LLMs (local or API)
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Composable building blocks to build Llama Apps