SGLang is a fast serving framework for large language models
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Fast State-of-the-Art Static Embeddings
Easiest and laziest way for building multi-agent LLMs applications
A Pythonic framework to simplify AI service building
Operating LLMs in production
A long-running autonomous coding agent powered by the Claude Agent
Meta Agents Research Environments is a comprehensive platform
Pruna is a model optimization framework built for developers
PyTorch library of curated Transformer models and their components
Code to accompany "A Method for Animating Children's Drawings"
Unleashing 10,000+ Word Generation from Long Context LLMs
Run PyTorch LLMs locally on servers, desktop and mobile
Terminal-based LLM chat tool with multi-model and local support
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
AI tool for detecting complex vulnerabilities in Python codebases
A.S.E (AICGSecEval) is a repository-level AI-generated code security
On the Structural Pruning of Large Language Models
Traditional Mandarin LLMs for Taiwan
Accelerate local LLM inference and finetuning
Private chat with local GPT with document, images, video, etc.
Speech-AI-Forge is a project developed around TTS generation model
GPT4V-level open-source multi-modal model based on Llama3-8B
Open-source AI hackers to find and fix your app’s vulnerabilities