Pruna is a model optimization framework built for developers
State-of-the-art Parameter-Efficient Fine-Tuning
Easily compute CLIP embeddings and build a CLIP retrieval system
Fast Differentiable Tensor Library in JavaScript & TypeScript with Bun
Low-latency REST API for serving text embeddings
Lemonade helps users run local LLMs with the highest performance
Making large AI models cheaper, faster and more accessible
RL implementations
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Main repository for VisPy
Streaming Real-time Audio-Driven Avatar Generation
A Python package for extending the official PyTorch
Run Local LLMs on Any Device. Open-source
YOLOv5 is the world's most loved vision AI
3D reconstruction software
A nearly-live implementation of OpenAI's Whisper
Voice Recognition to Text Tool
The official repo of Qwen chat & pretrained large language model
Unified web UI for training and running open models locally
2D and 3D face alignment library built using PyTorch
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Generate audiobooks from e-books
Public CI, Docker images for popular JAX libraries
Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real
High-Performance Face Recognition Library on PaddlePaddle & PyTorch