Full-stack AI Red Teaming platform
ChatGLM2-6B: An Open Bilingual Chat LLM
Framework for building, orchestrating, and deploying AI agents
The Open Source Cowork Desktop to Unlock Your Exceptional Productivity
Framework for building AI-powered interactive digital humans and agent
TensorRT LLM provides users with an easy-to-use Python API
Your Personal AI Assistant; easy to install, deploy on local or coud
Build voice-based LLM agents. Modular + open source
Official inference library for Mistral models
TFX is an end-to-end platform for deploying production ML pipelines
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Build high-quality LLM apps
High-performance inference framework for large language models
Performance-optimized AI inference on your GPUs
Open source RAG framework for building scalable modular AI apps
Production-grade platform for building agentic IM bots
One brain, many harnesses. Portable .agent/ folder
Agents write python code to call tools and orchestrate other agents
The Triton Inference Server provides an optimized cloud
Voice Recognition to Text Tool
Machine Learning Systems: Design and Implementation
Portia Labs Python SDK for building agentic workflows
Pruna is a model optimization framework built for developers
On the Structural Pruning of Large Language Models
Easy Docker setup for Stable Diffusion with user-friendly UI