Document (PDF, Word, PPTX ...) extraction and parse API
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Easy Docker setup for Stable Diffusion with user-friendly UI
UI-TARS-desktop version that can operate on your local personal device
A python library that makes AMR parsing, generation and visualization
Implementation of DeepLabCut
Open-source platform for building AI agents and serverless automation
Instead of distilling others, it is better to distil yourself
SGLang is a fast serving framework for large language models
A Python package for extending the official PyTorch
AGiXT is a dynamic AI Automation Platform
Operating LLMs in production
Revolutionizing Database Interactions with Private LLM Technology
Library to facilitate federated learning research
The library to build & auto-optimize LLM applications
Open-source AI marketing skills for Claude Code
GEO-first SEO skill for Claude Code
A modular Agentic RAG built with LangGraph
Open-sourced unified customization model
A Powerful Native Multimodal Model for Image Generation
Generating Immersive, Explorable, and Interactive 3D Worlds
A Python library for audio
State-of-the-art diffusion models for image and audio generation
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
TTS model capable of streaming conversational audio in realtime