PPTAgent: Generating and Evaluating Presentations
The most powerful Android RPA agent framework
Official implementation of Watermark Anything with Localized Messages
Python Stream Processing
PyTorch version of Stable Baselines
An open source implementation of CLIP
Unified Model Serving Framework
Reverse-engineered Python API for Google Gemini web app
Advanced techniques for RAG systems
Inference code for CodeLlama models
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Hackable and optimized Transformers building blocks
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Framework for orchestrating role-playing, autonomous AI agents
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Asynchronous multi-platform robot framework written in Python
Open deep learning compiler stack for cpu, gpu, etc.
Pokee Deep Research Model Open Source Repo
Gemma open-weight LLM library, from Google DeepMind
Enable AI to control your desktop, mobile and HMI devices
Provides convenient access to the Anthropic REST API from any Python 3
GPT4V-level open-source multi-modal model based on Llama3-8B
kaldi-asr/kaldi is the official location of the Kaldi project
A Python library for audio