3D reconstruction software
Online machine learning in Python
Reverse-engineered Python API for Google Gemini web app
Open-source Video Translation Skill
AirLLM 70B inference with a single 4GB GPU
Generate audiobooks from EPUBs, PDFs and text with captions
Supercharge Your LLM with the Fastest KV Cache Layer
A high performance implementation of HDBSCAN clustering
Open Agent Harness with a built-in personal agent, Ohmo
LLM-based autonomous agent that does comprehensive online research
Apple Silicon (MLX) port of Karpathy's autoresearch
Sacred is a tool to help you configure and organize experiments, developed at IDSIA
Provider-agnostic, open-source evaluation infrastructure
Python SDK for Claude Agent
A New Axis of Sparsity for Large Language Models
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
RGBD video generation model conditioned on camera input
Unifying 3D Mesh Generation with Language Models
The common language for platforms, agents and businesses
Diversity-driven optimization and large-model reasoning ability
High-Fidelity and Controllable Generation of Textured 3D Assets
State-of-the-art (SoTA) text-to-video pre-trained model
Implementation of 'lightweight' GAN, proposed in ICLR 2021
An advanced paper search agent powered by large language models
AI-powered tool to quickly remove watermarks from videos and photos