Clone a voice in 5 seconds to generate arbitrary speech in real time
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
HexStrike AI MCP Agents is an advanced MCP server
Video understanding codebase from FAIR for reproducing video models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Finding the Scaling Law of Agents. A multi-agent framework
Stable Diffusion built into Blender
Lightweight package to simplify LLM API calls
Automatically translates the text of a video based on a subtitle file
A natural language interface for computers
Skills Catalog for Codex
StreamSpeech is a seamless model for offline speech recognition
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
Deep universal probabilistic programming with Python and PyTorch
Open source libraries and APIs to build custom preprocessing pipelines
Automate native Android apps with AI using accessibility APIs
Unified Multimodal Understanding and Generation Models
Provides convenient access to the Anthropic REST API from any Python 3
Inference framework for 1-bit LLMs
Full stack AI software engineer
Fully automatic censorship removal for language models
Public repository for Agent Skills
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Free, open source crypto trading bot