AI-powered code generation tool for scratch development of web apps
Adds support for Yandex Smart Home (Alice voice assistant)
Bash is all you need, write a claude code with only 16 line code
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Industrial-level controllable zero-shot text-to-speech system
Sharp Monocular Metric Depth in Less Than a Second
An AI-powered security review GitHub Action using Claude
ContextGem: Effortless LLM extraction from documents
Benchmarking Multimodal Agents for Open-Ended Tasks
[NeurIPS 2023 Spotlight] LightZero
A game theoretic approach to explain the output of ml models
High-performance library for gradient boosting on decision trees
AnyTool: Universal Tool-Use Layer for AI Agents
Motion-controllable Video Generation via Latent Trajectory Guidance
Virtual AI anchor that combines state-of-the-art technology
Witness the aha moment of VLM with less than $3
Open source platform for the machine learning lifecycle
An open source implementation of CLIP
Self-learning data agent that grounds its answers in layers of content
Diffusion Bee is the easiest way to run Stable Diffusion locally
Specification and documentation for Agent Skills
Build Vision Agents quickly with any model or video provider
Supercharge Your LLM with the Fastest KV Cache Layer
The official Meta Llama 3 GitHub site
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming