Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Learn AI and LLMs from scratch using free resources
Deploy and share agents with open infrastructure
An MCP server that autonomously evaluates web applications
The leading agent orchestration platform for Claude
Repo of Qwen2-Audio chat & pretrained large audio language model
No-code multi-agent framework to build LLM Agents, workflows
Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant
The data structure for multimodal data
Lightning fast C++/CUDA neural network framework
Hub of ready-to-use datasets for ML models
Build cross-modal and multimodal applications on the cloud
A set of Docker images for training and serving models in TensorFlow
Integrate cutting-edge LLM technology quickly and easily into your app
GUI Exploration Lab. One of the best GUI agent solutions
The AI-powered coding wizard
The Operator Splitting QP Solver
Python 3 package for easy bypass reCAPTCHA/reCAPTCHA Mobile/hCaptcha
An advanced paper search agent powered by large language models
LLM-based Reinforcement Learning audio edit model
Defang CLI and sample projects
Local Lambda debug, CodeWhisperer, SAM/CFN syntax, etc.
Scalable machine learning for time series forecasting
Chat & pretrained large audio language model proposed by Alibaba Cloud