Tools like web browser, computer access and code runner for LLMs
Time-lapse Video Generation Models as Metamorphic Simulators
Deploy and share agents with open infrastructure
Real-time voice interactive digital human
On-device Speech-to-Intent engine powered by deep learning
Follow along with my AI Agents Masterclass videos
Document Image Parsing via Heterogeneous Anchor Prompting”
VITS2 backbone with multilingual-bert
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Portia Labs Python SDK for building agentic workflows
Habit Tracker for the AI Coding Workshop
An undetectable, powerful, flexible, high-performance Python library
SWE-agent takes a GitHub issue and tries to automatically fix it
A high-performance ML model serving framework, offers dynamic batching
Open platform for training, serving, and evaluating language models
A TTS that fits in your CPU (and pocket)
StreamSpeech is a seamless model for offline speech recognition
Streamline your ML workflow
Language modeling in a sentence representation space
Code for Language models can explain neurons in language models paper
Spatiotemporal Signal Processing with Neural Machine Learning Models
A minimal yet professional single agent demo project
Repo of Qwen2-Audio chat & pretrained large audio language model
Capable of understanding text, audio, vision, video
Open-source MCP server that gives your coding agent