A course of learning LLM inference serving on Apple Silicon
A fast TTS architecture with conditional flow matching
Code for running inference with the SAM 3D Body Model 3DB
Sharp Monocular Metric Depth in Less Than a Second
A lightweight, powerful framework for multi-agent workflows
Make websites accessible for AI agents
The music player of today
State-of-the-art diffusion models for image and audio generation
CTFs as you need them
AI Toolkit for Healthcare Imaging
Active development of the Azure SDK for Python
Open source annotation tool for machine learning practitioners
Background Remover lets you Remove Background from images and video
Generate high-definition story short videos with one click using AI
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Speech-AI-Forge is a project developed around TTS generation model
Converts text to speech in realtime
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Agent framework and applications built upon Qwen>=3.0
Windows GUI Automation with Python (based on text properties)
Full stack AI software engineer
AI based photo editing website for changing image background
A robust, efficient, low-latency speech-to-text library
Prefect is a workflow orchestration framework
A full spaCy pipeline and models for scientific/biomedical documents