State-of-the-art diffusion models for image and audio generation
Just 1 minute of voice data can be used to train a good TTS model
Powerful tool that lets you create and run intelligent agents
Framework for building real-time multimodal voice AI agent apps
A lightweight audio-to-MIDI converter with pitch bend detection
Framework for Telegram Bot API written in Python 3.7 with asyncio
Open Source Document Management System for Digital Archives
A theoretical reconstruction of the Claude Mythos architecture
Code for running inference and finetuning with the SAM 3 model
Lemonade helps users run local LLMs with the highest performance
The simplest, fastest repository for training/finetuning models
Simplest working implementation of StyleGAN2
Flowly is 100x faster than OpenClaw
AI assistant based on large models that can actively think and plan
Python package for AutoML on Tabular Data with Feature Engineering
A single-file tkinter-based Ollama GUI project
Datasets, transforms and models specific to Computer Vision
GitLab automatic code review tool based on large models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
AI Agent Networks for Open Collaboration
Multi-lingual large voice generation model, providing inference
Agent Skill for generating 2D sprite sheets and maps as transparent PNGs
Qwen3-TTS is an open-source series of TTS models
InvokeAI is a leading creative engine for Stable Diffusion models
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD