Multi-source content processor for NotebookLM
Agent S: an open agentic framework that uses computers like a human
A sound cloning tool with a web interface, using your voice
Stanford NLP Python library for many human languages
LLM Large Model of Selling Anchor
Visual intelligence for your home.
A neural network that transforms a design mock-up into static websites
Minimal reproduction of OneRec
Synthesizing and manipulating 2048x1024 images with conditional GANs
Long-term memory OS for AI with structured recall and context awarenes
Sandbox for training deep learning networks
Qwen3-ASR is an open-source series of ASR models
E2M converts various file types (doc, docx, epub, html, htm, url
Code and models for ICML 2024 paper, NExT-GPT
Reading book source
Stable Diffusion with Core ML on Apple Silicon
✨:AI-Powered Piano Audio to MIDI Converter 🎶
Unlimited, private and free Speech-To-Text program
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
ML based QSAR Modelling And Translation of Model to Deployable WebApps
Convert an image to text to spot intelligible words.
Img2Txt - Extract Text From Images using AI
Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]
A Python library for turning text quotes into graphical images
Point cloud diffusion for 3D model synthesis