Retrieval Augmented Generation (RAG) framework
Plug-n-play module turning text-to-image models into animation
Mice speech to text with MX Cinnamon OS ISO
High-Resolution Image Synthesis with Latent Diffusion Models
A Python application to add watermarks (text or image) to PDF files
A Conversational Speech Generation Model
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Toolkit for audio, music, and speech generation
mice stt tts
VITS2 backbone with multilingual-bert
Overcoming Data Limitations for High-Quality Video Diffusion Models
A graphical manager for ollama that can manage your LLMs
AI-powered semantic indexing: automating the creation of book indexes
Guiding Instruction-based Image Editing via Multimodal Large Language
Tool-integrated Reasoning LLM Agents
Code for the paper Language Models are Unsupervised Multitask Learners
minimal suckless android web browser with unlimited power
A library for transfer learning by reusing parts of TensorFlow models
Obsei is a low code AI powered automation tool
AIlice is a fully autonomous, general-purpose AI agent
A tool for learning vector representations of words and entities