A gallery that showcases on-device ML/GenAI use cases
A multimodal model for brain response prediction
Audiocraft is a library for audio processing and generation
End-to-end speech processing toolkit
Faster Whisper transcription with CTranslate2
Lightning-fast, on-device TTS, running natively via ONNX
Use Microsoft Edge's online text-to-speech service from Python
Public opinion analysis system
Stable Diffusion web UI
Pretrained model hub for Keras 3
Open source no-code system for text annotation and building of text
Voice-recognition-to-text tool
AI that sees your screen and listens to conversations
Deep Research framework, combining language models with tools
Fast and customizable framework for automatic ML model creation
Chinese XLNet pre-trained model
Official Vectorize MCP Server
Open speech-to-speech models and pipelines by Hugging Face
Framework for building realtime multimodal voice AI agents
TextWorld is a sandbox learning environment for the training
Tools for working with unstructured data, such as reverse image search
Use LLMs and LLM Vision (OCR) to handle paperless-ngx
Document content and metadata extraction microservice
The free, open-source alternative to OpenAI, Claude, and others
Bidirectional token-classification model for identifiable info