Tools such as a web browser, computer access, and a code runner for LLMs
Python bindings for MuPDF's rendering library.
A library to help you make the most out of your Pixoo 64
Label Studio is a multi-type data labeling and annotation tool
Qwen-Image is a powerful image generation foundation model
Underthesea - Vietnamese NLP Toolkit
An open-source toolkit for monitoring Large Language Models (LLMs)
Video-based AI memory library. Store millions of text chunks in MP4
Network analysis in Python
Qwen3-omni is a natively end-to-end, omni-modal LLM
JavaScript parser and stringifier for YAML
Knowledge Agents and Management in the Cloud
Dataset of GPT-2 outputs for research in detection, biases, and more
A nearly-live implementation of OpenAI's Whisper
Main repository for the Sphinx documentation builder
Capable of understanding text, audio, vision, video
Toolkit for conversational AI
Controllable and fast Text-to-Speech for over 7000 languages
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
JupyterLab computational environment
JavaScript Canvas Library and SVG-to-Canvas Parser
A Sublime Text 2/3 plugin to show the Git diff in the gutter
SOTA Open Source TTS
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Speech-AI-Forge is a project developed around TTS generation models