Qwen2.5-VL is the multimodal large language model series
Open source NLP guide with models, methods, and real use cases
Using AI models to automatically provide commentary and edit videos
A list of free LLM inference resources accessible via API
Open-source multi-speaker long-form text-to-speech model
Supercharge Your LLM with the Fastest KV Cache Layer
toot - Mastodon CLI & TUI
go1pylib is a Python library designed to control the Go1 robot
Turn words into chords
Open Source Speech Language Model
Multimodal embedding and reranking models built on Qwen3-VL
Multimodal-Driven Architecture for Customized Video Generation
A theme for Sublime Text 3 by Mattia Astorino
AutoGluon: AutoML for Image, Text, and Tabular Data
Build Vision Agents quickly with any model or video provider
The best free open source website change detection and restock service
An open source implementation of CLIP
GenAI Processors is a lightweight Python library
Main repository for the Sphinx documentation builder
OCR model for complex documents with layout-aware structured outputs
Long-form streaming TTS system for multi-speaker dialogue generation
Foundation model for image generation
Cloud-native open source data warehouse for analytics and AI queries
Automated translation solution for visual novels
A speech-text foundation model for real time dialogue