Open source NLP guide with models, methods, and real use cases
A Python tool to help extracting information from structured PDFs
Qwen3-omni is a natively end-to-end, omni-modal LLM
Open Source Document Management System for Digital Archives
toot - Mastodon CLI & TUI
Implementation of Video Diffusion Models
Open Source Speech Language Model
Multimodal embedding and reranking models built on Qwen3-VL
Multimodal-Driven Architecture for Customized Video Generation
A theme for Sublime Text 3 by Mattia Astorino
GenAI Processors is a lightweight Python library
Extract schema, statistics and entities from datasets
lightweight package to simplify LLM API calls
AutoGluon: AutoML for Image, Text, and Tabular Data
Multilingual sentence & image embeddings with BERT
Build Vision Agents quickly with any model or video provider
Generate audiobooks from e-books
Supercharge Your LLM with the Fastest KV Cache Layer
The best free open source website change detection and restock service
Using AI models to automatically provide commentary and edit videos
Qwen2.5-VL is the multimodal large language model series
An open source implementation of CLIP
Flowly is 100x faster than OpenClaw
Long-form streaming TTS system for multi-speaker dialogue generation
Foundation model for image generation