High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Web interface for generating images using Stable Diffusion models
Chat & pretrained large audio language model proposed by Alibaba Cloud
The Inter font family
Professional collaborative platform for embedded development
Flet enables developers to easily build realtime web and mobile apps
Capable of understanding text, audio, vision, video
Repo of Qwen2-Audio chat & pretrained large audio language model
An easy-to-use backup tool for GNU Linux using rsync in the back
Designed for text embedding and ranking tasks
Python tool for converting files and office documents to Markdown
Chat & pretrained large vision language model
State-of-the-art diffusion models for image and audio generation
Music player and music library manager for Linux, Windows, and macOS
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Central interface to connect your LLM's with external data
High accuracy RAG for answering questions from scientific documents
Python bindings for MuPDF's rendering library.
Video-based AI memory library. Store millions of text chunks in MP4
Implementation of Make-A-Video, new SOTA text to video generator
Ark pixel font - Open source Pan-CJK pixel font
MTEB: Massive Text Embedding Benchmark
Qwen3-omni is a natively end-to-end, omni-modal LLM
State-of-the-art TTS model under 25MB
Easily share data across your company via SQL queries