AI video generator optimized for low VRAM and older GPUs
Windows GUI Automation with Python (based on text properties)
ImageBind: One Embedding Space to Bind Them All
Diffusion Transformer with Fine-Grained Chinese Understanding
Qwen3-Omni is a natively end-to-end, omni-modal LLM
High-Resolution Image Synthesis with Latent Diffusion Models
Edit PDF files with Nano Banana
Code for running inference and finetuning with SAM 3 model
An open source implementation of CLIP
AutoGluon: AutoML for Image, Text, and Tabular Data
Director, Screenwriter, Producer, and Video Generator All-in-One
A simple tool for reading in poorly redacted documents
Chinese and English multimodal conversational language model
Tensor search for humans
NLP Cloud serves high performance pre-trained or custom models for NER
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
21 Lessons, Get Started Building with Generative AI
Integrate ChatGPT into your own Discord bot
Accurate × Fast × Comprehensive
Deep Research framework, combining language models with tools
An Open Source text-to-speech system built by inverting Whisper
Multilingual sentence & image embeddings with BERT
ComfyUI wrapper nodes for WanVideo and related models
CLI tool to extract (meta)data from PDF and manipulate PDF files
High-Resolution 3D Assets Generation with Large Scale Diffusion Models