High-Resolution Image Synthesis with Latent Diffusion Models
Machine learning, conversational dialog engine for creating chat bots
Lightweight package to simplify LLM API calls
Algorithms for outlier, adversarial and drift detection
Concatenate a directory full of files into a single prompt
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
A modular graph-based Retrieval-Augmented Generation (RAG) system
Provides CTP stock options and Zhongtai Securities XTP
Implementation of Make-A-Video, a new SOTA text-to-video generator
Stable Diffusion web UI
Guiding Instruction-based Image Editing via Multimodal Large Language Models
Diffusion Transformer with Fine-Grained Chinese Understanding
Sample code and notebooks for Generative AI on Google Cloud
OCR expert VLM powered by Hunyuan's native multimodal architecture
Flexible Photo Recrafting While Preserving Your Identity
Bailing is a voice dialogue robot similar to GPT-4o
Build Vision Agents quickly with any model or video provider
An open source implementation of CLIP
Qwen3-Coder is the code version of Qwen3
Toolkit for audio, music, and speech generation
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Extract schema, statistics and entities from datasets
Data loaders and abstractions for text and NLP
One-click deployment (including offline integration package)
Large Multimodal Models for Video Understanding and Editing