CogView4, CogView3-Plus and CogView3(ECCV 2024)
Multilingual sentence & image embeddings with BERT
Guiding Instruction-based Image Editing via Multimodal Large Language
ImageBind One Embedding Space to Bind Them All
Towards Real-World Vision-Language Understanding
An open source implementation of CLIP
Extract schema, statistics and entities from datasets
Model Context Protocol Server for Apache OpenDAL™
Fast and customizable framework for automatic ML model creation
Generate 3D objects conditioned on text or images
textgen, Text Generation models
AutoGluon: AutoML for Image, Text, and Tabular Data
Supercharge Your LLM with the Fastest KV Cache Layer
Pretrained model hub for Keras 3
A tool for learning vector representations of words and entities
Dealing with all unstructured data, such as reverse image search
Qwen2.5-VL is the multimodal large language model series
Aider is AI pair programming in your terminal
Chinese and English multimodal conversational language model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Tensor search for humans
Implementation of "MobileCLIP" CVPR 2024
The official repo of Qwen chat & pretrained large language model
Python package for AutoML on Tabular Data with Feature Engineering
Repo of Qwen2-Audio chat & pretrained large audio language model