Stable Diffusion built into Blender
Multilingual sentence & image embeddings with BERT
Repo of Qwen2-Audio chat & pretrained large audio language model
A tool for learning vector representations of words and entities
Dealing with all unstructured data, such as reverse image search
A single Gradio + React WebUI with extensions for ACE-Step
Qwen2.5-VL is the multimodal large language model series
Flexible Photo Recrafting While Preserving Your Identity
Implementation of "MobileCLIP" CVPR 2024
Chinese and English multimodal conversational language model
Tensor search for humans
Python package for AutoML on Tabular Data with Feature Engineering
SOTA discrete acoustic codec models with 40/75 tokens per second
Unified Multimodal Understanding and Generation Models
The official PyTorch implementation of Google's Gemma models
The official repo of Qwen chat & pretrained large language model
Aider is AI pair programming in your terminal
ktrain is a Python library that makes deep learning and AI more accessible
Official code for Style Aligned Image Generation via Shared Attention
Supercharge Your LLM with the Fastest KV Cache Layer
Memory-efficient and performant finetuning of Mistral's models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
The open-source data curation platform for LLMs
Official Python implementation of UTCP. UTCP is an open standard
Solve end-to-end problems using the Llama model family