Dealing with all unstructured data, such as reverse image search
Parse files for optimal RAG
Official Python inference and LoRA trainer package
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Python project template generator with batteries included
Fast-stable-diffusion + DreamBooth
Implementation of "MobileCLIP" CVPR 2024
Multimodal embedding and reranking models built on Qwen3-VL
Free, high-quality text-to-speech API endpoint to replace OpenAI
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
21 Lessons, Get Started Building with Generative AI
Deep Research framework, combining language models with tools
A Systematic Framework for Interactive World Modeling
Unified Multimodal Understanding and Generation Models
Open-Sora: Democratizing Efficient Video Production for All
Open source personal AI Assistant for Linux, Windows and Mac
Open source libraries and APIs to build custom preprocessing pipelines
The data structure for multimodal data
Large-language-model & vision-language-model based on Linear Attention
Generate Any 3D Scene in Seconds
Pretrained model hub for Keras 3
GenAI Processors is a lightweight Python library
Powerful open source team chat application
Extract one time password (OTP) secrets from QR codes
Official implementation of DreamCraft3D