SQL-Driven RAG Engine
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Retrieval and Retrieval-augmented LLMs
95% token savings. 155x faster queries. 16 languages
Chinese XLNet pre-trained model
Framework for building neural networks
Memory-efficient and performant finetuning of Mistral's models
Official python implementation of UTCP. UTCP is an open standard
A python tool that uses GPT-4, FFmpeg, and OpenCV
A Pioneering Open-Source Alternative to GPT-4o
Concatenate a directory full of files into a single prompt
Documentation for Google's Gen AI site - including Gemini API & Gemma
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
Audio foundation model excelling in audio understanding
Phi-3.5 for Mac: Locally-run Vision and Language Models
Pre-trained Deep Learning models and demos
Central interface to connect your LLM's with external data
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Open source demo platform where you can easily showcase your AI models
Real-World Centric Foundation GUI Agents
One-click deployment (including offline integration package)
Shared repository for open-sourced projects from the Google AI Lang
Fast-stable-diffusion + DreamBooth
Multimodal Diffusion with Representation Alignment
SDK for building interactive UI components over MCP for AI tools