Central interface to connect your LLMs with external data
21-lesson course to learn and build with generative AI
State-of-the-art diffusion models for image and audio generation
A robust, efficient, low-latency speech-to-text library
Model Context Protocol Server for Apache OpenDAL™
Code for the paper Language Models are Unsupervised Multitask Learners
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
A Powerful Native Multimodal Model for Image Generation
Implementation of Video Diffusion Models
Diffusion Transformer with Fine-Grained Chinese Understanding
The most reliable AI agent framework that supports MCP
Adding guardrails to large language models
Multimodal Diffusion with Representation Alignment
Aider is AI pair programming in your terminal
A generative speech model for daily dialogue
A framework to enable multimodal models to operate a computer
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
A Unified Framework for Text-to-3D and Image-to-3D Generation
Obsei is a low-code, AI-powered automation tool
CLIP + FFT/DWT/RGB = text to image/video
Multimodal-Driven Architecture for Customized Video Generation
Toolkit for conversational AI
TextWorld is a sandbox learning environment for the training
Machine learning, conversational dialog engine for creating chat bots
Official Python implementation of UTCP. UTCP is an open standard