Chinese and English multimodal conversational language model
GPT-4V-level open-source multimodal model based on Llama3-8B
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
A Customizable Image-to-Video Model based on HunyuanVideo
Chat & pretrained large audio language model proposed by Alibaba Cloud
Building a Secure and Interoperable Future for AI-Driven Payments
Bolt is a high-performance deep learning library
Flexible and powerful framework for managing multiple AI agents
A code-first agent framework for seamlessly planning analytics tasks
ESP32 Camera motion capture application to record JPEGs to SD card
Designed to facilitate the deployment of multiple LLM-based agents
An RWKV management and startup tool: fully automated, only 8MB
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
High-performance Twitch bot in Rust
A Discord music bot that's easy to set up and run yourself
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Easy-to-use Speech Toolkit including Self-Supervised Learning model
NLP Cloud serves high-performance pre-trained or custom models for NER
Implementation of Video Diffusion Models
Implementation of Phenaki Video, which uses MaskGIT
Benchmarking synthetic data generation methods
(Golang) Go bindings for Discord
AIMET is a library that provides advanced quantization and compression