Central interface to connect your LLMs with external data
Test Suites for validating ML models & data
A Telegram bot for Large Language Models
ComfyUI wrapper nodes for HunyuanVideo
AI-Researcher: Autonomous Scientific Innovation
Tiny vision language model
Free, high-quality text-to-speech API endpoint to replace OpenAI
Real-World Centric Foundation GUI Agents
Handwritten Text Recognition (HTR) system implemented with TensorFlow
A powerful tool for automated LLM fuzzing
MoBA: Mixture of Block Attention for Long-Context LLMs
Multimodal-Driven Architecture for Customized Video Generation
User toolkit for analyzing and interfacing with Large Language Models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Turn your website into a GIF
A Universal Customization Method for Single and Multi Conditioning
Repo for the Qwen2-Audio chat & pretrained large audio language models
The Agent-User Interaction Protocol
A Personalized LLM-powered Agent Framework
Deep-learning-driven jazz generation using Keras & Theano
Designed for training LLM/VLM agents via RL
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
95% token savings. 155x faster queries. 16 languages
4M: Massively Multimodal Masked Modeling
PyTorch extensions for fast R&D prototyping and Kaggle farming