AI multi-agent framework for automating data-driven R&D workflows
Faster and easier training and deployments
Designed for training LLM/VLM agents via RL
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Document (PDF, Word, PPTX ...) extraction and parsing API
High-performance inference framework for large language models
Long-form streaming TTS system for multi-speaker dialogue generation
Open-Source Financial Large Language Models
Fast and Universal 3D reconstruction model for versatile tasks
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
ICLR 2024 Spotlight: curation/training code, metadata, distribution
A PyTorch library for implementing flow matching algorithms
One-click local MCP server installation in desktop apps
Memory-efficient and performant finetuning of Mistral's models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Diffusion Transformer with Fine-Grained Chinese Understanding
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
An industrial-grade federated learning framework
Google's NotebookLM, but local
Korvus is a search SDK that unifies the entire RAG pipeline
Repo of Qwen2-Audio chat & pretrained large audio language model
Plug-and-play library to enable agents to call MCP and UTCP tools
Experimental, AI/ML-powered, open-source Marketing Mix Modeling