Best practices on recommendation systems
GitLab automatic code review tool based on large models
This repo contains the code for 1D tokenizer and generator
A Unified Framework for Image Customization
Flexible Photo Recrafting While Preserving Your Identity
Multi-Agent daTa geneRation Infra and eXperimentation framework
Scalable generative AI framework built for researchers and developers
Interface for OuteTTS models
MARS5 speech model (TTS) from CAMB.AI
Plug-and-play library to enable agents to call MCP and UTCP tools
Diversity-driven optimization and large-model reasoning ability
This repository provides an advanced RAG
An MCP server that autonomously evaluates web applications
A solution to build and deploy MCP agents and applications
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
The data structure for multimodal data
Open source framework for deep learning satellite and aerial imagery
Fast image augmentation library and an easy-to-use wrapper
Build cross-modal and multimodal applications on the cloud
Python binding to the Apache Tika™ REST services
A library for deep learning end-to-end dialog systems and chatbots
Deep learning library
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA