Hunyuan Translation Model Version 1.5
Block Diffusion for Ultra-Fast Speculative Decoding
High-resolution models for human tasks
ChatGPT interface with better UI
Genome modeling and design across all domains of life
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
This repository contains the official implementation of FastVLM
FAIR Sequence Modeling Toolkit 2
ICLR2024 Spotlight: curation/training code, metadata, distribution
Diffusion Transformer with Fine-Grained Chinese Understanding
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Controllable & emotion-expressive zero-shot TTS
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Unified Multimodal Understanding and Generation Models
Language modeling in a sentence representation space
Large Multimodal Models for Video Understanding and Editing
Towards Real-World Vision-Language Understanding
LLM-based Reinforcement Learning audio edit model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Chat & pretrained large vision language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Pushing the Limits of Mathematical Reasoning in Open Language Models
The ChatGPT Retrieval Plugin lets you easily find personal documents
Open-source, high-performance Mixture-of-Experts large language model
Open Multilingual Multimodal Chat LMs