LLM abstractions that aren't obstructions
User toolkit for analyzing and interfacing with Large Language Models
Chinese and English multimodal conversational language model
Tensor search for humans
Unleashing 10,000+ Word Generation from Long Context LLMs
Qwen3 is the large language model series developed by the Qwen team
The official repo of Qwen chat & pretrained large language models
Using AI models to automatically provide commentary and edit videos
A New Axis of Sparsity for Large Language Models
Simple, Pythonic building blocks to evaluate LLM applications
Extension of Google Research’s PaperBanana
SQL-Driven RAG Engine
Data Infrastructure providing an approach to multimodal AI workloads
Build multimodal language agents for fast prototyping and production
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Retrieval and Retrieval-augmented LLMs
95% token savings. 155x faster queries. 16 languages
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Low-latency REST API for serving text embeddings
Inference code for CodeLlama models
Open source libraries and APIs to build custom preprocessing pipelines
Open source demo platform where you can easily showcase your AI models
A Pioneering Open-Source Alternative to GPT-4o
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Central interface to connect your LLMs with external data