Unleashing 10,000+ Word Generation from Long Context LLMs
Autoregressive Model Beats Diffusion
An agentless approach to automatically solve software development
Empowering Code Generation with OSS-Instruct
Neural Network architecture based on ideas of the original LSTM
A simple, performant and scalable Jax LLM
A lightweight framework for building LLM-based agents
TigerBot: A multi-language multi-task LLM
SDG is a specialized framework
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
The Security Toolkit for LLM Interactions
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Document (PDF, Word, PPTX ...) extraction and parse API
Implementation for MatMul-free LM
Leaderboard Comparing LLM Performance at Producing Hallucinations
Skywork-R1V is an advanced multimodal AI model series
A new open-source framework to build and deploy intelligent agents
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
DepGraph: Towards Any Structural Pruning
A dataset consists of 15,140 ChatGPT prompts from Reddit
Code and models for ICML 2024 paper, NExT-GPT
High-performance Inference and Deployment Toolkit for LLMs and VLMs
Examples and tutorials to help developers build AI systems
CV, NLP, LLM project applications, and advanced engineering deployment
Instruction-tuning LLM with Chinese Medical Knowledge