General technology for enabling AI capabilities with LLMs and MLLMs
Analyzing Hacker News discussions from a decade ago in hindsight
A New Axis of Sparsity for Large Language Models
A large-scale model of medical consultation in Chinese
A.S.E (AICGSecEval) is a repository-level AI-generated code security
MemoryOS is designed to provide a memory operating system
Driving with Graph Visual Question Answering
Learning to Reason with Search for LLMs via Reinforcement Learning
Traditional Mandarin LLMs for Taiwan
Recipes to train reward models for RLHF
A tension reasoning engine over 131 S-class problems
An agentless approach to automatically solve software development
A simple, performant and scalable JAX LLM
LISA: Reasoning Segmentation via Large Language Model
Implementation for MatMul-free LM
Skywork-R1V is an advanced multimodal AI model series
Code and models for the ICML 2024 paper NExT-GPT
Examples and tutorials to help developers build AI systems
Robust recipes to align language models with human and AI preferences
An Open-source Framework for Data-centric Language Agents
Open Source Deep Research Alternative to Reason and Search
Set of tools to assess and improve LLM security
LLM-based automatic question answering for local knowledge bases
A security scanner for custom LLM applications
Unifying 3D Mesh Generation with Language Models