Self-evolving AI agent framework for automated workflows
Multimodal embedding and reranking models built on Qwen3-VL
"Big Model" trains a visual multimodal VLM with 26M parameters
End-to-end speech processing toolkit
Bringing BERT into modernity via both architecture changes and scaling
Make your agents learn from experience
Neural Network architecture based on ideas of the original LSTM
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
Implementation for MatMul-free LM
High-performance Inference and Deployment Toolkit for LLMs and VLMs
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Chinese XLNet pre-trained model
Implementation of Vision Transformer, a simple way to achieve SOTA
The best ChatGPT that $100 can buy
A personal context-agent that learns how you work
Gracefully face hCaptcha challenge with multimodal llms
Learn to build your Second Brain AI assistant with LLMs
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Language modeling in a sentence representation space
Implementation of Make-A-Video, new SOTA text to video generator
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
Python example app from the OpenAI API quickstart tutorial
Deep and online learning with spiking neural networks in Python
fast C++ library for linear algebra & scientific computing
Open Multilingual Multimodal Chat LMs