VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Qlib is an AI-oriented quantitative investment platform
WikiChat is an improved RAG
An open-source RAG-based tool for chatting with your documents
Implementation of Vision Transformer, a simple way to achieve SOTA
Implementation of DeepLabCut
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster
Harness LLMs with Multi-Agent Programming
PaddlePaddle End-to-End Development Toolkit
Code for Cicero, an AI agent that plays the game of Diplomacy
A Universal Customization Method for Single and Multi Conditioning
A text-to-speech, speech-to-text and speech-to-speech library
ChatGPT interface with better UI
Enable AI to control your desktop, mobile and HMI devices
Build effective agents using Model Context Protocol
Semantic search and workflows for medical/scientific papers
A Heterogeneous Benchmark for Information Retrieval
Tools like web browser, computer access and code runner for LLMs
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Agent S: an open agentic framework that uses computers like a human
Build portable, production-ready MLOps pipelines
SAPIEN Manipulation Skill Framework
AI Agent Networks for Open Collaboration
Open-source framework for intelligent speech interaction
Training Large Language Model to Reason in a Continuous Latent Space