Official code for StoryMem: Multi-shot Long Video Storytelling
Contains the code for CM-SS13
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Gorilla: An API store for LLMs
Training course for Ansible automation platform
A minimal yet professional single agent demo project
An open-source framework for building serverless applications
Learn to build your Second Brain AI assistant with LLMs
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Manage the release notes for your project
A minimalist environment for decision-making in autonomous driving
BioNeMo Framework: For building and adapting AI models
AI video agents framework for next-gen video interactions
Install K8S cluster anintroduce the principle of component interaction
Why use many token when few token do trick
Audio foundation model excelling in audio understanding
Ministack: Free, open-source local AWS emulator
Open multimodal web agent built by Ai2
Vertical novel search engine with unified reading and tracking tools
Hypernetworks that adapt LLMs for specific benchmark tasks
Specification and documentation for Agent Skills
A lightweight text-to-speech model with zero-shot voice cloning
Run all your local AI together in one package
Omnilingual ASR Open-Source Multilingual SpeechRecognition
A Model Context Protocol server for searching and analyzing arXiv