A framework to enable multimodal models to operate a computer
Improve human sleep through scientifically
Official implementation of Watermark Anything with Localized Messages
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Harness LLMs with Multi-Agent Programming
Automate browser-based workflows with LLMs and Computer Vision
Low-code framework for building custom LLMs, neural networks
Open source platform for the machine learning lifecycle
An Open Source implementation of Notebook LM with more flexibility
AI assistant based on large models that can actively think and plan
Chat with it via text and voice
Automatically translates the text of a video based on a subtitle file
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
The Memory layer for AI Agents
AI-powered code generation tool for scratch development of web apps
Bash is all you need, write a claude code with only 16 line code
A nearly-live implementation of OpenAI's Whisper
Code for running inference with the SAM 3D Body Model 3DB
Contexts Optical Compression
Uncommon Objects in 3D dataset
Audiocraft is a library for audio processing and generation
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A lightweight, powerful framework for multi-agent workflows
An AI personal assistant for your digital brain
A lightweight, powerful framework for multi-agent workflows