A Customizable Image-to-Video Model based on HunyuanVideo
Let's make video diffusion practical
Official code for StoryMem: Multi-shot Long Video Storytelling
Let agents classify your bank transactions
Release for Improved Denoising Diffusion Probabilistic Models
No-code AI workflow
Diversity-driven optimization and large-model reasoning ability
Build high-quality LLM apps
No-code LLM Platform to launch APIs and ETL Pipelines
Visual Causal Flow
CLIP: predict the most relevant text snippet given an image
Code for running inference with the SAM 3D Body Model (3DB)
Large Multimodal Models for Video Understanding and Editing
A command-line productivity tool powered by AI large language models
Codebase to Tutorial
Open source no-code system for text annotation and building of text
Dealing with all unstructured data, such as reverse image search
ChatGPT interface with better UI
Open-source, code-first Python toolkit for building, evaluating, etc.
Z80-μLM is a 2-bit quantized language model
Ultimate meta-skill for generating best-in-class Claude Code skills
Multilingual sentence & image embeddings with BERT
Your open-source LLM evaluation toolkit
DeepMind model for tracking arbitrary points across videos & robotics
gpt-oss-120b and gpt-oss-20b are two open-weight language models