AWS Skills for Agents
Our first fully AI generated deep learning system
AI-driven public opinion trend monitor with multi-platform aggregation
Research code artifacts for Code World Model (CWM)
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
PersonaPlex code
An experimental version of DeepSeek model
Best Practices on Recommendation Systems
Jupyter notebook tutorials for OpenVINO
Generate Any 3D Scene in Seconds
Cloud-native open source data warehouse for analytics and AI queries
Open-source multi-speaker long-form text-to-speech model
The fast, Pythonic way to build Model Context Protocol servers
No-code in the front, Python in the back. An open-source framework
A Pragmatic VLA Foundation Model
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Reference implementations of MLPerf™ training benchmarks
Run Local LLMs on Any Device. Open-source
Secure local-first microVM sandbox for running untrusted code fast
An open-source, modern-design AI training tracking and visualization
Large Multimodal Models for Video Understanding and Editing
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Build and connect intelligent bots that interact naturally
Private chat with local GPT with document, images, video, etc.
Benchmarking Multimodal Agents for Open-Ended Tasks