Autonomous Agents (LLMs) research papers. Updated Daily
Refer and Ground Anything Anywhere at Any Granularity
Project Lyra: Open Generative 3D World Models
Unsupervised Learning for Image Registration
Models for object and human mesh reconstruction
Visual Causal Flow
A Systematic Framework for Interactive World Modeling
HeavyDB (formerly MapD/OmniSciDB)
Video understanding codebase from FAIR for reproducing video models
Open-source 2D IDE for managing AI agents in native CLIs
Build your own AI application system for free
Foundational Models for State-of-the-Art Speech and Text Translation
Unifying 3D Mesh Generation with Language Models
Gracefully face hCaptcha challenge with multimodal llms
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
The world's only naturally intelligent knowledge technology
Learning multi-scale deep model correcting over- and under- exposed
Let us control diffusion models
Navigation mesh generation and pathfinding toolkit for game AI systems
Code release for "Masked-attention Mask Transformer
The official pytorch implementation of our paper
A PyTorch implementation of the NIPS 2017 paper
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
Style transfer, deep learning, feature transform