Text and image to video generation: CogVideoX and CogVideo
Global weather forecasting model using graph neural networks and JAX
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
Convert AI papers to GUI
Open source libraries and APIs to build custom preprocessing pipelines
StarVector is a foundation model for SVG generation
Integrating LLMs into structured NLP pipelines
Implementation of the Surya Foundation Model for Heliophysics
Programmatic access to the AlphaGenome model
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
The 100 line AI agent that solves GitHub issues
High-resolution models for human tasks
A Unified Framework for Text-to-3D and Image-to-3D Generation
Stanford NLP Python library for many human languages
Genome modeling and design across all domains of life
An agentless approach to automatically solve software development
A system for agentic LLM-powered data processing and ETL
Generate Any 3D Scene in Seconds
The Memory layer for AI Agents
Sharp Monocular Metric Depth in Less Than a Second
OCR expert VLM powered by Hunyuan's native multimodal architecture
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Large-language-model & vision-language-model based on Linear Attention
High-Resolution Image Synthesis with Latent Diffusion Models
AI Suite for upscaling, interpolating & restoring images/videos