Towards Real-World Vision-Language Understanding
Multimodal-Driven Architecture for Customized Video Generation
A framework for managing your zsh configuration
Framework for automatic construction of vulnerable infrastructures
Unified Multimodal Understanding and Generation Models
Volcano Engine Reinforcement Learning for LLMs
Dataset of GPT-2 outputs for research in detection, biases, and more
A dev-first open source autonomous AI agent framework
Code for running inference and finetuning with SAM 3 model
Models for object and human mesh reconstruction
Fire up your models with the flame
A library to handle Apple Property List format in binary or XML
A neural network that transforms a design mock-up into static websites
Clojure Desktop UI framework
SAPIEN Manipulation Skill Framework
A unified analytics engine for large-scale data processing
Removes backgrounds from pictures. Extension for webui
The best ChatGPT that $100 can buy
Anthropic's educational courses
Diffusion Transformer with Fine-Grained Chinese Understanding
A Customizable Image-to-Video Model based on HunyuanVideo
Efficient library for processing 3D data
SENAITE Meta Package
Central interface to connect your LLM's with external data
Learn AI and LLMs from scratch using free resources