High-quality implementations of standard and SOTA methods
Multimodal-Driven Architecture for Customized Video Generation
Changelog CI is a GitHub Action that enables a project
User toolkit for analyzing and interfacing with Large Language Models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
A Universal Customization Method for Single and Multi Conditioning
Repo of Qwen2-Audio chat & pretrained large audio language model
Turn your website into a GIF
The Agent-User Interaction Protocol
Gives you one-liners that aids in penetration testing operations
A Personalized LLM-powered Agent Frameworks
Deep learning driven jazz generation using Keras & Theano
Designed for training LLM/VLM agents via RL
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
95% token savings. 155x faster queries. 16 languages
A best practices guide for day 2 operations
Flax is a neural network library for JAX
4M: Massively Multimodal Masked Modeling
Image-to-Image Translation in PyTorch
Segmentation models with pretrained backbones. PyTorch
PyTorch extensions for fast R&D prototyping and Kaggle farming
Multi-Agent daTa geneRation Infra and eXperimentation framework
Reading book source
Open source framework for deep learning satellite and aerial imagery
High-Fidelity and Controllable Generation of Textured 3D Assets