Inference script for Oasis 500M
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
An advanced paper search agent powered by large language models
Open-weight, large-scale hybrid-attention reasoning model
Building a Secure and Interoperable Future for AI-Driven Payments
AI-powered tool to quickly remove watermarks from videos flawlessly
Framework that is dedicated to making neural data processing
TensorFlow-based neural network library
Open Source Computer Vision Library
Multi-Voice and Prompt-Controlled TTS Engine
Renren Film and Television bot, fully connected to Renren resources
Overcoming Data Limitations for High-Quality Video Diffusion Models
Chatbot daemon that connects to your favorite chat services
A Customizable Image-to-Video Model based on HunyuanVideo
Suite with Real-ESRGAN, BSRGAN , RealESRNet, IRCNN, GFPGAN & RIFE.
MMEditing is a low-level vision toolbox based on PyTorch
An Autonomous LLM Agent for Complex Task Solving
High-Resolution Image Synthesis with Latent Diffusion Models
OpenFieldAI is an AI based Open Field Test Rodent Tracker
SoundTranscriber can be used to generate automatic transcription / aut
It's possible for machines to become self-aware.
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Translate English to Bangla using CSV file format and range wise.
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation