Open Source Speech Language Model
Cross-platform GUI for the Real-ESRGAN image upscaler
State-of-the-art (SoTA) text-to-video pre-trained model
Qwen3 is the large language model series developed by the Qwen team
A security scanner for custom LLM applications
An advanced paper search agent powered by large language models
LLM-based reinforcement learning audio editing model
Text- and image-to-video generation: CogVideoX (2024) and CogVideo
multiaddr implementation in Python
Protect your eyes from eye strain using this simple break reminder
Unified framework for robot learning built on NVIDIA Isaac Sim
borb is a library for reading, creating and manipulating PDF files
Modular AI image and video generation web UI with extensible tools
Convert a Python notebook to a web app and share it with non-technical users
Weaving the Digital Agent Galaxy
Long-form streaming TTS system for multi-speaker dialogue generation
Open-Source Dual-Arm Mobile Robot with Motorized Lift
Video understanding codebase from FAIR for reproducing video models
Management of Yandex Station and other smart home devices
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Pure-Python Git implementation
Repo of Qwen2-Audio chat & pretrained large audio language model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Diversity-driven optimization and large-model reasoning ability
Dual-channel spectroscopic receiver using LimeSDR-USB