| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2026-01-26 | 2.6 kB | |
| v1.16.0-ce source code.tar.gz | 2026-01-26 | 9.0 MB | |
| v1.16.0-ce source code.zip | 2026-01-26 | 9.6 MB | |
| Totals: 3 Items | 18.6 MB | 0 | |
v1.16.0-ce
✨ New Features
- Advanced Agent Services
- Introduced Agent Memory Service to provide persistent context for agent interactions.
- Hardware & Compute Enhancements
- vGPU Support: Initial support for vGPU and physical GPU resource reporting and usage calculation.
- AMD GPU Support: Full compatibility for both Inference and Finetuning tasks on AMD hardware.
- CUDA Version Management: Added a new CUDA-version API, allowing resource switching based on specific CUDA version requirements.
- Enabled multi-host inference support for NVIDIA vLLM and SGLang.
- Storage & Infrastructure
- Added PVC support for Spaces, allowing persistent storage for application environments.
- Support for network interface configuration in "in-cluster" mode.
- Multimodal
- Support for Text-or-Image-to-Video (TI2V), allowing text-to-video and image-to-video generation.
- Developer Experience
- Upgraded Gradio SDK to 6.2.0 (maintaining backward compatibility with 5.1.0).
- Space optimization: Added ability to skip build steps for Gradio, Streamlit, Nginx, and MCP Server environments.
- Added GET/HEAD routes for Code and MCP resources within the CSGHub SDK.
🚀 Enhancements & Bug Fixes
- Space & Deployment Improvements
- Added status filters and availability filters for the Space Index API.
- Support for public Docker registry configuration and updated runtime images.
- Deployment responses now include the service name for easier identification.
- Added the ability to force-delete deployments.
- Repository & Mirroring
- Enhanced repo syncer with retry mechanisms for mirror tasks.
- Fixed issues where repositories total count mismatched.
- Reset running mirror tasks automatically after a mirror LFS service restart.
- Mirror tokens are now strictly limited to read-only actions for improved security.
- Safety & Content
- Refactored sensitive scenario handling and updated vocabulary for better moderation accuracy.
- Updated moderation service risk levels.
- General Fixes
- Fixed token usage recording for streaming chat completion requests.
- Resolved GGUF listing issues and collection message bugs.
- Fixed specific bugs in Notebook environments and resource checking logic.
- Improved repository sync status consistency between CE and EE versions.
- Refactoring
- Significant architectural cleanup including splitting repository implementations from interfaces and using cluster pool interfaces.
New Contributors
- @denny-zhao