| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| 0.6.0 source code.tar.gz | 2026-04-10 | 664.5 kB | |
| 0.6.0 source code.zip | 2026-04-10 | 949.5 kB | |
| README.md | 2026-04-10 | 10.4 kB | |
| Totals: 3 Items | 1.6 MB | 2 | |
CUDA-Q QEC 0.6.0 and CUDA-Q Solvers 0.6.0
This is combined release of CUDA-Q QEC and CUDA-Q Solvers, both version 0.6.0.
This is the first CUDA-Q QEC release that builds example decoder applications on top of CUDA-Q Realtime [blog]. CUDA-Q QEC 0.6 ships with two new real-time-capable decoder pipelines: the RelayBP belief-propagation decoder for qLDPC codes and an NVIDIA Ising convolutional neural network (CNN) pre-decoder paired with a global decoder (PyMatching) for the surface code. These pipelines enable quantum vendors and QEC researchers to deploy real-time GPU decoding for two popular code families via NVQLink.
Additionally, this release of CUDA-Q QEC contains speed improvements for our GPU-accelerated RelayBP decoder (up to 19X!)
For CUDA-Q Solvers 0.6.0, support was added for a new UpCCGSD ansatz solver and a Coupled Exchange Operator (CEO) pool.
Please check out the docs and examples for how to get started using the CUDA-QX libraries!
Note: CUDA-Q QEC 0.6.0 and CUDA-Q Solvers 0.6.0 both depend on CUDA-Q 0.14. For CUDA-Q Realtime usage (experimental), you need to use CUDA-Q 0.14.1.
Features and Enhancements (QEC) ๐
- Sliding window optimize by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/343
- feat(qec): add trt_decoder_config for real-time decoding by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/384
- Add trt cudagraph by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/369
- Add decode batch by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/383
- Create PyMatching decoder plugin by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/396
- Add optional
Oparameter to PyMatching plugin by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/449 - Update
trt_decoderto supportuint8data types for I/O by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/455 - Add graph capture functions to common decoder interface by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/475
- Follow-up to [#475] - additional decoder interface updates by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/478
- Add Hololink QLDPC graph decode bridge and CI test by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/481
- Add realtime AI decoder / predecoder infrastructure (GPU + Host) w/ host dispatcher by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/457
- Add FPGA-based test application for realtime predecoder by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/490
nv-qldpc-decoder Updates (Closed Source)
- Implemented new graph capture interface functions to run RelayBP with CUDA-Q Realtime
- Add new
repeatableconfiguration option to enable bit-for-bit repeatable results when running back-to-back on the same system - Significant RelayBP optimizations, for both
fp32andfp64. Timings below show speedups relative to 0.5 for some well-known Bicycle Bivariate Codes on B200. All of the reported speedups are for non-batched, serial execution mode.
| Case Name (n_k_d) | Variant | Total Speedup (Ratio) |
|---|---|---|
| 72_12_6 | fp32 | 3.44 |
| 72_12_6 | fp64 | 2.74 |
| 144_12_12 | fp32 | 5.50 |
| 144_12_12 | fp64 | 4.51 |
| 288_12_18 | fp32 | 19.06 |
| 288_12_18 | fp64 | 13.16 |
| Average | 8.07 |
Bug Fixes (QEC) ๐
- Coverity fixes by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/395
- Fix OOB r/w issues by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/464
- Add onnxscript to trt_decoder optional dependency by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/407
- Bug fix and add test cases by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/424
- Fix pytorch AcceleratorError root-caused by QEC by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/472
Features and Enhancements (Solvers) ๐
- UpCCGSD ansatz solver by @rr637 in https://github.com/NVIDIA/cudaqx/pull/372
- Add Coupled Exchange Operator (CEO) pool by @jgonthier in https://github.com/NVIDIA/cudaqx/pull/387
Bug Fixes (Solvers) ๐
- Fix OOB r/w issues by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/464
- Fix the gradient evaluation bugs and add test cases by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/434
- Fix optimiser forwarding bug and other minor bugs and add test cases by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/435
- Fix BK transformation | Fix JW parity Z-chain | Add test cases by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/460
- Fix GQE invalid CUDA handle / AcceleratorError when moving model to GPU by @vedika-saravanan in https://github.com/NVIDIA/cudaqx/pull/473
- Fix mixer forwarding | Fix MPI implementation | Add test cases by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/452
- Add error message for missing system dependencies by @vedika-saravanan in https://github.com/NVIDIA/cudaqx/pull/467
- Fix qubit indices bug and add test case by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/469
- GQE: PyTorch GPU compatibility check, exit/skip test on mismatch by @vedika-saravanan in https://github.com/NVIDIA/cudaqx/pull/494
Documentation โ๏ธ
- Update documentation for relay-bp by @melody-ren in https://github.com/NVIDIA/cudaqx/pull/346
- [Docs] Add uccgsd to doc by @marwafar in https://github.com/NVIDIA/cudaqx/pull/340
- [Docs] update gen_ham with UHF by @marwafar in https://github.com/NVIDIA/cudaqx/pull/339
- [docs] Update nv-qldpc-decoder docs to describe the new proc_float option by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/288
- Incorporate Sliding Window Decoder docs by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/359
- Add docs for realtime decoding by @kvmto in https://github.com/NVIDIA/cudaqx/pull/345
- Add docs for AI decoder training with PyTorch by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/344
- fix typo in calling operator pool with uccsd in doc by @marwafar in https://github.com/NVIDIA/cudaqx/pull/366
- Add requirement for memory BP methods in docs by @melody-ren in https://github.com/NVIDIA/cudaqx/pull/376
- Added trt_decoder docs for Python and C++ by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/381
Common / Misc
- License updates by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/360
- Add license agreement notification to Docker image by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/362
- [core] Fix pre-existing extension point issue by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/374
- Bump CUDA-Q commit (with support for breaking changes) by @github-actions[bot] in https://github.com/NVIDIA/cudaqx/pull/416
- Bump CUDA-Q commit (with non-trivial updates) by @github-actions[bot] in https://github.com/NVIDIA/cudaqx/pull/437
- Bump CUDA-Q commit and re-enable some tests by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/450
- Bump CUDA-Q dependencies from 0.13 to 0.14 by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/468
- Align CUDA-Q and CUDA-Q Realtime commits for 0.14.1 by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/489
- Fix minor issues reported by Coverity by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/399
- Redundantly including Logger.h and FmtCore.h includes ahead of runtime refactor. by @Renaud-K in https://github.com/NVIDIA/cudaqx/pull/409
- Update cuda-quantum-devdeps:ext-... to cuda-quantum-devcontainer-... by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/420
- Fix build if CUDAQ_REALTIME_ROOT is not set by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/432
- Follow-up to [#396] and [#416] - fix wheel builds by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/439
- Update heterogeneous_map to recognize ints as bools by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/441
- Update CMake for TensorRT decoder unit test by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/448
Testing
- Update test scripts to allow easy data generation by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/354
- Follow-up to [#344] - add onnxscript to wheels test env by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/368
- [uccsd/uccgsd] Update test tolerances by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/375
- Add playback/record to the surface code 1 test by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/406
- Add nv-qldpc-test option to surface_code-1.cpp by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/415
- Mock decoder by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/423
- Add cuda graph launch to mock decoder and introduce autonomous decoder CRTP by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/429
- [ci] Update how to get cudaq::realtime by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/443
- [realtime] Mock decoder updates for latest cudaq::realtime updates by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/444
- [realtime] Move mock decoders to test directories by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/445
- [realtime] Advance cudaq::realtime commit and update names accordingly by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/446
- Skip GQE GPU tests on unsupported GPU architectures by @vedika-saravanan in https://github.com/NVIDIA/cudaqx/pull/477
- Update container validation script by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/487
- Remove cudaq-realtime from All libs CI and enable QLDPC graph test in Release CI by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/482
New Contributors
- @kaiqiy-nv made their first contribution in https://github.com/NVIDIA/cudaqx/pull/399
- @rr637 made their first contribution in https://github.com/NVIDIA/cudaqx/pull/372
- @jgonthier made their first contribution in https://github.com/NVIDIA/cudaqx/pull/387
- @vedika-saravanan made their first contribution in https://github.com/NVIDIA/cudaqx/pull/467
Full Changelog: https://github.com/NVIDIA/cudaqx/compare/0.5.0...0.6.0