CUDA-QX - Browse /0.6.0 at SourceForge.net

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
0.6.0 source code.tar.gz	2026-04-10	664.5 kB	0
0.6.0 source code.zip	2026-04-10	949.5 kB	2
README.md	2026-04-10	10.4 kB	0
Totals: 3 Items		1.6 MB	2

CUDA-Q QEC 0.6.0 and CUDA-Q Solvers 0.6.0

This is combined release of CUDA-Q QEC and CUDA-Q Solvers, both version 0.6.0.

This is the first CUDA-Q QEC release that builds example decoder applications on top of CUDA-Q Realtime [blog]. CUDA-Q QEC 0.6 ships with two new real-time-capable decoder pipelines: the RelayBP belief-propagation decoder for qLDPC codes and an NVIDIA Ising convolutional neural network (CNN) pre-decoder paired with a global decoder (PyMatching) for the surface code. These pipelines enable quantum vendors and QEC researchers to deploy real-time GPU decoding for two popular code families via NVQLink.

Additionally, this release of CUDA-Q QEC contains speed improvements for our GPU-accelerated RelayBP decoder (up to 19X!)

For CUDA-Q Solvers 0.6.0, support was added for a new UpCCGSD ansatz solver and a Coupled Exchange Operator (CEO) pool.

Please check out the docs and examples for how to get started using the CUDA-QX libraries!

Note: CUDA-Q QEC 0.6.0 and CUDA-Q Solvers 0.6.0 both depend on CUDA-Q 0.14. For CUDA-Q Realtime usage (experimental), you need to use CUDA-Q 0.14.1.

Features and Enhancements (QEC) 🎉

Sliding window optimize by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/343
feat(qec): add trt_decoder_config for real-time decoding by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/384
Add trt cudagraph by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/369
Add decode batch by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/383
Create PyMatching decoder plugin by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/396
Add optional O parameter to PyMatching plugin by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/449
Update trt_decoder to support uint8 data types for I/O by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/455
Add graph capture functions to common decoder interface by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/475
Follow-up to [#475] - additional decoder interface updates by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/478
Add Hololink QLDPC graph decode bridge and CI test by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/481
Add realtime AI decoder / predecoder infrastructure (GPU + Host) w/ host dispatcher by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/457
Add FPGA-based test application for realtime predecoder by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/490

`nv-qldpc-decoder` Updates (Closed Source)

Implemented new graph capture interface functions to run RelayBP with CUDA-Q Realtime
Add new repeatable configuration option to enable bit-for-bit repeatable results when running back-to-back on the same system
Significant RelayBP optimizations, for both fp32 and fp64. Timings below show speedups relative to 0.5 for some well-known Bicycle Bivariate Codes on B200. All of the reported speedups are for non-batched, serial execution mode.

Case Name (n_k_d)	Variant	Total Speedup (Ratio)
72_12_6	fp32	3.44
72_12_6	fp64	2.74
144_12_12	fp32	5.50
144_12_12	fp64	4.51
288_12_18	fp32	19.06
288_12_18	fp64	13.16
Average		8.07

Bug Fixes (QEC) 🐛

Coverity fixes by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/395
Fix OOB r/w issues by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/464
Add onnxscript to trt_decoder optional dependency by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/407
Bug fix and add test cases by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/424
Fix pytorch AcceleratorError root-caused by QEC by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/472

Features and Enhancements (Solvers) 🎉

UpCCGSD ansatz solver by @rr637 in https://github.com/NVIDIA/cudaqx/pull/372
Add Coupled Exchange Operator (CEO) pool by @jgonthier in https://github.com/NVIDIA/cudaqx/pull/387

Bug Fixes (Solvers) 🐛

Fix OOB r/w issues by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/464
Fix the gradient evaluation bugs and add test cases by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/434
Fix optimiser forwarding bug and other minor bugs and add test cases by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/435
Fix BK transformation | Fix JW parity Z-chain | Add test cases by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/460
Fix GQE invalid CUDA handle / AcceleratorError when moving model to GPU by @vedika-saravanan in https://github.com/NVIDIA/cudaqx/pull/473
Fix mixer forwarding | Fix MPI implementation | Add test cases by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/452
Add error message for missing system dependencies by @vedika-saravanan in https://github.com/NVIDIA/cudaqx/pull/467
Fix qubit indices bug and add test case by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/469
GQE: PyTorch GPU compatibility check, exit/skip test on mismatch by @vedika-saravanan in https://github.com/NVIDIA/cudaqx/pull/494

Documentation ✏️

Update documentation for relay-bp by @melody-ren in https://github.com/NVIDIA/cudaqx/pull/346
[Docs] Add uccgsd to doc by @marwafar in https://github.com/NVIDIA/cudaqx/pull/340
[Docs] update gen_ham with UHF by @marwafar in https://github.com/NVIDIA/cudaqx/pull/339
[docs] Update nv-qldpc-decoder docs to describe the new proc_float option by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/288
Incorporate Sliding Window Decoder docs by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/359
Add docs for realtime decoding by @kvmto in https://github.com/NVIDIA/cudaqx/pull/345
Add docs for AI decoder training with PyTorch by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/344
fix typo in calling operator pool with uccsd in doc by @marwafar in https://github.com/NVIDIA/cudaqx/pull/366
Add requirement for memory BP methods in docs by @melody-ren in https://github.com/NVIDIA/cudaqx/pull/376
Added trt_decoder docs for Python and C++ by @wsttiger in https://github.com/NVIDIA/cudaqx/pull/381

Common / Misc

License updates by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/360
Add license agreement notification to Docker image by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/362
[core] Fix pre-existing extension point issue by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/374
Bump CUDA-Q commit (with support for breaking changes) by @github-actions[bot] in https://github.com/NVIDIA/cudaqx/pull/416
Bump CUDA-Q commit (with non-trivial updates) by @github-actions[bot] in https://github.com/NVIDIA/cudaqx/pull/437
Bump CUDA-Q commit and re-enable some tests by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/450
Bump CUDA-Q dependencies from 0.13 to 0.14 by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/468
Align CUDA-Q and CUDA-Q Realtime commits for 0.14.1 by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/489
Fix minor issues reported by Coverity by @kaiqiy-nv in https://github.com/NVIDIA/cudaqx/pull/399
Redundantly including Logger.h and FmtCore.h includes ahead of runtime refactor. by @Renaud-K in https://github.com/NVIDIA/cudaqx/pull/409
Update cuda-quantum-devdeps:ext-... to cuda-quantum-devcontainer-... by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/420
Fix build if CUDAQ_REALTIME_ROOT is not set by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/432
Follow-up to [#396] and [#416] - fix wheel builds by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/439
Update heterogeneous_map to recognize ints as bools by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/441
Update CMake for TensorRT decoder unit test by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/448

Testing

Update test scripts to allow easy data generation by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/354
Follow-up to [#344] - add onnxscript to wheels test env by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/368
[uccsd/uccgsd] Update test tolerances by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/375
Add playback/record to the surface code 1 test by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/406
Add nv-qldpc-test option to surface_code-1.cpp by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/415
Mock decoder by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/423
Add cuda graph launch to mock decoder and introduce autonomous decoder CRTP by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/429
[ci] Update how to get cudaq::realtime by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/443
[realtime] Mock decoder updates for latest cudaq::realtime updates by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/444
[realtime] Move mock decoders to test directories by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/445
[realtime] Advance cudaq::realtime commit and update names accordingly by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/446
Skip GQE GPU tests on unsupported GPU architectures by @vedika-saravanan in https://github.com/NVIDIA/cudaqx/pull/477
Update container validation script by @bmhowe23 in https://github.com/NVIDIA/cudaqx/pull/487
Remove cudaq-realtime from All libs CI and enable QLDPC graph test in Release CI by @cketcham2333 in https://github.com/NVIDIA/cudaqx/pull/482