begin transforming GPU stage 1 to use structures of arrays and decouple p/q values from their roots (WIP)
minor cleanup
switch size optimization to arbitrary precision throughout; separate ODE solver state; accurately track sign change of Jacobian determinant during homotopy path tracking
add 1 to degree of rotations; generalize objective function to optimize subsets of parameters; add support for degree 4 and 5
remove rectangular sizeopt norm; it was only to compare with poly51opt
adjust parameterization; add Gerbicz collision search; modify final stages of collision search to add radix sorting
add test program for fast collision search
streamline arithmetic
do hashtable-based collision search instead of the last few radix sort passes (25% speedup)
allow CPU stage 1 to batch multiple special-q together; moderately optimize radix sort; refactor control flow
merge control flow between CPU and GPU polyselect stage1, initialize both codebases identically, and make the CPU code use radix sorting instead of a hashtable
fold in changes from specialq64 branch; overhaul CPU poly select stage 1 to use many of the same tools that the GPU code has developed
Add ODE with adaptive stepsize for homotopy predictor; experiment with automatic problem scaling; adjust curve tracking to account for determinant of Jacobian; put initial values in homotopy callback
update include
generalize solver to arbitrary homotopies; add better Newton sequencing, backtracking line search and 2nd order predictor; remove dead code
overhaul degree 6 size optimization (in progress)
refine least squares
size optimization research
Update NVIDIA architecture capability code
Remove unnecessary CUDA library references
NVIDIA CUB is now a part of the 12.5 CUDA toolkit and no longer needs to be added separately
Update NVIDIA CUB to the current version 9which is NOW a part of NVIDIA CCCL)
update to CUDA 12.5
Msieve help
Add VBITS info to output.
update to Nvidia Cuda 12.3
fix bug in singleton removal
Update to current version of Nvidia CUB
update to Nvidia Cuda 12.0
update to CUDA 11.8
Update Visual Studio build to Visual Studio 2022 and CUDA 11.6
Update GPU kernel loading code to find compute capabilities 5.0 .. 8.6
update to CUDA 11.4
update to CUDA 11.3
correct error in the non-gpu Visual Studio build
update to CUDA 11.2
fix buffer overflows caused by array offsets exceeding 2^32
update to CUDA 11.2 and Visual Studio 2019
fix MPI block Lanczos
Update Visual Studio 2019 build to use Nvidia CUDA 11.1
update changelog
avoid crash with GMP-6.2.0
remove Visual Studio *.user file committed in error
update to CUDA v11.0
add Visual Studio 2019 build files
add Visual Studio 2019 build files
update for CUDA 10
Update the Visual Studio 2017 build to CUDA 10
Update Visual Studio 2017 build files to accommodate latest source changes
merge CPU code of msieve-lacuda branch
merge trunk changes
add several polynomial selection tweaks
update to CUDA 9.2
make initial maximum merge size configurable
remove exit() call from NFS filtering error path
combine positive and negative intervals; make sieve size variable; add profiling hooks
add fixes for MPI builds
add Visual Studio 2017 build (with CUDA 9.0)
update to CUDA 9.0
add first wave of MPI changes for large vectors; specify vector size from makefile
make the choice of superblock size account for wider vectors
fix dump interval computations to account for large vectors
add support for wider (128-bit or 256-bit) vectors in the linear algebra
upate CUB library
merge trunk
update makefile to build GPU code for more rece...
start next version
Msieve v1.53 released
release 1.53
finish v1.53
correct previous commit
fix preprocessor use to avoid gcc complaints
improve windows diagnostic messages when GPU so...
correct Visual STudio 2015 build (build sort en...
test access
test access
test access
further correction to the Visual Studio 2015 build
correct error in the output of the subversion n...
Remove string subroutines available in Visual S...
add Visual Studio 2015 build files
update Visual Studio build for sort engine changes
update Visual Studio build for sort engine changes
cap the number of poly select GPU stage 1 threa...
remove debug print
replace b40c GPU sort cde with CUB library
extend the fix in r981
fix buffer overflow in NFS poly scoring for deg...
fix r984
allow -march=native
add an option to go straight to NFS with no oth...
Further update Visual Studio 2013 build to CUDA...