Commit | Date | |
---|---|---|
[r117] by guidrymw: Converted the cuPrintf statements in the kernel of explicitgpuDouble.cu to printf statements. Kernel printf requires compute capability 2.0, so compile with nvcc -arch sm_20 explicitgpuDouble.cu. Without the -arch sm_20 flag, the printf statements in the kernel give compile errors. | 2013-02-20 13:58:23 | Tree |
[r116] by guidrymw: Changed the three places where populations are updated to run serially. Now, at least for the first 100 timesteps, the CUDA and Java codes give almost exactly the same results. Thus, there seems to be an error in the parallel implementation of the population update algorithm. | 2013-02-15 00:39:27 | Tree |
2013-02-13 20:21:33 | Tree | |
2013-02-13 19:52:14 | Tree | |
2013-02-11 20:07:24 | Tree | |
[r112] by guidrymw: Added print statements for each timestep, matching those in the Java version, for simpler debugging. | 2013-02-11 18:12:57 | Tree |
2013-02-08 20:20:35 | Tree | |
[r110] by guidrymw: Corrected the initial value of sumXLast to be the sum over the actual mass fractions rather than 1. This seems to have been distorting the initial timestep correction relative to the Java code. Now the first two timesteps give the same dt for the CUDA and Java codes. | 2013-02-06 17:44:51 | Tree |
[r109] by guidrymw: Increased the print buffer size by a factor of 10 over the default (line 1064) to capture all of the diagnostic printf output from the GPU. With the default buffer size, only the last several lines are output. | 2013-02-06 15:59:41 | Tree |
2013-02-04 19:09:51 | Tree |