Thread: [Vxl-maintainers] VNL SSE alignment bug


Hi All,

  I'm trying to enable SSE support for VNL on my machine, and I've
encountered a bug.  The vnl test named "test_alignment" generates
several failures, followed by a segmentation fault.  The failures seem
sporadic and seem to be related to a bad choice of epsilon when
comparing results.  These failures appear in several dashboard builds.
The more serious problem is the segfault, which I don't see on the
dashboard.  I've enabled SSE on my dashboard build, so it should show
up tomorrow under build name Linux-2.6_gcc-4.1.3_-Wall at
lems.brown.edu.

Here is an explanation of what causes the crash based on my limited
understanding of SSE.  The crash occurs in
test_alignment.cxx: line 24
called from within the nested loops when vector size = 4, matrix
offset = 0, vector offset = 0, and result offset = 1.  I've traced the
problem to
vnl_sse.h: line 555
which calls _mm_load_ps on the result vector which is not 16-byte
aligned because of the offset in the test.
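
To illustrate the arithmetic behind that crash, here is a minimal sketch. It assumes 4-byte floats, and the helper names and the buffer are hypothetical stand-ins, not code from vnl_sse.h or test_alignment.cxx. Offsetting a 16-byte aligned float buffer by one element moves the pointer 4 bytes past the boundary, which is exactly the kind of address that _mm_load_ps must not be given.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical helper (not part of VNL): true if p lies on a 16-byte boundary.
inline bool is_aligned16(const void* p)
{
  return reinterpret_cast<std::uintptr_t>(p) % 16 == 0;
}

// Hypothetical helper: number of bytes p sits past the previous 16-byte boundary.
inline std::size_t misalignment16(const void* p)
{
  return static_cast<std::size_t>(reinterpret_cast<std::uintptr_t>(p) % 16);
}

// A 16-byte aligned buffer standing in for the storage used by the test.
alignas(16) static float buffer[8] = {0};
```

With these definitions, buffer and buffer + 4 are aligned, while buffer + 1 (the "result offset = 1" case) is not.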

I'm not sure why this isn't failing on the other dashboard builds.
The SSE code looks correct as long as the vectors have a 16-byte
aligned address.  It handles the case where the data block does not
evenly divide into 16-byte blocks, but does not handle the case where
the starting address is not 16-byte aligned.  Usually, you don't have
to worry about this case because the sse allocator function allocates
arrays of aligned memory.  However, as the test case indicates, it is
possible to create a vnl_vector_ref that uses an arbitrary block of
data.
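
One possible way to handle an unaligned starting address is sketched below. This is only an assumption about what a fix could look like, not the actual VNL code: the function name add_any_alignment and the SKETCH_HAS_SSE macro are made up for illustration. It uses the aligned intrinsics only when all three pointers are 16-byte aligned, and otherwise falls back to _mm_loadu_ps and _mm_storeu_ps, which have no alignment requirement.

```cpp
#include <cstddef>
#include <cstdint>
#if defined(__SSE__) || defined(_M_X64) || (defined(_M_IX86_FP) && _M_IX86_FP >= 1)
#  include <xmmintrin.h>
#  define SKETCH_HAS_SSE 1
#else
#  define SKETCH_HAS_SSE 0
#endif

// Hypothetical function, not the VNL implementation: r[i] = a[i] + b[i].
inline void add_any_alignment(const float* a, const float* b, float* r, std::size_t n)
{
  std::size_t i = 0;
#if SKETCH_HAS_SSE
  // OR the three addresses together: the low 4 bits are zero only if
  // every pointer is 16-byte aligned.
  const std::uintptr_t bits = reinterpret_cast<std::uintptr_t>(a)
                            | reinterpret_cast<std::uintptr_t>(b)
                            | reinterpret_cast<std::uintptr_t>(r);
  if (bits % 16 == 0) {
    // Aligned path: _mm_load_ps / _mm_store_ps require 16-byte alignment.
    for (; i + 4 <= n; i += 4)
      _mm_store_ps(r + i, _mm_add_ps(_mm_load_ps(a + i), _mm_load_ps(b + i)));
  } else {
    // Unaligned path: safe for any starting address.
    for (; i + 4 <= n; i += 4)
      _mm_storeu_ps(r + i, _mm_add_ps(_mm_loadu_ps(a + i), _mm_loadu_ps(b + i)));
  }
#endif
  // Scalar tail for the elements that do not fill a 4-float block
  // (and the whole range when SSE is unavailable).
  for (; i < n; ++i)
    r[i] = a[i] + b[i];
}
```

The check is done once per call rather than per block, so the aligned case keeps its fast path while a vnl_vector_ref over arbitrary memory no longer reaches an aligned load.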

Does this make sense or is something else wrong?  Am I the only one
having trouble with SSE support or is this problem more widespread?  I
don't have a sense of whether people are using it with no trouble or
simply disabling it.

Thanks,
Matt
