Pure Data Computer Music System / Patches / #432 Invalid memory access in s_utf8.c

Marvin Humphrey - 2011-10-09

In response to speed concerns raised in an IRC conversation ("well, its a
realtime engine that communicates thru the utf8 stuff, so its good to have it
efficient"), I have prepared a benchmark program to test the performance of
u8_offset(). It reads a file into RAM, then traverses it with u8_offset() for a
user-specified number of iterations.
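For readers without the attachment to hand, a harness of this kind might look roughly like the sketch below. It is not the attached bench.c: the names slurp(), bench_traverse() and u8_offset_standin() are hypothetical, the stand-in only approximates u8_offset() for well-formed input, and the one-character-at-a-time traversal is an assumption about how such a benchmark could walk the buffer.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical stand-in for u8_offset() from src/s_utf8.c: returns the byte
 * offset of character number `charnum` in a NUL-terminated string.  Link
 * against s_utf8.c and call the real function to measure it instead. */
int u8_offset_standin(const char *str, int charnum)
{
    int offs = 0;
    while (charnum > 0 && str[offs]) {
        offs++;
        /* Skip continuation bytes (bit pattern 10xxxxxx). */
        while (((unsigned char)str[offs] & 0xC0) == 0x80) offs++;
        charnum--;
    }
    return offs;
}

/* Read a whole file into a malloc'd, NUL-terminated buffer.
 * Returns NULL on failure; the caller frees the result. */
char *slurp(const char *path, size_t *len_out)
{
    FILE *fh = fopen(path, "rb");
    char *buf = NULL;
    long size;

    if (fh == NULL) return NULL;
    if (fseek(fh, 0, SEEK_END) == 0 && (size = ftell(fh)) >= 0 &&
        fseek(fh, 0, SEEK_SET) == 0) {
        buf = malloc((size_t)size + 1);
        if (buf != NULL && fread(buf, 1, (size_t)size, fh) == (size_t)size) {
            buf[size] = '\0';
            *len_out = (size_t)size;
        } else {
            free(buf);
            buf = NULL;
        }
    }
    fclose(fh);
    return buf;
}

/* Walk the buffer one character at a time, `iterations` times over.
 * Returns the total number of bytes walked so that the compiler cannot
 * discard the loop as dead code. */
unsigned long bench_traverse(const char *buf, int iterations)
{
    unsigned long total = 0;
    assert(buf != NULL);
    for (int i = 0; i < iterations; i++) {
        const char *p = buf;
        while (*p != '\0') {
            int step = u8_offset_standin(p, 1);
            p += step;
            total += (unsigned long)step;
        }
    }
    return total;
}
```

A main() for such a harness would call slurp() on the file named in argv[1], parse the iteration count from argv[2], print the file name, size and iteration count, and then call bench_traverse() under time(1).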

Here are some results using the French Wikipedia page on Pure Data as source
material.

Without patch:

marvin@smokey:~/projects/pure-data $ cc -Wall -Wextra -std=gnu99 -O3 bench.c src/s_utf8.c -o bench
marvin@smokey:~/projects/pure-data $ time ./bench pd.html 10000
pd.html (44683 bytes) 10000 iterations

real 0m0.732s
user 0m0.716s
sys 0m0.003s

Running the test 10 times produced a range of 0.724s to 0.733s, so the
benchmark was pretty stable even on a Mac. :)

With patch:

marvin@smokey:~/projects/pure-data $ cc -Wall -Wextra -std=gnu99 -O3 bench.c src/s_utf8.c -o bench
marvin@smokey:~/projects/pure-data $ time ./bench pd.html 10000
pd.html (44683 bytes) 10000 iterations

real 0m0.618s
user 0m0.612s
sys 0m0.004s

So, in addition to eliminating the Valgrind warnings, the patched version is
actually faster. I speculate that this is because the current version of
u8_offset() needlessly traverses at least one extra byte when the header byte of
the sequence is plain ASCII.

For what it's worth, if we remove the NUL-termination check from the loop
condition...

- while (charnum > 0 && str[offs]) {
+ while (charnum > 0) {

...we get even better results:

marvin@smokey:~/projects/pure-data $ time ./bench pd.html 10000
pd.html (44683 bytes) 10000 iterations

real 0m0.514s
user 0m0.507s
sys 0m0.004s

However, that could yield bogus results in the event that strlen(string)
differs from the buffer length held in a separate integer. I would go that
route in code which had been engineered to not need NUL-termination from the
start, but wouldn't advocate for it here.
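For comparison, code designed around an explicit length rather than NUL-termination might look like the sketch below. The name u8_offset_n() and its signature are hypothetical and not part of s_utf8.c. The loop is bounded by the length the caller passes in, so an embedded NUL, or a missing terminator, does not change the result.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical length-bounded variant: returns the byte offset of character
 * number `charnum` in buf[0..len), never reading at or beyond buf[len]. */
size_t u8_offset_n(const char *buf, size_t len, int charnum)
{
    size_t offs = 0;
    assert(buf != NULL || len == 0);
    while (charnum > 0 && offs < len) {
        offs++;
        /* Skip continuation bytes (10xxxxxx), staying inside the buffer. */
        while (offs < len && ((unsigned char)buf[offs] & 0xC0) == 0x80) {
            offs++;
        }
        charnum--;
    }
    return offs;
}
```

With the three-byte buffer "a\0b" and a length of 3, asking for character 3 yields offset 3, whereas a NUL-checking loop would stop at offset 1. That is the kind of disagreement between strlen() and a separately held length described above.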

As a further test, I tried an implementation based on u8_seqlen(), which has
the advantage of being easier to understand, and is the algorithm currently
used by the Perl core among others:

    while (charnum-- > 0 && str[offs]) {
        offs += u8_seqlen(str + offs);
    }

It performed substantially worse:

marvin@smokey:~/projects/pure-data $ time ./bench pd.html 10000
pd.html (44683 bytes) 10000 iterations

real 0m1.906s
user 0m1.891s
sys 0m0.003s

In this case, I speculate that the lookup into the trailingBytesForUTF8 array
is costly, even though it doubtless gets into the hottest CPU cache and stays
there.

The benchmarks were run on a 2007 MacBook Pro with a 2.4 GHz Core Duo
chipset running OS X 10.6 (Snow Leopard).


Hans-Christoph Steiner - 2011-10-09

This sounds great, I'll include it in Pd-extended and we'll see how it does there. The only problem is that there are two patches included in this tracker and they seem to have the same name. Could you delete the older one, so there is only one patch?


Marvin Humphrey - 2011-10-09

bench.c


Marvin Humphrey - 2011-10-09

0001-Improve-UTF-8-processing.patch


Marvin Humphrey - 2011-10-09

I have uploaded a new version of the patch for s_utf8.c, and also a new
version of the benchmarking app. Highlights:

1. The benchmarking app now handles four different functions.
2. All changed functions have been sped up since the last patch,
especially u8_charnum().

u8_offset: 16% faster
u8_charnum: 22% faster
u8_inc: 10% faster


Hans-Christoph Steiner - 2011-10-10

I accepted this into Pd-extended.


Hans-Christoph Steiner - 2011-10-10

assigned_to: nobody --> millerpuckette

Miller Puckette - 2012-12-15

This appears to have been applied already from somewhere else.


Miller Puckette - 2012-12-15

status: open --> pending-fixed

IOhannes m zmölnig - 2016-05-11

Status: pending-fixed --> closed-fixed
