-
Thanks for the workaround suggestions. I have decided to use djvudump for the time being. I have PHP function which takes the djvudump output and converts it to an XML format similar to the output of djvutoxml. It is convertDumpToXML(), available here:
http://svn.wikimedia.org/viewvc/mediawiki/trunk/phase3/includes/DjVuImage.php?revision=21411&view=markup
I imagine we will switch back to...
03:54AM UTC on Apr 21 2007 in DjVuLibre
-
Sorry, that should have read "DjVuDocument::writeDjVuXML() calls DjVuImage::get_page() with sync=true".
03:28AM UTC on Apr 20 2007 in DjVuLibre
-
Please provide the ability to compile the djvulibre command line tools without indirect multi-page file support. This feature is dangerous in the context of server operation on uploaded files.
03:26AM UTC on Apr 20 2007 in DjVuLibre
-
djvutoxml is very slow, because it decodes every single image in the document. This is unnecessary to obtain the information it outputs. DjVuDocument::writeDjVuXML() calls DjVuImage::get_page() with sync=false, which causes the page to be decoded to an image.
djvudump provides almost identical information, but is implemented efficiently. DjVuDocument::writeDjVuXML() should be rewritten to...
03:22AM UTC on Apr 20 2007 in DjVuLibre
-
I hate to nitpick, but I did say "e.g." before the two fixes I suggested. In fact there were 8 regexes in all that I identified. Oh well, I guess there's no need for you to actually understand the fixes you're applying. I'm attaching a patch with the remainder.
File Added: bad-regexps.patch.
04:06PM UTC on Apr 05 2007 in GeSHi - Generic Syntax Highlighter
-
Thanks for fixing it. Yes it should be fixed in PCRE. It's not a performance problem with typical input, the real problem is that someone may deliberately submit worst-case input to wikis, blogs and pastebins, causing the affected page to become unviewable, and perhaps even DoSing the server. It's called an algorithmic complexity attack. On Wikipedia, we've had small-scale attacks of this kind...
11:39AM UTC on Apr 01 2007 in GeSHi - Generic Syntax Highlighter
-
OK, maybe not as hard to fix as that... The assembly regexes can be fixed by limiting the repetition count, e.g.
0[0-9a-fA-F]{1,32}[hH]
The BNF regex can be fixed by making the start not match the middle:
(<)([^&]+?)(>)
And mirc was a false alarm, no problem there.
03:58AM UTC on Mar 27 2007 in GeSHi - Generic Syntax Highlighter
-
PCRE has a worst case time order of O(N^2) in the input size for certain kinds of backtracking regexes. My initial review indicates that such regexes are present in the asm, bnf, mirc, mpasm and z80 syntax modules.
The problem occurs with any regex of the form:
(start)(middle)*(end)
...where there is a string that matches both start and middle. Any quantifier allowing an unlimited...
03:29AM UTC on Mar 27 2007 in GeSHi - Generic Syntax Highlighter
-
My sister upgraded her Gallery 1.5 -> 1.5.4, but got
File .htaccess is not writable by the webserver - chmod
it 777 before proceeding
but could not work out which .htaccess file to change!
Full file path was not given. Attached is an untested
patch against 1.5.4 to give the full path for this
error message.
04:50AM UTC on Nov 03 2006 in Gallery
-
timstarling committed patchset 10433 of module phase3 to the MediaWiki CVS repository, changing 1 files.
05:15AM UTC on Mar 28 2006 in MediaWiki