Share

Pavuk Web Spider and Performance Measure

Code

Programming Languages: C

License: GNU General Public License (GPL)

Repositories

browse code, statistics, last commit on 2009-03-03 cvs -d:pserver:anonymous@pavuk.cvs.sourceforge.net:/cvsroot/pavuk login

cvs -z3 -d:pserver:anonymous@pavuk.cvs.sourceforge.net:/cvsroot/pavuk co -P modulename

Show:

What's happening?

  • Pavuk Web Spider and Performance Measure

    stoecker committed patchset 116 of module pavuk_www to the Pavuk Web Spider and Performance Measure CVS repository, changing 1 files.

    2009-03-03 17:19:10 UTC by stoecker

  • inline style urls in <h1> not followed

    If an (or , , etc) has an inline style with an url, that URL won't be recognized. For example: Foo! /foo/bar.jpg isn't recognized as an element of that page. However, if you change to , it is: Foo! Following is a patch which fixes that behavior for me: ---...

    2009-02-27 21:32:14 UTC by nobody

  • Comment: accept scenario files from other pavuk builds

    Code has been adapted to at least ALWAYS recognize all available options, even the ones the current build won't actively support. This applies to both scenario files and command line options (sharing/distributing shell scripts and batch files will thus be a bit easier... ... though there's no provision in the code (yet?) to warn you about unsupported options being passed in, which, by not...

    2009-01-28 16:21:46 UTC by i_a

  • Comment: add a debug level for url->fname

    added as 'url2lcl' level identifier (next to the existing ones, such as 'js', 'rules', ...)

    2009-01-28 16:17:17 UTC by i_a

  • add a debug level for url->fname

    Add a separate debug level to see what is going on under the hood when transforming URLs to filenames/paths. Currently, that's rather hard to follow (read: you'll need a debugger if you really want to find out) when you're using advanced (= lisp) fnrules, mixed in with various -tr_* arguments to chop off bits of filename. Plus there's the new code to ensure the filenames are "safe" on both...

    2009-01-28 16:16:17 UTC by i_a

  • Comment: Win32/64: run pavuk without args --> crash

    Fixed by two changes: removed the PTW32_STATIC_LIB define in all MSVC2005 project files as those use the DLL version of the pthread-win32 lib anyway. Also moved the pthread cleanup calls, needed for when compiling with the static lib, AFTER the mt_cleanup() call, which, under the hood, would call several pthread APIs to cleanup semaphores.

    2009-01-28 16:11:28 UTC by i_a

  • Comment: Win32/64: run pavuk without args --> crash

    Problem is due to PTW32_STATIC_LIB define in the project settings, while those same projects use the DLL version of the pthreadsW32 library. Whoops. PTW32_STATIC_LIB define removed from the project settings: no more crash. So that means the pthredWin32 'static use' exit/cleanup code is called before we destroy those semaphores (because that's where pavuk crashes: while destroying...

    2009-01-28 12:51:28 UTC by i_a

  • Comment: Win32/64: run pavuk without args --> crash

    @#$%^&*( now can I assign this bug or what?! (Bloody SF)

    2009-01-28 12:37:32 UTC by i_a

  • Comment: Win32/64: run pavuk without args --> crash

    :-(.

    2009-01-28 12:33:25 UTC by i_a

  • Win32/64: run pavuk without args --> crash

    pavuk reports crash with access to a pointer value 0x00000010 (clearly a bad pointer) after printing the help text. Turns out it crashes within the pthreadWin32 library, but somehow this doesn't look like a pthread problem to me. Running pavuk with a URL to grab does not crash the app at the end.

    2009-01-28 12:32:01 UTC by nobody

Our Numbers