Thank You for Your great work!
I just added new topic on forum https://sourceforge.net/p/phpcrawl/discussion/307696/thread/c3966b6a/
Could You make a feature, that can strip not needed tags from html source before finding references in it. For example in PHPCrawlerLinkFinder.class.php the preg_match_all() can have a lot of iterations to find references, but if we make something like $html_source = strip_tags($html_source, ''); before it - the number of iteration will be much more lower. Of course '' is user configuration, e.g. $crawler->setAllowedTags(['a', 'img']);
Anonymous