When a page with a lot of comments is parsed, the remove_noise and
restore_noise functions only deal well with the first 1000 patterns met.
Whenever the number of *noise* patterns is greater, weird results are
obtained and html code containing comments is misplaced.
The patch moves the limit to 100 thousands.
Nobody/Anonymous
None
None
Public
|
Date: 2009-11-05 09:31 The issue is described in a more detailed way here: |
| Filename | Description | Download |
|---|---|---|
| simple_html_dom.patch | Patch to correctly parse pages with a lot of comments. | Download |
| Field | Old Value | Date | By |
|---|---|---|---|
| File Added | 349609: simple_html_dom.patch | 2009-11-05 09:27 | guglielmocelata |
Copyright © 2009 Geeknet, Inc. All rights reserved. Terms of Use