I am very happy to announce the second release candidate for the next major version of simplehtmldom. It brings very important bug fixes, performance improvements and a few new features.
Important: This is a release candidate, which means some features might not yet be stable or emit unexpected behavior. Please don't hesitate to report broken or unstable features.
Here are the most notable changes:... read more
This has been requested many times and now it's here. The new composer package is available for current master:
composer require simplehtmldom/simlehtmldom dev-master
<?php
require_once 'vendor/autoload.php';
use simplehtmldom\HtmlWeb;
echo (new HtmlWeb())->load('https://google.com/')->find('title', 0)->plaintext;
Unfortunately it doesn't seem possible to automate the package with sourceforge, so I connected it with the GitHub fork instead.
I am happy to announce the first release candidate for the next major version of the parser. It brings exciting new features and performance improvements.
Important: This is a release candidate, which means some features might not yet be stable or emit unexpected behavior. Please don't hesitate to report broken or unstable features.
Here are the most notable changes:
</tr>
are being handled more efficiently. This results in much faster seek operations, especially on large documents. A performance boost of 10x or higher compared to version 1.9 is possible (when working with a lot of unclosed end tags).... read moreThis is a bug fix release which fixes support for "text" selectors.
Download version 1.9.1 at https://sourceforge.net/projects/simplehtmldom/files/simplehtmldom/1.9.1/
I'm happy to announce the release of PHP Simple HTML DOM Parser 1.9!
This release is focused on bug fixes and updates to the manuals but also brings a few new functions.
Please note that this will be the last 1.x release (except for bug fixes maybe). More details will be made available in the future.
Most notable changes in this version... read more
Great news to anyone who aims for secure data transmission!
The project page at http://simplehtmldom.sourceforge.net now redirects to https://simplehtmldom.sourceforge.io, which is much more secure (using HTTPS) and reliable (PHP 7.x) than the "old" server (HTTP + PHP 5.4)!
But there is more!
For the past weeks I've been working on updating the existing documentation.
It is not yet available on the main page, but you can take a look at https://simplehtmldom.sourceforge.io/docs... read more
Important Version 1.8 was replaced by 1.8.1 in order to fix critical bugs.
PHP Simple HTML DOM Parser 1.8.1 is now officially available at https://sourceforge.net/projects/simplehtmldom/files/simplehtmldom/1.8.1/
This release introduces lots of bug fixes and adds support for many exciting CSS features we have been longing for!
Most notable changes:
*
) now works as expected.>
, +
, ~
)|=
, ~=
).class.class.class
)[attr1][attr2][attribute3]
)i
and s
)... read morePHP Simple HTML DOM Parser 1.7 is now officially available at https://sourceforge.net/projects/simplehtmldom/files/simplehtmldom/1.7/
This release introduces bug fixes to the DOM parser and most importantly makes the project compatible to the most recent release of PHP 7.3, for which compatibility issues have been reported!
Most notable changes:
PHP Simple HTML DOM Parser 1.6, formally located at https://sourceforge.net/projects/simplehtmldom/files/simple_html_dom.php/download (and labeled as 1.5) is now an official release located in the releases folder https://sourceforge.net/projects/simplehtmldom/files/simplehtmldom/1.6/
This step is neccessary for getting back on track with new releases. The upcoming release 1.7 will be made available shortly!
Find the Git repository on the Repository tab.
With the Git repository you can fork the project, browse the commit history and open merge requests!... read more
I'd love to find someone to revamp the simplhtmldom.sourceforge.net "help"/"manual" html pages. I hate the current look, and would love to see a far more readable easy to follow set of pages. Once I can get the look, and structure overhauled, I have a number of features that are at present undocumented, that I can add to the documentation.
I'm not looking for an ongoing commitment to this project, merely an overhaul of a set of html pages that are the "manual" for the project.... read more
I've migrated to the new sourceforge project format. It doesn't appear that anything was lost, and the documentation homepage has stayed the same. I'm planning on making a pass thru the documentation sometime soon to make it way more up to date. If anyone wants to help make the formatting of those pages nicer, I'd be happy to take some help. Email me at John_Schlick@hotmail.com
I've also changed the debugging code inside of simple_html_dom to support the sourceforge debugobject project (download it at: https://sourceforge.net/projects/debugobject/ it's cool!).... read more
I have added a lot of little features and enhancements over the last year.
A number of internal issues have been ironed out, and a few new features have been added (the ability to search for specific text inside of a tag, the ability to discover the original display size of an IMG tag, and a few other little things.
Please download the code from the repository as thats ALWAYS the most current.
Many thanks to the person that emailed me the very comprehensive list of changes to support alternate character sets, the ->plaintext output is MUCH better now.
John.
Sourceforge just allowed me to take over the project. As such, I have updated the source that I have spent the last year working on.
Memory leak is fixed.
simple_html_dom now detects the character set.
plaintext looks better since it understands more about newlines in html and what things ought to look like.
All changes are fully configurable.
Many more little changes. Docs to come over the next week or two.
Fix the bug of parsing end-tag.
Fix the bug of endless "<".
Fix the bug of "remove_noise" method while stripping out tags.
Modify "example_customize_parser.php" with better regular expressions.
Add some guidelines for parser customization.