Hi!
I've ported Web Harvest to HttpComponents 4.2.3 instead of httpclient-3.1.
It seems to work in my personal use, so I will be glad if you want to review the code, and merge it in any way.
The repository is hosted on github.com, https://github.com/kLeZ/WebHarvest
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
C:\WebHarvest-master>mvn clean install [INFO] Scanning for projects... [ERROR] The build could not read 2 projects -> [Help 1] [ERROR] [ERROR] The project net.sourceforge.web-harvest:webharvest-core:2.1.0-SNAPSHOT (C:\WebHarvest-master\webharvest-core\pom.xm
l) has 20 errors [ERROR] 'dependencies.dependency.version' for cglib:cglib-nodep:jar is missing. @ line 60, column 21 [ERROR] 'dependencies.dependency.version' for org.slf4j:slf4j-api:jar is missing. @ line 65, column 21 [ERROR] 'dependencies.dependency.version' for net.sourceforge.htmlcleaner:htmlcleaner:jar is missing. @ line 69, column 2
1 [ERROR] 'dependencies.dependency.version' for commons-lang:commons-lang:jar is missing. @ line 73, column 21 [ERROR] 'dependencies.dependency.version' for commons-beanutils:commons-beanutils:jar is missing. @ line 77, column 21 [ERROR] 'dependencies.dependency.version' for commons-io:commons-io:jar is missing. @ line 81, column 21 [ERROR] 'dependencies.dependency.version' for commons-dbutils:commons-dbutils:jar is missing. @ line 85, column 21 [ERROR] 'dependencies.dependency.version' for commons-collections:commons-collections:jar is missing. @ line 89, column 2
1 [ERROR] 'dependencies.dependency.version' for commons-httpclient:commons-httpclient:jar is missing. @ line 93, column 21 [ERROR] 'dependencies.dependency.version' for org.apache.commons:commons-email:jar is missing. @ line 97, column 21 [ERROR] 'dependencies.dependency.version' for commons-net:commons-net:jar is missing. @ line 101, column 21 [ERROR] 'dependencies.dependency.version' for org.reflections:reflections:jar is missing. @ line 105, column 21 [ERROR] 'dependencies.dependency.version' for net.sourceforge.saxon:saxon:jar is missing. @ line 112, column 21 [ERROR] 'dependencies.dependency.version' for net.sourceforge.saxon:saxon:jar:dom is missing. @ line 116, column 21 [ERROR] 'dependencies.dependency.version' for org.beanshell:bsh:jar is missing. @ line 124, column 21 [ERROR] 'dependencies.dependency.version' for org.codehaus.groovy:groovy-all:jar is missing. @ line 128, column 21 [ERROR] 'dependencies.dependency.version' for rhino:js:jar is missing. @ line 132, column 21 [ERROR] 'dependencies.dependency.version' for com.google.guava:guava:jar is missing. @ line 137, column 21 [ERROR] 'dependencies.dependency.version' for com.google.inject:guice:jar is missing. @ line 142, column 21 [ERROR] 'dependencies.dependency.version' for com.google.inject.extensions:guice-assistedinject:jar is missing. @ line 14
6, column 21 [ERROR] [ERROR] The project net.sourceforge.web-harvest:webharvest-ide:2.1.0-SNAPSHOT (C:\WebHarvest-master\webharvest-ide\pom.xml)
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi!
I've ported Web Harvest to HttpComponents 4.2.3 instead of httpclient-3.1.
It seems to work in my personal use, so I will be glad if you want to review the code, and merge it in any way.
The repository is hosted on github.com, https://github.com/kLeZ/WebHarvest
very nice
C:\WebHarvest-master>mvn clean install
[INFO] Scanning for projects...
[ERROR] The build could not read 2 projects -> [Help 1]
[ERROR]
[ERROR] The project net.sourceforge.web-harvest:webharvest-core:2.1.0-SNAPSHOT (C:\WebHarvest-master\webharvest-core\pom.xm
l) has 20 errors
[ERROR] 'dependencies.dependency.version' for cglib:cglib-nodep:jar is missing. @ line 60, column 21
[ERROR] 'dependencies.dependency.version' for org.slf4j:slf4j-api:jar is missing. @ line 65, column 21
[ERROR] 'dependencies.dependency.version' for net.sourceforge.htmlcleaner:htmlcleaner:jar is missing. @ line 69, column 2
1
[ERROR] 'dependencies.dependency.version' for commons-lang:commons-lang:jar is missing. @ line 73, column 21
[ERROR] 'dependencies.dependency.version' for commons-beanutils:commons-beanutils:jar is missing. @ line 77, column 21
[ERROR] 'dependencies.dependency.version' for commons-io:commons-io:jar is missing. @ line 81, column 21
[ERROR] 'dependencies.dependency.version' for commons-dbutils:commons-dbutils:jar is missing. @ line 85, column 21
[ERROR] 'dependencies.dependency.version' for commons-collections:commons-collections:jar is missing. @ line 89, column 2
1
[ERROR] 'dependencies.dependency.version' for commons-httpclient:commons-httpclient:jar is missing. @ line 93, column 21
[ERROR] 'dependencies.dependency.version' for org.apache.commons:commons-email:jar is missing. @ line 97, column 21
[ERROR] 'dependencies.dependency.version' for commons-net:commons-net:jar is missing. @ line 101, column 21
[ERROR] 'dependencies.dependency.version' for org.reflections:reflections:jar is missing. @ line 105, column 21
[ERROR] 'dependencies.dependency.version' for net.sourceforge.saxon:saxon:jar is missing. @ line 112, column 21
[ERROR] 'dependencies.dependency.version' for net.sourceforge.saxon:saxon:jar:dom is missing. @ line 116, column 21
[ERROR] 'dependencies.dependency.version' for org.beanshell:bsh:jar is missing. @ line 124, column 21
[ERROR] 'dependencies.dependency.version' for org.codehaus.groovy:groovy-all:jar is missing. @ line 128, column 21
[ERROR] 'dependencies.dependency.version' for rhino:js:jar is missing. @ line 132, column 21
[ERROR] 'dependencies.dependency.version' for com.google.guava:guava:jar is missing. @ line 137, column 21
[ERROR] 'dependencies.dependency.version' for com.google.inject:guice:jar is missing. @ line 142, column 21
[ERROR] 'dependencies.dependency.version' for com.google.inject.extensions:guice-assistedinject:jar is missing. @ line 14
6, column 21
[ERROR]
[ERROR] The project net.sourceforge.web-harvest:webharvest-ide:2.1.0-SNAPSHOT (C:\WebHarvest-master\webharvest-ide\pom.xml)