
#1733 Maximum line length for filtering?

3.0.28
closed
None
5
2023-08-15
2020-08-30
withoutname
No

Link text is not filtered: http://yandex.ru/news/_nzp/g141k632l/9a603eeK/eh2p3xYedUAFxtXqe7cULKRafbLc0oh1PPI1LacG01hHEdaHuPaaYT7W7o6wZTUBTHMn9SlwnWSgvHNVe7Dj_jqsDL-nT9RgxULVao-XVFzVWk0yyXMpxcwWpIxmd_PYpyQig4yjnoWvRz6vAFXkI40y12TYYd4UUCyHBfqUgj5qbd1vpz81cHRTNcp_VoUM9UoNE5iSKLVMlvSt52czGDZ0gjMMo3-1Pgd-n8E0dTJMEleFCcBuFaHcJ8SblFLvih0djmbu1SD1U6Srn-aULaQaXRJ4gxmkPIY0bcfGojjGRRMiHKJPFY42D36QhCTinTNG9BlRD9QBzQcFClVzPjotnb_WL5WgtLOV6o-WdLzUOx0D6TLoNT3GZAwWh6M5VuQiov1Dj1Tf1p8vcBSF06xCtwRI0D_0UVx3Vepk0h_KnMxPJ381kSXSJKuuVxRs9aoM0slD2NXsF0Vsd3cCqVelQqOtco_VnqcvP-FlpJIsQwb1aJAeBFB89pVqNYP_623dHjcO1EBFUhX6fnZUfRRKfILoQxnVLNbErEaWopl21ZPjDeM-pf6Gn__Q9TXTraLmdXlB3iTALceVe7Sy7rtsDW5W3-UwdQJkO88mpJ1FqizTmNJolez3pJ22dnN5VgQDI-yDj4WuRq7ugWRkg-wD5sVpcG7F0fznxer0gv_6rH1O1251YUVjpYv-R5U81dqM42mi-dVdBoRMVnbz2YbV0jM84q-0n2c_7lGkJNOd8hcUKICO1aC8toWbxfLuq6w8vsY-5AEUAlXqf9aVTVR7DPOZU5mEDFYULGen4vknpANSrWNfBQ9Hvn6RNaUyzWLXZXkBDtWBjDdFSwWyLvrtXY5nT9VgBSKVW85WRM2li82DqEKoRFz2lfzWt6L4huWi4wzD_nUexi8fQUXkc72j5mRZIS6lMezWlIvUs6-aPS0eVp80YNUjlfuftrQdFCqNEskjGPT99jQM1pcTWVdkExL9ki9Vv2YPL8CVJQKMwnel-PHv1ZDcl1W71MO-yq2tXlb-9TEkIoVLn_eUreVbjKLYwinFXef0HAan8ylGpAMjDHIeBS7HDi_RVFVCHGN21OjhL7UQrWekitQSXkr97V-HboWRJHK0a6_3VT3V-61iWUMp5SzWhWwWl-LYZ0XCsm1zPzSO9k4_wSR1M41iF4Vpgd5lgN03VJvE8i5abRw-pt70ANXDpNuPt1U8FTv84qgyGLUsdvRc1wYD2HcEstON435lPkce7tDk9WItAneESfBuNZDsp0Ub9JM_mi3crkaOxEFEcvRqjmY1DBS67aPJkyi0PFYVLKdXsshmBRLCPSPedE5mP1_AleSjHMLmRIlxjgSRrJcU2ySDDspdXb5GLuRh1OKFiq4ntUzlK9xzeCPIJa32ZUxnFrIJh7Wzwl1zfkRuV15PkNTFcq1zB8UIQb400N0WpQrUQk-bfHyels8UUOTytJvu5xSs5Ju9w4giKLQNxhQcByYD2Vd1spI8sm_0bsa_H0AEhLMsQzbkGNA_1DCc94TKJcLuK1wNj9euxVHE8qW6vjd1rWR67WLJktm0DEb1_Wdm06gmdULSDLPP5R7Wro_QRHQiHHM31QhB_4Qg3IbkqkQjTvpMfQ-mj9Sg5UNU-8-W5TzUK72CyRLIdQ3WVd1mh9Noh2QDs5xyDrSuRy5ecQRkgm0zdsRYQa-VoawGheslYw7LnHxet64lURRzlOvPN4TsJfrtMwiiGSUc5tRd1keiyKc0UxIdoq8Ebrbvf4FllQL8Y2dUuRDvFIAdx3W7FULvm60dz9cu1GC0YkRKfubkLdVqHVO4gujkXJdETOf3EgkHBAMT3bJuhF-W3k-BlDUCTaMHVVmQPqWgnObVSpQTbluMHc-WvsVgFFM1S6_GhE3UuxzT6PPItdwXxDzXZiM5lrUC4ny
yj6RON77P4JRFQu2z58VJgB81sK1W9PoEwm6qHY1e9x4kULTCxYtvZtUdFfutwxlyyYWdhjVtxtZTCZa0ooI8w09lHref_0AURJJMAkaFSEHetFHtR2T6VMOvis28fwcOZKAVMgRr32YkXaWLDOLY4zglbIbFrKfHk-gmNCLSvaOf9d4XD37hpbRy3AMWlQlgz_UQjLYlu8TD_7od_f53rnRA9BLV215W1S20mi3zmNOIBBxHhFxH18II1uVTQuzCfkTuxi8PkASFUv3it_VYke-Fgby3VLr0I04KnZxux08FwNRSlLq-ZtSMJDr8c8miqBWs1vXsZoeTyXZF04Ltch5lDvbeToB0lcJM00ZE-HDvxTG8ZvTL1bIP6kzcDybeNeHFYxXr71aVXMUqHNOpE7h0PYakLHZm43kG1aIDzOMflG_Gvs8wBeQyjNLX9MjR3gWx3CeFClRTHksMzb7Hn-UgNeJ0W3_mNO3F-wzi6JL5tYxXZIx2xoOpN0QSM-2SXmTel28fUMUUY9wCd5TIgD_kIYyH9NrEQ-_bDD0uU
The filter is like this:

s@window\.Ya\..+\.RumLogging=100\*Math\.random.+@//clr 1@sig

The pattern matches the start of the second-longest line in the file. My guess at the cause is a limit on line length, approximately no more than a million characters.
Thank you!

Discussion

  • Fabian Keil

    Fabian Keil - 2020-09-02
    • status: open --> pending
    • assigned_to: Fabian Keil
     
  • Fabian Keil

    Fabian Keil - 2020-09-02

    Privoxy itself does not limit the line length for filters but relies on pcre which can be compiled with a match limit.
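To illustrate what a match limit means in practice, here is a toy backtracking matcher in Python with a step budget. It is only a sketch of the idea: the names `match_with_limit` and `MatchLimitReached` are made up for this example, the matcher supports only literals, `.` and `x*`, and pcre's real accounting works differently. The point is that the budget is spent on backtracking steps, which grow with the length of the text the pattern has to scan, so a very long line can exhaust it even though no line-length limit exists as such.

```python
class MatchLimitReached(Exception):
    """Raised when the toy matcher exceeds its step budget (hypothetical name)."""


def match_with_limit(pattern, text, limit):
    """Toy backtracking matcher supporting literals, '.' and 'x*'.

    Returns (matched, steps). Raises MatchLimitReached once more than
    `limit` steps have been spent. This is an illustration, not pcre.
    """
    steps = 0

    def match_here(p, t):
        nonlocal steps
        steps += 1
        if steps > limit:
            raise MatchLimitReached(steps)
        if p == len(pattern):
            return True
        if p + 1 < len(pattern) and pattern[p + 1] == "*":
            # Greedy star: consume as much as possible, then back off
            # one character at a time until the rest of the pattern fits.
            end = t
            while end < len(text) and pattern[p] in (".", text[end]):
                end += 1
            while end >= t:
                if match_here(p + 2, end):
                    return True
                end -= 1
            return False
        if t < len(text) and pattern[p] in (".", text[t]):
            return match_here(p + 1, t + 1)
        return False

    # Unanchored search: try every start position.
    for start in range(len(text) + 1):
        if match_here(0, start):
            return True, steps
    return False, steps


if __name__ == "__main__":
    print(match_with_limit("a.*b.*c", "axxbyyc", 1000))
    try:
        match_with_limit("a.*b.*c", "a" + "b" * 200, 1000)
    except MatchLimitReached as exc:
        print("limit reached after", exc.args[0], "steps")
```

With two `.*` in the pattern and no final `c` in the input, the number of steps grows roughly quadratically with the input length, so the same pattern that is cheap on a short line can run out of budget on a long one.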

    Unfortunately from reading your request I don't completely understand what you are trying to do and what isn't working.

    Can you please clarify?

     
  • withoutname

    withoutname - 2020-09-03

    The link loads a JavaScript file, which the filter from my first message is supposed to clean, since the script is used to serve ads.

     

    Last edit: withoutname 2020-09-03
  • Fabian Keil

    Fabian Keil - 2020-09-11

    I can't reproduce this on ElectroBSD with Privoxy 3.0.29 built from git and pcre 8.44.

    Using Privoxy-Filter-Test I get:

    --- /home/fk/privoxy/privoxy-filter-test/original-419505773651511744-file-fetched.html Fri Sep 11 15:01:16 2020
    +++ /home/fk/privoxy/privoxy-filter-test/filtered-419505773651511744-file-fetched.html Fri Sep 11 15:03:22 2020
    @@ -1,2 +1,2 @@
    /*! For license information please see context_static_adb.js.LICENSE.txt */
    -window.Ya.jFOeuwNbwdA.RumLogging=100*Math.random()[...].Rum.sendTimeMark("60.2335.1428")}}();
    \ No newline at end of file
    +//clr 1
    \ No newline at end of file
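For readers who want to see what the command does without setting up Privoxy, the following Python sketch approximates it with the standard `re` module. This is an approximation, not Privoxy-Filter-Test: Python's `re` is not pcre, the function name `apply_filter` is made up, and the sample body is a hypothetical, heavily shortened stand-in for the real script. The pcrs flags are mapped as `s` to `re.DOTALL`, `i` to `re.IGNORECASE`, and `g` to replacing every match.

```python
import re

# Pattern and replacement taken from the pcrs command
# s@window\.Ya\..+\.RumLogging=100\*Math\.random.+@//clr 1@sig
PATTERN = r"window\.Ya\..+\.RumLogging=100\*Math\.random.+"
REPLACEMENT = "//clr 1"


def apply_filter(body: str) -> str:
    """Approximate the pcrs command with Python's re module."""
    # s -> DOTALL ('.' also matches newlines), i -> IGNORECASE,
    # g -> re.sub replaces all matches by default.
    return re.sub(PATTERN, REPLACEMENT, body, flags=re.DOTALL | re.IGNORECASE)


# Hypothetical stand-in for the fetched file; the real second line is far longer.
sample = (
    "/*! For license information please see context_static_adb.js.LICENSE.txt */\n"
    "window.Ya.jFOeuwNbwdA.RumLogging=100*Math.random()<1;/* ... */"
)

if __name__ == "__main__":
    print(apply_filter(sample))
```

Because of the `s` flag, the trailing `.+` swallows everything up to the end of the body, which is why the whole second line is replaced in the diff above.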

    As the response is rather large I had to increase the buffer-limit to 4096 which is already the default.

    Which operating system and pcre version do you use?

     
  • withoutname

    withoutname - 2020-09-11

    Privoxy 3.0.28, Ubuntu Server 20.04 LTS, buffer-limit 16384.

     
  • Fabian Keil

    Fabian Keil - 2020-09-24

    Can you reproduce the problem with filter debugging enabled ("debug 64") and provide a log excerpt?

     
  • withoutname

    withoutname - 2020-09-26
    2020-09-26 22:26:49.004 7f83037fe700 Re-Filter: filtering 'GET http://yandex.ru/news/_nzp/g141k632l/9a603eeK/eh2p3xYedUAFxtXqe7cULKRafbLc0oh1PPI1LacG01hHEdaHuPaaYT7W7o6wZTUBTHMn9SlwnWSgvHNVe7Dj_jqsDL-nT9RgxULVao-XVFzVWk0yyXMpxcwWpIxmd_PYpyQig4yjnoWvRz6vAFXkI40y12TYYd4UUCyHBfqUgj5qbd1vpz81cHRTNcp_VoUM9UoNE5iSKLVMlvSt52czGDZ0gjMMo3-1Pgd-n8E0dTJMEleFCcBuFaHcJ8SblFLvih0djmbu1SD1U6Srn-aULaQaXRJ4gxmkPIY0bcfGojjGRRMiHKJPFY42D36QhCTinTNG9BlRD9QBzQcFClVzPjotnb_WL5WgtLOV6o-WdLzUOx0D6TLoNT3GZAwWh6M5VuQiov1Dj1Tf1p8vcBSF06xCtwRI0D_0UVx3Vepk0h_KnMxPJ381kSXSJKuuVxRs9aoM0slD2NXsF0Vsd3cCqVelQqOtco_VnqcvP-FlpJIsQwb1aJAeBFB89pVqNYP_623dHjcO1EBFUhX6fnZUfRRKfILoQxnVLNbErEaWopl21ZPjDeM-pf6Gn__Q9TXTraLmdXlB3iTALceVe7Sy7rtsDW5W3-UwdQJkO88mpJ1FqizTmNJolez3pJ22dnN5VgQDI-yDj4WuRq7ugWRkg-wD5sVpcG7F0fznxer0gv_6rH1O1251YUVjpYv-R5U81dqM42mi-dVdBoRMVnbz2YbV0jM84q-0n2c_7lGkJNOd8hcUKICO1aC8toWbxfLuq6w8vsY-5AEUAlXqf9aVTVR7DPOZU5mEDFYULGen4vknpANSrWNfBQ9Hvn6RNaUyzWLXZXkBDtWBjDdFSwWyLvrtXY5nT9VgBSKVW85WRM2li82DqEKoRFz2lfzWt6L4huWi4wzD_nUexi8fQUXkc72j5mRZIS6lMezWlIvUs6-aPS0eVp80YNUjlfuftrQdFCqNEskjGPT99jQM1pcTWVdkExL9ki9Vv2YPL8CVJQKMwnel-PHv1ZDcl1W71MO-yq2tXlb-9TEkIoVLn_eUreVbjKLYwinFXef0HAan8ylGpAMjDHIeBS7HDi_RVFVCHGN21OjhL7UQrWekitQSXkr97V-HboWRJHK0a6_3VT3V-61iWUMp5SzWhWwWl-LYZ0XCsm1zPzSO9k4_wSR1M41iF4Vpgd5lgN03VJvE8i5abRw-pt70ANXDpNuPt1U8FTv84qgyGLUsdvRc1wYD2HcEstON435lPkce7tDk9WItAneESfBuNZDsp0Ub9JM_mi3crkaOxEFEcvRqjmY1DBS67aPJkyi0PFYVLKdXsshmBRLCPSPedE5mP1_AleSjHMLmRIlxjgSRrJcU2ySDDspdXb5GLuRh1OKFiq4ntUzlK9xzeCPIJa32ZUxnFrIJh7Wzwl1zfkRuV15PkNTFcq1zB8UIQb400N0WpQrUQk-bfHyels8UUOTytJvu5xSs5Ju9w4giKLQNxhQcByYD2Vd1spI8sm_0bsa_H0AEhLMsQzbkGNA_1DCc94TKJcLuK1wNj9euxVHE8qW6vjd1rWR67WLJktm0DEb1_Wdm06gmdULSDLPP5R7Wro_QRHQiHHM31QhB_4Qg3IbkqkQjTvpMfQ-mj9Sg5UNU-8-W5TzUK72CyRLIdQ3WVd1mh9Noh2QDs5xyDrSuRy5ecQRkgm0zdsRYQa-VoawGheslYw7LnHxet64lURRzlOvPN4TsJfrtMwiiGSUc5tRd1keiyKc0UxIdoq8Ebrbvf4FllQL8Y2dUuRDvFIAdx3W7FULvm60dz9cu1GC0YkRKfubkLdVqHVO4gujkXJdETOf3EgkHBAMT3bJuhF-W3k-BlDUCTaMHVVmQPqWgnObVSpQTbluMHc-WvsV
gFFM1S6_GhE3UuxzT6PPItdwXxDzXZiM5lrUC4nyyj6RON77P4JRFQu2z58VJgB81sK1W9PoEwm6qHY1e9x4kULTCxYtvZtUdFfutwxlyyYWdhjVtxtZTCZa0ooI8w09lHref_0AURJJMAkaFSEHetFHtR2T6VMOvis28fwcOZKAVMgRr32YkXaWLDOLY4zglbIbFrKfHk-gmNCLSvaOf9d4XD37hpbRy3AMWlQlgz_UQjLYlu8TD_7od_f53rnRA9BLV215W1S20mi3zmNOIBBxHhFxH18II1uVTQuzCfkTuxi8PkASFUv3it_VYke-Fgby3VLr0I04KnZxux08FwNRSlLq-ZtSMJDr8c8miqBWs1vXsZoeTyXZF04Ltch5lDvbeToB0lcJM00ZE-HDvxTG8ZvTL1bIP6kzcDybeNeHFYxXr71aVXMUqHNOpE7h0PYakLHZm43kG1aIDzOMflG_Gvs8wBeQyjNLX9MjR3gWx3CeFClRTHksMzb7Hn-UgNeJ0W3_mNO3F-wzi6JL5tYxXZIx2xoOpN0QSM-2SXmTel28fUMUUY9wCd5TIgD_kIYyH9NrEQ-_bDD0uU HTTP/1.1' (size 2540) with 'redirect-base64' ...
    2020-09-26 22:26:49.005 7f83037fe700 Re-Filter: ... produced 0 hits (new size 2540).
    2020-09-26 22:26:49.005 7f83037fe700 Re-Filter: filtering 'GET http://yandex.ru/news/_nzp/g141k632l/9a603eeK/eh2p3xYedUAFxtXqe7cULKRafbLc0oh1PPI1LacG01hHEdaHuPaaYT7W7o6wZTUBTHMn9SlwnWSgvHNVe7Dj_jqsDL-nT9RgxULVao-XVFzVWk0yyXMpxcwWpIxmd_PYpyQig4yjnoWvRz6vAFXkI40y12TYYd4UUCyHBfqUgj5qbd1vpz81cHRTNcp_VoUM9UoNE5iSKLVMlvSt52czGDZ0gjMMo3-1Pgd-n8E0dTJMEleFCcBuFaHcJ8SblFLvih0djmbu1SD1U6Srn-aULaQaXRJ4gxmkPIY0bcfGojjGRRMiHKJPFY42D36QhCTinTNG9BlRD9QBzQcFClVzPjotnb_WL5WgtLOV6o-WdLzUOx0D6TLoNT3GZAwWh6M5VuQiov1Dj1Tf1p8vcBSF06xCtwRI0D_0UVx3Vepk0h_KnMxPJ381kSXSJKuuVxRs9aoM0slD2NXsF0Vsd3cCqVelQqOtco_VnqcvP-FlpJIsQwb1aJAeBFB89pVqNYP_623dHjcO1EBFUhX6fnZUfRRKfILoQxnVLNbErEaWopl21ZPjDeM-pf6Gn__Q9TXTraLmdXlB3iTALceVe7Sy7rtsDW5W3-UwdQJkO88mpJ1FqizTmNJolez3pJ22dnN5VgQDI-yDj4WuRq7ugWRkg-wD5sVpcG7F0fznxer0gv_6rH1O1251YUVjpYv-R5U81dqM42mi-dVdBoRMVnbz2YbV0jM84q-0n2c_7lGkJNOd8hcUKICO1aC8toWbxfLuq6w8vsY-5AEUAlXqf9aVTVR7DPOZU5mEDFYULGen4vknpANSrWNfBQ9Hvn6RNaUyzWLXZXkBDtWBjDdFSwWyLvrtXY5nT9VgBSKVW85WRM2li82DqEKoRFz2lfzWt6L4huWi4wzD_nUexi8fQUXkc72j5mRZIS6lMezWlIvUs6-aPS0eVp80YNUjlfuftrQdFCqNEskjGPT99jQM1pcTWVdkExL9ki9Vv2YPL8CVJQKMwnel-PHv1ZDcl1W71MO-yq2tXlb-9TEkIoVLn_eUreVbjKLYwinFXef0HAan8ylGpAMjDHIeBS7HDi_RVFVCHGN21OjhL7UQrWekitQSXkr97V-HboWRJHK0a6_3VT3V-61iWUMp5SzWhWwWl-LYZ0XCsm1zPzSO9k4_wSR1M41iF4Vpgd5lgN03VJvE8i5abRw-pt70ANXDpNuPt1U8FTv84qgyGLUsdvRc1wYD2HcEstON435lPkce7tDk9WItAneESfBuNZDsp0Ub9JM_mi3crkaOxEFEcvRqjmY1DBS67aPJkyi0PFYVLKdXsshmBRLCPSPedE5mP1_AleSjHMLmRIlxjgSRrJcU2ySDDspdXb5GLuRh1OKFiq4ntUzlK9xzeCPIJa32ZUxnFrIJh7Wzwl1zfkRuV15PkNTFcq1zB8UIQb400N0WpQrUQk-bfHyels8UUOTytJvu5xSs5Ju9w4giKLQNxhQcByYD2Vd1spI8sm_0bsa_H0AEhLMsQzbkGNA_1DCc94TKJcLuK1wNj9euxVHE8qW6vjd1rWR67WLJktm0DEb1_Wdm06gmdULSDLPP5R7Wro_QRHQiHHM31QhB_4Qg3IbkqkQjTvpMfQ-mj9Sg5UNU-8-W5TzUK72CyRLIdQ3WVd1mh9Noh2QDs5xyDrSuRy5ecQRkgm0zdsRYQa-VoawGheslYw7LnHxet64lURRzlOvPN4TsJfrtMwiiGSUc5tRd1keiyKc0UxIdoq8Ebrbvf4FllQL8Y2dUuRDvFIAdx3W7FULvm60dz9cu1GC0YkRKfubkLdVqHVO4gujkXJdETOf3EgkHBAMT3bJuhF-W3k-BlDUCTaMHVVmQPqWgnObVSpQTbluMHc-WvsV
gFFM1S6_GhE3UuxzT6PPItdwXxDzXZiM5lrUC4nyyj6RON77P4JRFQu2z58VJgB81sK1W9PoEwm6qHY1e9x4kULTCxYtvZtUdFfutwxlyyYWdhjVtxtZTCZa0ooI8w09lHref_0AURJJMAkaFSEHetFHtR2T6VMOvis28fwcOZKAVMgRr32YkXaWLDOLY4zglbIbFrKfHk-gmNCLSvaOf9d4XD37hpbRy3AMWlQlgz_UQjLYlu8TD_7od_f53rnRA9BLV215W1S20mi3zmNOIBBxHhFxH18II1uVTQuzCfkTuxi8PkASFUv3it_VYke-Fgby3VLr0I04KnZxux08FwNRSlLq-ZtSMJDr8c8miqBWs1vXsZoeTyXZF04Ltch5lDvbeToB0lcJM00ZE-HDvxTG8ZvTL1bIP6kzcDybeNeHFYxXr71aVXMUqHNOpE7h0PYakLHZm43kG1aIDzOMflG_Gvs8wBeQyjNLX9MjR3gWx3CeFClRTHksMzb7Hn-UgNeJ0W3_mNO3F-wzi6JL5tYxXZIx2xoOpN0QSM-2SXmTel28fUMUUY9wCd5TIgD_kIYyH9NrEQ-_bDD0uU HTTP/1.1' (size 2540) with 'del-X-Client-Data' ...
    2020-09-26 22:26:49.005 7f83037fe700 Re-Filter: ... produced 0 hits (new size 2540).
    2020-09-26 22:26:49.005 7f83037fe700 Re-Filter: filtering 'GET http://yandex.ru/news/_nzp/g141k632l/9a603eeK/eh2p3xYedUAFxtXqe7cULKRafbLc0oh1PPI1LacG01hHEdaHuPaaYT7W7o6wZTUBTHMn9SlwnWSgvHNVe7Dj_jqsDL-nT9RgxULVao-XVFzVWk0yyXMpxcwWpIxmd_PYpyQig4yjnoWvRz6vAFXkI40y12TYYd4UUCyHBfqUgj5qbd1vpz81cHRTNcp_VoUM9UoNE5iSKLVMlvSt52czGDZ0gjMMo3-1Pgd-n8E0dTJMEleFCcBuFaHcJ8SblFLvih0djmbu1SD1U6Srn-aULaQaXRJ4gxmkPIY0bcfGojjGRRMiHKJPFY42D36QhCTinTNG9BlRD9QBzQcFClVzPjotnb_WL5WgtLOV6o-WdLzUOx0D6TLoNT3GZAwWh6M5VuQiov1Dj1Tf1p8vcBSF06xCtwRI0D_0UVx3Vepk0h_KnMxPJ381kSXSJKuuVxRs9aoM0slD2NXsF0Vsd3cCqVelQqOtco_VnqcvP-FlpJIsQwb1aJAeBFB89pVqNYP_623dHjcO1EBFUhX6fnZUfRRKfILoQxnVLNbErEaWopl21ZPjDeM-pf6Gn__Q9TXTraLmdXlB3iTALceVe7Sy7rtsDW5W3-UwdQJkO88mpJ1FqizTmNJolez3pJ22dnN5VgQDI-yDj4WuRq7ugWRkg-wD5sVpcG7F0fznxer0gv_6rH1O1251YUVjpYv-R5U81dqM42mi-dVdBoRMVnbz2YbV0jM84q-0n2c_7lGkJNOd8hcUKICO1aC8toWbxfLuq6w8vsY-5AEUAlXqf9aVTVR7DPOZU5mEDFYULGen4vknpANSrWNfBQ9Hvn6RNaUyzWLXZXkBDtWBjDdFSwWyLvrtXY5nT9VgBSKVW85WRM2li82DqEKoRFz2lfzWt6L4huWi4wzD_nUexi8fQUXkc72j5mRZIS6lMezWlIvUs6-aPS0eVp80YNUjlfuftrQdFCqNEskjGPT99jQM1pcTWVdkExL9ki9Vv2YPL8CVJQKMwnel-PHv1ZDcl1W71MO-yq2tXlb-9TEkIoVLn_eUreVbjKLYwinFXef0HAan8ylGpAMjDHIeBS7HDi_RVFVCHGN21OjhL7UQrWekitQSXkr97V-HboWRJHK0a6_3VT3V-61iWUMp5SzWhWwWl-LYZ0XCsm1zPzSO9k4_wSR1M41iF4Vpgd5lgN03VJvE8i5abRw-pt70ANXDpNuPt1U8FTv84qgyGLUsdvRc1wYD2HcEstON435lPkce7tDk9WItAneESfBuNZDsp0Ub9JM_mi3crkaOxEFEcvRqjmY1DBS67aPJkyi0PFYVLKdXsshmBRLCPSPedE5mP1_AleSjHMLmRIlxjgSRrJcU2ySDDspdXb5GLuRh1OKFiq4ntUzlK9xzeCPIJa32ZUxnFrIJh7Wzwl1zfkRuV15PkNTFcq1zB8UIQb400N0WpQrUQk-bfHyels8UUOTytJvu5xSs5Ju9w4giKLQNxhQcByYD2Vd1spI8sm_0bsa_H0AEhLMsQzbkGNA_1DCc94TKJcLuK1wNj9euxVHE8qW6vjd1rWR67WLJktm0DEb1_Wdm06gmdULSDLPP5R7Wro_QRHQiHHM31QhB_4Qg3IbkqkQjTvpMfQ-mj9Sg5UNU-8-W5TzUK72CyRLIdQ3WVd1mh9Noh2QDs5xyDrSuRy5ecQRkgm0zdsRYQa-VoawGheslYw7LnHxet64lURRzlOvPN4TsJfrtMwiiGSUc5tRd1keiyKc0UxIdoq8Ebrbvf4FllQL8Y2dUuRDvFIAdx3W7FULvm60dz9cu1GC0YkRKfubkLdVqHVO4gujkXJdETOf3EgkHBAMT3bJuhF-W3k-BlDUCTaMHVVmQPqWgnObVSpQTbluMHc-WvsV
gFFM1S6_GhE3UuxzT6PPItdwXxDzXZiM5lrUC4nyyj6RON77P4JRFQu2z58VJgB81sK1W9PoEwm6qHY1e9x4kULTCxYtvZtUdFfutwxlyyYWdhjVtxtZTCZa0ooI8w09lHref_0AURJJMAkaFSEHetFHtR2T6VMOvis28fwcOZKAVMgRr32YkXaWLDOLY4zglbIbFrKfHk-gmNCLSvaOf9d4XD37hpbRy3AMWlQlgz_UQjLYlu8TD_7od_f53rnRA9BLV215W1S20mi3zmNOIBBxHhFxH18II1uVTQuzCfkTuxi8PkASFUv3it_VYke-Fgby3VLr0I04KnZxux08FwNRSlLq-ZtSMJDr8c8miqBWs1vXsZoeTyXZF04Ltch5lDvbeToB0lcJM00ZE-HDvxTG8ZvTL1bIP6kzcDybeNeHFYxXr71aVXMUqHNOpE7h0PYakLHZm43kG1aIDzOMflG_Gvs8wBeQyjNLX9MjR3gWx3CeFClRTHksMzb7Hn-UgNeJ0W3_mNO3F-wzi6JL5tYxXZIx2xoOpN0QSM-2SXmTel28fUMUUY9wCd5TIgD_kIYyH9NrEQ-_bDD0uU HTTP/1.1' (size 2540) with 'yandexuid' ...
    2020-09-26 22:26:49.005 7f83037fe700 Re-Filter: ... produced 0 hits (new size 2540).
    2020-09-26 22:26:49.005 7f83037fe700 Re-Filter: filtering 'GET http://yandex.ru/news/_nzp/g141k632l/9a603eeK/eh2p3xYedUAFxtXqe7cULKRafbLc0oh1PPI1LacG01hHEdaHuPaaYT7W7o6wZTUBTHMn9SlwnWSgvHNVe7Dj_jqsDL-nT9RgxULVao-XVFzVWk0yyXMpxcwWpIxmd_PYpyQig4yjnoWvRz6vAFXkI40y12TYYd4UUCyHBfqUgj5qbd1vpz81cHRTNcp_VoUM9UoNE5iSKLVMlvSt52czGDZ0gjMMo3-1Pgd-n8E0dTJMEleFCcBuFaHcJ8SblFLvih0djmbu1SD1U6Srn-aULaQaXRJ4gxmkPIY0bcfGojjGRRMiHKJPFY42D36QhCTinTNG9BlRD9QBzQcFClVzPjotnb_WL5WgtLOV6o-WdLzUOx0D6TLoNT3GZAwWh6M5VuQiov1Dj1Tf1p8vcBSF06xCtwRI0D_0UVx3Vepk0h_KnMxPJ381kSXSJKuuVxRs9aoM0slD2NXsF0Vsd3cCqVelQqOtco_VnqcvP-FlpJIsQwb1aJAeBFB89pVqNYP_623dHjcO1EBFUhX6fnZUfRRKfILoQxnVLNbErEaWopl21ZPjDeM-pf6Gn__Q9TXTraLmdXlB3iTALceVe7Sy7rtsDW5W3-UwdQJkO88mpJ1FqizTmNJolez3pJ22dnN5VgQDI-yDj4WuRq7ugWRkg-wD5sVpcG7F0fznxer0gv_6rH1O1251YUVjpYv-R5U81dqM42mi-dVdBoRMVnbz2YbV0jM84q-0n2c_7lGkJNOd8hcUKICO1aC8toWbxfLuq6w8vsY-5AEUAlXqf9aVTVR7DPOZU5mEDFYULGen4vknpANSrWNfBQ9Hvn6RNaUyzWLXZXkBDtWBjDdFSwWyLvrtXY5nT9VgBSKVW85WRM2li82DqEKoRFz2lfzWt6L4huWi4wzD_nUexi8fQUXkc72j5mRZIS6lMezWlIvUs6-aPS0eVp80YNUjlfuftrQdFCqNEskjGPT99jQM1pcTWVdkExL9ki9Vv2YPL8CVJQKMwnel-PHv1ZDcl1W71MO-yq2tXlb-9TEkIoVLn_eUreVbjKLYwinFXef0HAan8ylGpAMjDHIeBS7HDi_RVFVCHGN21OjhL7UQrWekitQSXkr97V-HboWRJHK0a6_3VT3V-61iWUMp5SzWhWwWl-LYZ0XCsm1zPzSO9k4_wSR1M41iF4Vpgd5lgN03VJvE8i5abRw-pt70ANXDpNuPt1U8FTv84qgyGLUsdvRc1wYD2HcEstON435lPkce7tDk9WItAneESfBuNZDsp0Ub9JM_mi3crkaOxEFEcvRqjmY1DBS67aPJkyi0PFYVLKdXsshmBRLCPSPedE5mP1_AleSjHMLmRIlxjgSRrJcU2ySDDspdXb5GLuRh1OKFiq4ntUzlK9xzeCPIJa32ZUxnFrIJh7Wzwl1zfkRuV15PkNTFcq1zB8UIQb400N0WpQrUQk-bfHyels8UUOTytJvu5xSs5Ju9w4giKLQNxhQcByYD2Vd1spI8sm_0bsa_H0AEhLMsQzbkGNA_1DCc94TKJcLuK1wNj9euxVHE8qW6vjd1rWR67WLJktm0DEb1_Wdm06gmdULSDLPP5R7Wro_QRHQiHHM31QhB_4Qg3IbkqkQjTvpMfQ-mj9Sg5UNU-8-W5TzUK72CyRLIdQ3WVd1mh9Noh2QDs5xyDrSuRy5ecQRkgm0zdsRYQa-VoawGheslYw7LnHxet64lURRzlOvPN4TsJfrtMwiiGSUc5tRd1keiyKc0UxIdoq8Ebrbvf4FllQL8Y2dUuRDvFIAdx3W7FULvm60dz9cu1GC0YkRKfubkLdVqHVO4gujkXJdETOf3EgkHBAMT3bJuhF-W3k-BlDUCTaMHVVmQPqWgnObVSpQTbluMHc-WvsV
gFFM1S6_GhE3UuxzT6PPItdwXxDzXZiM5lrUC4nyyj6RON77P4JRFQu2z58VJgB81sK1W9PoEwm6qHY1e9x4kULTCxYtvZtUdFfutwxlyyYWdhjVtxtZTCZa0ooI8w09lHref_0AURJJMAkaFSEHetFHtR2T6VMOvis28fwcOZKAVMgRr32YkXaWLDOLY4zglbIbFrKfHk-gmNCLSvaOf9d4XD37hpbRy3AMWlQlgz_UQjLYlu8TD_7od_f53rnRA9BLV215W1S20mi3zmNOIBBxHhFxH18II1uVTQuzCfkTuxi8PkASFUv3it_VYke-Fgby3VLr0I04KnZxux08FwNRSlLq-ZtSMJDr8c8miqBWs1vXsZoeTyXZF04Ltch5lDvbeToB0lcJM00ZE-HDvxTG8ZvTL1bIP6kzcDybeNeHFYxXr71aVXMUqHNOpE7h0PYakLHZm43kG1aIDzOMflG_Gvs8wBeQyjNLX9MjR3gWx3CeFClRTHksMzb7Hn-UgNeJ0W3_mNO3F-wzi6JL5tYxXZIx2xoOpN0QSM-2SXmTel28fUMUUY9wCd5TIgD_kIYyH9NrEQ-_bDD0uU HTTP/1.1' (size 2540) with 'yandex-skid' ...
    2020-09-26 22:26:49.005 7f83037fe700 Re-Filter: ... produced 0 hits (new size 2540).
    

    The filter that should have worked:

    FILTER: news.yandex.ru news.yandex.ru
    s@window\.Ya\..+\.RumLogging=100\*Math\.random.+@//clr 1@sig
    ...
    

    Actions:

    {+filter{distr-popup-yandex.ru} \
     +server-header-tagger{SHT_CLTJ} \
     +filter{news.yandex.ru} \
     +prevent-compression \
    }
    yastatic.net/news
    yandex.*/news
    news.yandex.*/
    yandex.ru/turbo/
    yastatic.net/s[0-9]/nerpa-static/
    
     
  • Fabian Keil

    Fabian Keil - 2020-09-27

    The log excerpt only seems to show server-header filters.

    Is it incomplete or is the filter not applied?

    Can you see it in the final results when you check it with:
    http://config.privoxy.org/show-url-info
    ?

     
  • withoutname

    withoutname - 2021-12-20

    "Privoxy itself does not limit the line length for filters but relies on pcre which can be compiled with a match limit."

    How can I know if there is a limitation in pcre?

     
  • Fabian Keil

    Fabian Keil - 2021-12-21

    With a recent Privoxy version you can also look for error messages like:
    2021-12-21 15:17:36.474 800b75c00 Error: Skipped filter 'banners-by-size' after job number 1: (pcre:) Match limit reached (-8)
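A small Python sketch for finding such messages in a log file. The regular expression is derived only from the message shapes that appear in this thread (the pcre one above and the pcre2 one quoted later), so other Privoxy versions may word things differently; the function name `skipped_filters` is made up for this example.

```python
import re

# Matches e.g.:
#   Error: Skipped filter 'NAME' after job number N: REASON (CODE)
SKIPPED = re.compile(
    r"Error: Skipped filter '(?P<name>[^']+)' after job number (?P<job>\d+): "
    r"(?P<reason>.*?) \((?P<code>-?\d+)\)"
)


def skipped_filters(log_lines):
    """Yield (filter name, job number, error code, reason) per skipped filter."""
    for line in log_lines:
        m = SKIPPED.search(line)
        if m:
            yield m["name"], int(m["job"]), int(m["code"]), m["reason"]


if __name__ == "__main__":
    log = [
        "2021-12-21 15:17:36.474 800b75c00 Error: Skipped filter "
        "'banners-by-size' after job number 1: (pcre:) Match limit reached (-8)",
        "2020-09-26 22:26:49.005 7f83037fe700 Re-Filter: ... produced 0 hits (new size 2540).",
        # Message text as quoted later in the thread, without its timestamp.
        "Error: Skipped filter 'google.com' after job number 9: "
        "match limit exceeded (-47)",
    ]
    for hit in skipped_filters(log):
        print(hit)
```

The job number identifies which pcrs command inside the filter was running when the limit was hit, which narrows down the pattern to look at.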

     
  • withoutname

    withoutname - 2021-12-22

    Okay, let me flesh out the question. For the page www.google.com/search?q=... I apply a filter (example):

    s@(html>)@$1<!-- comment -->@sig
    

    This filter does not work. I was told that there may be a limit on the string length, and an additional filter was suggested, like this:

    s@(<\/[a-z]+?>)[\r\n]?@\n$1@g
    

    I put it above the one described, and everything works. What could be the reason? I am using Privoxy 3.0.33 and haven't seen any error messages like Error: Skipped filter ...
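To make the effect of the second command concrete, here is a Python sketch of what it does to a page: it inserts a newline before every lowercase closing tag (and drops one CR or LF directly after it), so one very long line becomes many short ones. The sketch uses Python's `re` rather than pcre, assumes pcrs turns `\n` in the replacement into a newline, and uses a made-up function name and sample page. Whether shorter lines are the reason the other command started working is not established in this thread.

```python
import re


def break_before_closing_tags(html: str) -> str:
    """Python spelling of s@(<\\/[a-z]+?>)[\\r\\n]?@\\n$1@g (hypothetical helper)."""
    return re.sub(r"(</[a-z]+?>)[\r\n]?", r"\n\1", html)


# Hypothetical one-line page.
page = "<html><body><p>a</p><p>b</p></body></html>"

if __name__ == "__main__":
    broken = break_before_closing_tags(page)
    print(broken)
    print("longest line before:", len(page))
    print("longest line after:", max(len(line) for line in broken.split("\n")))
```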

     
  • Fabian Keil

    Fabian Keil - 2021-12-24

    The first pcrs command works for me as expected.

    Please provide a log excerpt with the debug settings documented at:
    https://www.privoxy.org/user-manual/contact.html

     
  • withoutname

    withoutname - 2021-12-27

    The first filter really does work when it is the only one. But I actually have additional filters, for example:

    s@(html>)@$1<!-- comment -->@sig #We leave this one, because it does not work in this case
    
    #I need these filters to work
    s@(<div\s+class=['"][\w\d]{5,7}['"]><a\s+href=['"].*['"]).*,event\)['"]>@<!--clr new tab-->$1 target="_blank" rel="noopener noreferrer">@igU
    s@(<div\s+class=['"][\w\d]{5,7}['"]><a\s+href=['"].*['"])\s+data-jsarwt.*>@<!--clr new tab-->$1 target="_blank" rel="noopener noreferrer">@igU
    s@(<a\s+(class=)?)("([\w\d]{3,6}|[\w\d]{5,6}\s[\w\d]{5,6})"\s+)?(href=['"]/search\?.+(?=(pnnext)))>@<!--clr new tab 2-->$1$2$3$4$5 target="_blank" rel="noopener noreferrer">@igU
    s@(<a\s+class=['"]fl['"]\s+href=['"]https?[^>]*)@<!--clr new tab 3-->$1 target="_blank" rel="noopener noreferrer"@ig
    s@(<a\s+class=['"]JWaTvb\sFx4vi['"]\s+href="https?[^>]*)@<!--clr new tab 4-->$1 target="_blank"@igU
    
     
  • withoutname

    withoutname - 2021-12-27

    I am sorry, the error you indicated does occur.
    2021-12-27 23:01:31.524 7f5c335f0700 Error: Skipped filter 'google.com' after job number 2: (pcre:) Match limit reached (-8)

     
  • withoutname

    withoutname - 2023-07-25

    I compiled Privoxy with the "Add pcre2 support" patch. When filtering on www.google.com, the log shows the message: "Error: Skipped filter 'google.com' after job number 9: match limit exceeded (-47)".

     
    • Lee

      Lee - 2023-07-26

      On 7/25/23, withoutname s18573f@users.sourceforge.net wrote:

      I compiled Privoxy with the "Add pcre2 support" patch. When filtering on www.google.com, the log shows the message: "Error: Skipped filter 'google.com' after job number 9: match limit exceeded (-47)".

      There is no 'google.com' filter shipped with Privoxy - you'll have to
      show your filter before we can help debug it.

      It would also be nice to know what operating system you're using.

      Lee

       
  • withoutname

    withoutname - 2023-07-26

    OK. I removed one filter; the error is almost the same: "Skipped filter 'google.com' after job number 8: match limit exceeded (-47)"
    My filters so far:

    FILTER: google.com google.com
    s@<(div)\s+class=['"][\w\s]+cu-container['"][^>]*@<!--filtred 2--><$1 style="display:none !important"@sig
    s@<(div)\s+class=['"].{6}['"]\s+id=['"]tads['"]\s+aria-label=['"](?:\xd0|\xf2|\xd0\xa0)(?:\xe5|\xc5|\xd0\xb5)(?:\xea|\xcb|\xd0\xba)(?:\xeb|\xcc|\xd0\xbb)(?:\xe0|\xc1|\xd0\xb0)(?:\xec|\xcd|\xd0\xbc)(?:\xe0|\xc1|\xd0\xb0)['"][^>]*@<!--filtred 1--><$1 style="display:none !important"@sig
    s@<(div)\s+class=['"][\d\w]+\s[\d\w]+['"]>\s*<div>\s*<span\s+class=.+</span>\s*</div>@<!--filtred 1.1--><$1 style="display:none !important">@sigU
    s@<(div)\s+class=['"].{6,9}\s+id=['"]tadsb['"][^>]*@<!--filtred 2--><$1 style="display:none !important"@sig
    s@<(div)\s+class=['"](?:cu-container|top-pla-group-inner)['"][^>]*@<!--filtred 4--><$1 style="display:none !important"@sigU
    s@<(div)\s+id=['"]bottomads['"][^>]*@<!--filtred 5--><$1 style="display:none !important"@sig
    s@<(div)\s+class=['"].{6}['"]>(\s*<div\s+class=['"].{6}\s+.{6}\s+.{5}['"]>\s*<div>.+)@<!--filtred mobile 1--><$1 style="display:none !important">$2@igU
    s@(<div\s+class=['"][\w\d]{5,7}['"]><a\s+href=['"].*['"]).*,event\)['"]>@<!--filtred tab 1-->$1 target="_blank" rel="noopener noreferrer">@igU
    s@(<div\s+class=['"][\w\d]{5,7}['"]><a\s+href=['"].*['"])\s+data-jsarwt.*>@<!--filtred tab 1.1-->$1 target="_blank" rel="noopener noreferrer">@igU
    s@(<div\s+class=['"][\w\d]{5,7}['"]><a\s+href=['"].*['"])\s+data-ved.*>@<!--filtred tab chrome-->$1 target="_blank" rel="noopener noreferrer">@igU
    s@(<a\s+(class=)?)("([\w\d]{3,6}|[\w\d]{5,6}\s[\w\d]{5,6})"\s+)?(href=['"]/search\?.+(?=(pnnext)))>@<!--filtred tab 2-->$1$2$3$4$5 target="_blank" rel="noopener noreferrer">@igU
    s@(<a\s+class=['"]fl['"]\s+href=['"]https?[^>]*)@<!--filtred tab 3-->$1 target="_blank" rel="noopener noreferrer"@ig
    s@(<a\s+class=['"]JWaTvb\sFx4vi['"]\s+href="https?[^>]*)@<!--filtred tab 4-->$1 target="_blank"@igU
    s@(<div\s+class=['"][\w\d]{5,7}['"]\s+style=['"]white-space:nowrap['"]><a\s+href=['"].*['"]).+,event\)['"]>@<!--filtred tab pdf 1-->$1 target="_blank" rel="noopener noreferrer">@igU
    s@(<div\s+class=['"][\w\d]{5,7}['"]\s+style=['"]white-space:nowrap['"]><a\s+href=['"].*['"])\s+data-jsarwt.*>@<!--filtred tab pdf 2-->$1 target="_blank" rel="noopener noreferrer">@igU
    s@(<a\s+class=['"][\w\d\s]{6,8}['"]\s+href=['"].*['"])\s+data-jsarwt.*>@<!--filtred tab translate-->$1 target="_blank" rel="noopener noreferrer">@igU
    s@(<a\s+class=['"][\w\d\s]{5,8}['"]\s+href=['"]/search\?q=.*['"](?:\s+role=['"]button['"])?[^>]*)@<!--filtred tab kartinki 1-->$1 target="_blank" rel="noopener noreferrer"@igU
    s@(<title-with-lhs-icon><a\s+class=['"][\w\d]{5,7}\s[\w\d]{5,7}['"]\s+href=['"].*['"])@<!--filtred tab kartinki 2-->$1 target="_blank" rel="noopener noreferrer"@igU
    s@(data\-visible=['"]true['"]\s+href=(['"])https://www.youtube.com/watch\?v=.*)\&amp.*>@$1$2 target="_blank" rel="noopener noreferrer" /*!- filtred tab youtube 1 -*/ >@igU
    s@(<a\s+class=['"][\w\d]{5,7}['"]\s+href=['"]https://www.youtube.com/watch\?v=.*['"]\s+aria-label=['"].*['"])(.*)>@$1$2 target="_blank" rel="noopener noreferrer" /*!- filtred tab youtube 2 -*/ >@igU
    s@(<div\s+class=['"][\w\d]{5,7}['"]\s+style=['"]height:\d+px['"]><a\s+href=['"].*['"])\s+data-jsarwt.*>@<!--filtred tab youtube 2-->$1 target="_blank" rel="noopener noreferrer">@igU
    s@(<div\s+class=['"][\w\d]{5,7}['"]><h3\s+class=['"][\w\d]{5,7}['"]><a\s+href=['"].*['"])\s+data-jsarwt.*>@<!--filtred tab youtube 2-->$1 target="_blank" rel="noopener noreferrer">@igU
    s@(<div\s+jscontroller=['"][\w\d]{5,7}['"]\s+jsaction=['"][\w\d]{5,7}:[\w\d]{5,7}['"]><a\s+class=['"][\w\d]{5,7}['"]\s+href=['"].*['"])\s+aria-label.*>@<!--filtred tab youtube 3-->$1 target="_blank" rel="noopener noreferrer">@igU
    

    OS: Arch Linux.

     

    Last edit: withoutname 2023-07-26
  • withoutname

    withoutname - 2023-08-05

    With the latest snapshot, the problem does not occur. In my recent post I was using the "Add pcre2 support" patch from 2023.07.11. Looks like I was wrong again...

     
  • Fabian Keil

    Fabian Keil - 2023-08-08

    I'm not sure I understand you correctly.

    Are you saying you saw the issue with pcre1 but not with pcre2?

     
  • withoutname

    withoutname - 2023-08-12

    The problem occurred when I applied only this one patch to Privoxy 3.0.34. With the latest snapshots, there is no problem.

     
  • Fabian Keil

    Fabian Keil - 2023-08-15
    • status: pending --> closed
     
  • Fabian Keil

    Fabian Keil - 2023-08-15

    Great. Thanks for the confirmation.

     
