Work at SourceForge, help us to make it a better place! We have an immediate need for a Support Technician in our San Francisco or Denver office.

Close

#254 Error when indexing website : CDB___memp_cmpr_read

open
nobody
None
5
2005-01-12
2005-01-12
EUZENOT
No

Hello,

I've this error when I run htdig.
WordDB: CDB___memp_cmpr_read: unable to uncompress page
at pgno = 3
WordDB: PANIC: Erreur d'entrée/sortie
DB_RUNRECOVERY: Fatal error, run database recovery
WordDB: CDB___memp_cmpr_read: unable to uncompress page
at pgno = 454032
WordDB: PANIC: Erreur d'entrée/sortie
DB_RUNRECOVERY: Fatal error, run database recovery

it's a snapshot : htdig-3.2.0b6-20040606

Here my config :
---------------------------------------------------------
common_dir: /opt/www/share/htdig
database_dir: /opt/www/var/htdig
database_base: ${database_dir}/dr20_sb00
start_url: http://www.dr20.cnrs.fr/demarre.php
limit_urls_to: ${start_url}
http://www.dr20.cnrs.fr/
exclude_urls: /cgi-bin/ .cgi /_vti_bin/ /map
/map1 /Lettre/ /testpdf/ /_vti_bin/
minimum_word_length : 1
maximum_word_length : 999
max_head_length : 100000000
max_doc_size : 20000000
no_excerpt_show_top: true
search_algorithm: exact:1 synonyms:1 accents:0,1
prefix:0,1 endings:1
start_highlight: __COLORHIGHLIGHT__
end_highlight: __END_COLORHIGHLIGHT__
nothing_found_file :
/opt/www/share/htdig/nomatch-dr20.html
search_results_header :
/opt/www/share/htdig/header-dr20.html
search_results_footer :
/opt/www/share/htdig/footer-dr20.html
template_map : cnrs cnrs-long
/opt/www/share/htdig/cnrs-long.html
template_name: cnrs-long
locale: fr_FR
valid_punctuation : °¨¤Ł§
method_names: and 'Tous les mots' or 'Un des mots'
boolean Booléen
sort_names: time Date score Pertinence
endings_dictionary: ${common_dir}/francais.0
endings_affix_file: ${common_dir}/francais.aff
bad_word_list: ${common_dir}/bad_words.fr
synonym_dictionary: ${common_dir}/synonyms.dr20_sb00.db
accents_db: ${common_dir}/accents.dr20_sb00.db
endings_word2root_db: ${common_dir}/word2root.db
endings_root2word_db: ${common_dir}/root2word.db
external_parsers: application/pdf->text/html
/usr/local/scripts/conv_doc.pl \

application/postscript->text/html
/usr/local/scripts/doc2html.pl \ application/msword->text/html
/usr/local/scripts/conv_doc.pl \ application/rtf->text/html
/usr/local/scripts/doc2html.pl \ text/rtf->text/html
/usr/local/scripts/doc2html.pl \

application/vnd.ms-powerpoint->text/html
/usr/local/scripts/doc2html.pl
template_patterns: .pdf ${common_dir}/form-pdf-xml.html \ .doc ${common_dir}/form-pdf-xml.html \ .rtf ${common_dir}/form-pdf-xml.html \ .xls ${common_dir}/form-pdf-xml.html \ .ppt ${common_dir}/form-pdf-xml.html
excerpt_show_top: yes
allow_in_form : restrict_input details restrict_file
choix_deleg etat excerpt_show_top restrict
allow_numbers: true
use_meta_description : false
remove_default_doc :
maximum_pages: 30
keywords_factor : 0
doc_excerpt: /opt/www/var/htdig/dr20_sb00.excerpts.db
check_unique_date: true

---------------------------------------------------------

Someone know how to fix this issue?

Thanks a lot.
EUZENOT Franck
feuzenot@alcyonis.fr

Discussion