From: Juan M. A. H. <Ju...@ar...> - 2006-08-18 13:01:57
|
Hi, and first of all, thanks for your time ;-)

I have two questions about indexing several sites.

The first question:

Imagine two sites, site1.com and site2.com.

On Mondays I need to index only site1.com. On the other days, only site2.com.

Every day of the week I need to be able to search both sites.

I tried changing htdig.conf to point to site1.com or site2.com, never both, but it indexes both sites every day.

The second question:

How can I remove site1.com from the database? What is the command to do this?

Thanks in advance, and sorry for my English, I'm Spanish.

Juanmi.

______________________________________
"In accordance with Article 5 of Organic Law 15/1999 on the Protection of Personal Data, we inform you that your personal data are part of a computerized data file held by this company and the companies of its group. We also inform you that you may exercise your rights of access, rectification, cancellation, and objection regarding those data by writing to the company at C/ Altamira num.1 de Azuqueca de Henares (19200) Guadalajara"
______________________________________
|
From: Jim <li...@yg...> - 2006-08-21 19:49:12
|
On Fri, 18 Aug 2006, Juan Miguel Alcarria Herrera wrote:

>The first doubt:
>
>Imagin two sites, site1.com and site2.com
>
>The monday i need index only the site1.com
>The rest of days site2.com
>
>All days of the week I need to make search in two sites.

It might make the most sense to create two separate sets of databases. Then you can perform your indexing of one site independent of the other. See http://www.htdig.org/dev/htdig-3.2/FAQ.html#q4.4. If you want to search both databases simultaneously from a single form, you can use the collection_names attribute (http://www.htdig.org/dev/htdig-3.2/attrs.html#collection_names).

>The second doubt:
>
>How can i remove site1.com from db?
>Which is the command to make this?

If you use separate databases you won't need to remove one site from the database. I don't think there is any simple way to remove all URLs for a given site from a database; you would probably need to find some way to enumerate all URLs from the site you want to remove and then feed those to htpurge.

Jim
|
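The "enumerate all URLs from the site and feed those to htpurge" step could look roughly like the Python sketch below. It is only an illustration: it assumes you already have the indexed URLs as plain text, one per line (how you export that list depends on your ht://Dig version), and the function name urls_for_host is made up for this example. The selected URLs would then be handed to htpurge; check the htpurge documentation for your version to see how it accepts a URL list.

```python
from urllib.parse import urlsplit


def urls_for_host(urls, host):
    """Return the URLs whose host is `host` or a subdomain of it.

    `urls` is any iterable of strings (for example, the lines of a file).
    Blank lines are skipped; the original URL text is kept unchanged.
    """
    host = host.lower()
    selected = []
    for raw in urls:
        url = raw.strip()
        if not url:
            continue
        # hostname is None for strings that are not absolute URLs
        name = (urlsplit(url).hostname or "").lower()
        if name == host or name.endswith("." + host):
            selected.append(url)
    return selected


# Hypothetical sample list standing in for an export of the indexed URLs.
sample = [
    "http://site1.com/index.html",
    "http://www.site1.com/docs/a.html",
    "http://site2.com/index.html",
    "http://site2.com/page?ref=site1.com",
]

for url in urls_for_host(sample, "site1.com"):
    print(url)
```

Matching on the parsed host name rather than on a substring keeps a URL such as http://site2.com/page?ref=site1.com out of the purge list.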
From: Juan M. A. H. <Ju...@ar...> - 2006-08-22 08:56:32
|
Hi Jim, thanks for your reply ;-)

I'm running htdig 3.1.6-11 on Debian Sarge. Is it possible to do this with that version (3.1.6-11)? I read at http://www.htdig.org/dev/htdig-3.2/attrs.html#collection_names that it is available only in version 3.2.0b2 or later.

If it is possible, could you post an example htdig.conf with two sites?

Thanks again.

Juanmi.

_______________________________________________
ht://Dig general mailing list: <htd...@li...>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general
|
From: Jim <li...@yg...> - 2006-08-27 06:51:12
|
Hi -

The 3.1.6 version of the code does not support collections. There is a patch, but you would need to apply it to the source and build your own executables.

ftp://ftp.ccsf.org/htdig-patches/3.1.6/collections.0

If you want to stay with your 3.1.6 package, your best bet is probably to read up on htmerge. You can build your separate databases as appropriate and then just merge them into a single production database whenever you have a new update.

Jim
|
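As a rough illustration of the htmerge approach with the Monday/other-days schedule from the original question, the Python sketch below only builds the command lines for a given date; it does not run anything. Everything specific in it is an assumption: the three config file paths are hypothetical, each per-site config is assumed to have its own database_dir and start_url, and the use of "htmerge -m" to merge one set of databases into another should be checked against the htmerge documentation for 3.1.6 before relying on it. How the production database is refreshed so that stale documents disappear is also left open here.

```python
import datetime
import shlex

# Hypothetical paths: one config per site, plus one for the merged
# production database that htsearch would read.
SITE1_CONF = "/etc/htdig/site1.conf"
SITE2_CONF = "/etc/htdig/site2.conf"
PRODUCTION_CONF = "/etc/htdig/htdig.conf"


def commands_for(day):
    """Return the command lines to run on `day` (a datetime.date).

    Monday rebuilds site1's databases; every other day rebuilds site2's.
    The rebuilt databases are then merged into the production set.
    """
    conf = SITE1_CONF if day.weekday() == 0 else SITE2_CONF  # 0 is Monday
    return [
        ["htdig", "-i", "-c", conf],                         # fresh dig of one site
        ["htmerge", "-c", conf],                             # build that site's index
        ["htmerge", "-c", PRODUCTION_CONF, "-m", conf],      # merge into production
    ]


# Print today's plan, e.g. to review it before wiring it into cron.
for command in commands_for(datetime.date.today()):
    print(shlex.join(command))
```

Returning the commands instead of executing them makes it easy to inspect the plan first; a cron job could later pass each list to subprocess.run.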