From: Pino C. <pc...@ne...> - 2006-02-23 09:26:58
|
This is a forwarded message
From: Pino Calzo <pc...@ne...>
To: Manuel Zini <ml...@ya...>
Date: Thursday, February 16, 2006, 4:31:52 PM
Subject: Fwd: Re: classification

===8<==============Original message text===============
Manuel,

Were my explanations helpful? Do you think Isobel can help us?

I tried to install it, by the way, but the documentation seems to be close to nonexistent, and without it I'm quite lost :)

Another question: which dictionaries does Isobel use? The same as IBM's (*.dic)?

Best,
Pino Calzo
NewsIsFree
pc...@ne...
===8<===========End of original message text===========

--
Best regards,
Pino mailto:pc...@ne... |
From: Alexander <hl...@bk...> - 2006-02-18 04:37:04
|
Please tell me more about events (intention to content sources, routing table). I also have questions about scheduling... How does it work? Alexander |
From: Alexander <hl...@bk...> - 2006-02-18 03:32:29
|
Hello, I managed to compile Isobel and I'm figuring out the architecture. Can you send me all the info you have: class diagrams, notes, and similar? If you have it, that would be great. Alexander |
From: Manuel Z. <ml...@ya...> - 2006-02-16 19:21:41
|
Thanks Alexander,

As I said in a previous mail, Isobel 0.8 is an alpha version and for now is intended just as a demo of Isobel's main ideas, as we state in the readme file. We are working hard toward release 0.9, but a release date is not yet planned. I'm sorry, but for now there's not much time left for the 'community' side of our work.

From what I see in your e-mail, the errors you got are in some obsolete test code that we haven't deleted yet. You shouldn't need that code to run Isobel... Anyway, if you send us some of the errors you got, we could be more helpful.

Having said that, if you want to collaborate with us we will be glad, and we will accept any suggestion that we feel could help the project. For further agreement on our collaboration please write to my personal mail: ml...@ya..., since this list is more appropriate for purely technical matters.

Best Regards,
Manuel

--- Alexander <hl...@bk...> wrote:

> Date: Thu, 16 Feb 2006 20:58:05 +0300
> From: Alexander <hl...@bk...>
> To: iso...@li...
> Subject: Isobel compile problem
>
> People, tell me how you compile that code? I checked out from CVS and
> downloaded version 0.8.1, and I got the same problem:
> when I try to compile Isobel I get an error in class
> pythonProcessorTest, at line 51:
> dataBus.putCustomData(new String("PythonString"),new String("Python è bello"));
> The compiler can't find symbol method putCustomData. And this is just
> one of many other errors I got; I can't provide you with all of them here.
> It's really strange to see class source and binary located in the same
> place. I can't tell whether I downloaded the source and the binary at once.
> I thought I was going to download the sources. Guys, this ain't good. The
> project is totally messed up and the code base is broken. Do you have
> anyone responsible for setting up the project? Maybe you need my help?
>
> Alexander.

___________________________________
Yahoo! Mail: free 1GB for messages and 10MB attachments http://mail.yahoo.it |
From: Alexander <hl...@bk...> - 2006-02-16 18:12:13
|
From: Alexander <hl...@bk...> - 2006-02-16 17:57:58
|
People, tell me how you compile that code? I checked out from CVS and downloaded version 0.8.1, and I got the same problem: when I try to compile Isobel I get an error in class pythonProcessorTest, at line 51:

dataBus.putCustomData(new String("PythonString"),new String("Python è bello"));

The compiler can't find symbol method putCustomData. And this is just one of many other errors I got; I can't provide you with all of them here.

It's really strange to see class source and binary located in the same place. I can't tell whether I downloaded the source and the binary at once. I thought I was going to download the sources. Guys, this ain't good. The project is totally messed up and the code base is broken. Do you have anyone responsible for setting up the project? Maybe you need my help?

Alexander. |
From: Manuel Z. <ml...@ya...> - 2006-02-16 10:34:12
|
There has been an answer for you in isobel-users since yesterday, I've seen it :), so please check the archives again.

Anyway, there's no defined release date for 1.0 for now, but I think the code we have now is suitable for your purposes. Download the code release isobel 0.8.1 from SourceForge; we'll be glad to help with your questions.

If you want to collaborate we will be happy with it; we always need help. If you are really motivated, first of all get familiar with the architecture and the code, and you will have all our support in doing this.

Bye,
Manuel

> I tried to mail iso...@li..., but there's no response from there, so I
> disturb you here again... :)
>
> I'd like to know when a new release is expected and whether I can somehow
> get working code to build from. Maybe a binary version is enough for me; I
> just want to add some functionality, as I suppose something may be missing
> for my project. I could even contribute to your project, for example by
> implementing collections support.
> At the moment I want to set up a website which provides users with news
> aggregation. I want to set up Isobel to collect news info and run searches
> on it. I already want to start using the current version of Isobel, but
> there's no good description of the API... :( Can I use the current version
> for my site, or should I wait for the new release? If so, how long will it
> take?
>
> Thank you!
>
> Alexander. |
From: Andrea A. <ag...@li...> - 2006-02-15 02:18:18
|
Alexander wrote:
> Hello guys!
>
> I'd like to know what stage Isobel is at. I already want to start using
> it as a search platform for my news aggregation site. Is it possible? I
> saw a demo and I feel I can run it for indexing for a while. But I also
> need to pass a query to the platform from my site, and I'd like to have
> an API for this with some comments to figure things out.
>
> You told me that you refactored the architecture for the 1.0 release.
> Can you tell me what it is now?

Hi Alexander,

The features we have added since we released the demo on SF include an improved querying machinery (through the SearchManager_1 class); we have just released the source code we are working on now.

The querying mechanism now mirrors the analysis mechanism, with applications and processors. I don't know if you are familiar with the Isobel structure; I think the documentation we have produced is a bit poor... Feel free to ask any question regarding Isobel.

Isobel uses Lucene to build the indexes, so you can use any tool compatible with Lucene to read and query the indexes.

The code uploaded is an Eclipse project; you should not have any problem building Isobel with it.

Now you can pass a String or a HashMap to the querying mechanism; a Pipeline will be created with the Processors specified in isobel.xml in the queryApplications section.

As you will see, the configuration structure has changed: we have introduced the Collection as a set of sites. Soon the application definitions will be removed from isobel.xml and placed in a separate directory (like sites).

To start this mechanism you need to call one of the search() methods of the SearchManager_1 class; it will return the processed results.

This part is under heavy development: you will find bugs and some things will not work. Please be patient and submit the bugs you find to us.

Andrea Agili |
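The query flow described above (a String or a map goes in, a pipeline of configured processors runs over it, processed results come out) can be pictured with a small, self-contained sketch. Everything here is an assumption for illustration: the class `QueryPipelineSketch`, the key `"text"`, and the two toy processors are invented, and none of it is taken from Isobel's code or from the real `SearchManager_1` signatures, which the thread does not show.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.function.UnaryOperator;

// Hypothetical sketch: these names are illustrative and are NOT Isobel's API.
class QueryPipelineSketch {

    // Each processor receives the query fields and returns an updated copy.
    private final List<UnaryOperator<Map<String, String>>> processors = new ArrayList<>();

    QueryPipelineSketch add(UnaryOperator<Map<String, String>> processor) {
        processors.add(processor);
        return this;
    }

    // A plain string query is wrapped in a one-entry map under the assumed key "text".
    Map<String, String> run(String query) {
        Map<String, String> fields = new HashMap<>();
        fields.put("text", query);
        return run(fields);
    }

    // Applies the processors in the order they were added.
    Map<String, String> run(Map<String, String> fields) {
        Map<String, String> current = new HashMap<>(fields);
        for (UnaryOperator<Map<String, String>> processor : processors) {
            current = processor.apply(current);
        }
        return current;
    }

    // Helper: returns a copy of the fields with "text" replaced.
    static Map<String, String> withText(Map<String, String> fields, String text) {
        Map<String, String> copy = new HashMap<>(fields);
        copy.put("text", text);
        return copy;
    }

    // Two toy processors standing in for whatever a configuration file would list.
    static QueryPipelineSketch example() {
        return new QueryPipelineSketch()
            .add(f -> withText(f, f.getOrDefault("text", "").trim()))
            .add(f -> withText(f, f.getOrDefault("text", "").toLowerCase(Locale.ROOT)));
    }

    public static void main(String[] args) {
        System.out.println(example().run("  Berlusconi Election ").get("text"));
    }
}
```

In this sketch the processor order is fixed in code; in the setup Andrea describes, the order would come from the queryApplications section of isobel.xml instead.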
From: Alexander <hl...@bk...> - 2006-02-14 18:04:05
|
Hello guys! I'd like to know what stage Isobel is at. I already want to start using it as a search platform for my news aggregation site. Is it possible? I saw a demo and I feel I can run it for indexing for a while. But I also need to pass a query to the platform from my site, and I'd like to have an API for this with some comments to figure things out. You told me that you refactored the architecture for the 1.0 release. Can you tell me what it is now? Thanks! Alexander. |
From: Manuel Z. <ml...@ya...> - 2006-01-29 21:41:39
|
Let's try an example: could you try to define when two news items, e.g. one in Italian and the other in English, are related? (Let's skip the obvious case of proper names.)

--- Pino Calzo <pi...@ca...> wrote:

> Hi Manuel,
>
> looking at our users' behaviour I would say inter-language (btw - we
> know the language per news item). We could even narrow it down by
> category - but that could also hinder possibly interesting results.
>
> I'm imagining that if something like this is done there will be clouds
> of inter-language related headlines. Some articles could be
> part of multiple clouds.
>
> So if I see a news headline from a feed I'm subscribed to, I could on
> request also see which articles are related to it (from other
> sources). Another interesting thing is to evaluate the size of the
> clouds and how their size changes over time (that way it would be quite
> nice to see which clouds are "hot topics" (growing fast in the last x
> hours) and which ones are not anymore).
>
> Appended is an example of how the info is stored in our database.
>
> <document>
>   <id> 125312416 </id>
>   <title> <b>Berlusconi</b> swears off sex until election </title>
>   <info> http://www.news.com.au/story/0,10117,17979986-401,00.html?from=rss </info>
>   <rate> 1138550792 </rate>
>   <spectags>
>     <adddate> 2006-01-29 17:10:00 </adddate>
>     <source> 6298 </source>
>     <lang> en </lang>
>     <ftwords> hpecatchall nkcat1 nkcat10 nkcat46 nkcat110 nklangen nksource6298 </ftwords>
>     <popularity> 0.2929137 </popularity>
>     <shortdate> 060129 </shortdate>
>     <src_name> NEWS.com.au: The World </src_name>
>     <src_desc> The top stories from around the world through Australian eyes. Features reports from correspondents in Bangkok, Beijing, Jakarta, London, Los Angeles, New York, Tokyo, Washington and Wellington and the Australian Associated Press, Associated Press and... </src_desc>
>   </spectags>
>   <text> ITALIAN Prime Minister Silvio Berlusconi is famous for his ambitious promises, but he is unlikely to be called to task if he breaks his latest pledge: not to have sex before the April 9 general election. <title> Berlusconi swears off sex until election </title> </text>
> </document>
>
> First ID tag is database internal ID. Source is the ID of Source Name.
> Lang is the 2-letter code of the language. FTWords are keywords we
> currently use (a source can belong to multiple sources).
>
> cheers
>
> Pino Calzo
> pi...@ca...
>
> Sunday, January 29, 2006, 2:45:39 PM, you wrote:
=== message truncated === |
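A record in the shape Pino quotes above can be read with the XML parser that ships with the JDK. The sketch below is only an illustration under assumptions: the class name `NewsRecordSketch` is invented, the sample string is an abbreviated copy of the quoted record, and it assumes the records are well-formed XML as shown (the `<b>` inside `<title>` is treated as a child element).

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical sketch: reads a few fields from a record shaped like the quoted example.
class NewsRecordSketch {

    // Abbreviated copy of the quoted record; the body text is shortened for space.
    static final String SAMPLE =
        "<document>"
        + "<id> 125312416 </id>"
        + "<title> <b>Berlusconi</b> swears off sex until election </title>"
        + "<spectags><source> 6298 </source><lang> en </lang></spectags>"
        + "<text> ITALIAN Prime Minister Silvio Berlusconi is famous for his ambitious promises. "
        + "<title> Berlusconi swears off sex until election </title> </text>"
        + "</document>";

    // Parses the string and returns the root <document> element.
    static Element parse(String xml) {
        try {
            return DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)))
                .getDocumentElement();
        } catch (Exception e) {
            throw new IllegalArgumentException("not well-formed XML", e);
        }
    }

    // Returns the text of the first element with this tag name (document order),
    // with surrounding and repeated whitespace collapsed; "" if the tag is absent.
    static String field(Element root, String tag) {
        NodeList matches = root.getElementsByTagName(tag);
        if (matches.getLength() == 0) {
            return "";
        }
        return matches.item(0).getTextContent().trim().replaceAll("\\s+", " ");
    }

    public static void main(String[] args) {
        Element doc = parse(SAMPLE);
        System.out.println(field(doc, "id") + " [" + field(doc, "lang") + "] " + field(doc, "title"));
    }
}
```

Because `<title>` also appears nested inside `<text>`, the sketch relies on document order to pick the top-level title first; a stricter reader would walk only the direct children of `<document>`.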
From: Manuel Z. <ml...@ya...> - 2006-01-29 13:45:46
|
Actually, we are using Isobel for news classification, and I think it's feasible to use it to spot news that could be considered 'related'.

My first question is: what language are you interested in? Are you interested in intra-language or inter-language relations?

I'm asking because there is no universal language tool to perform classification or any other kind of text-based analysis.

Regards,
Manuel

--- Pino Calzo <pi...@ca...> wrote:

> Hi Manuel,
>
> well - our site is best described as a "river of news". There are a lot
> of news headlines coming in across many languages in a short timeframe:
>
> - 20'118 sources
> - 30 languages (including Arabic etc.)
> - 4'079'951 documents (a document is mostly: headline, link and link description)
> - per hour we add almost 10'000 headlines (and delete 1000)
>
> We keep a 30-day archive - hence the deletions. Most of our users are
> interested in what's currently happening - and not really in the
> archives.
>
> So - having said this, I guess you understand why I'm talking about a
> "river of news". The problem we have is that there are many headlines
> which might be related, but we currently have no way to see these
> relations automatically.
> As we classify every source manually into a category ("channel"), we
> sometimes know that a headline belongs to "soccer" or "celebrities"
> because the source itself generally publishes headlines from this
> category.
>
> I would actually be interested in going beyond that and having some kind
> of "headlines clustering" (e.g. show "related headlines" behind a
> headline). This could be quite interesting and a world-first (at
> least I don't know of good implementations of something like this in an
> international way).
>
> Is Isobel the right tool for something like this? Would you see other
> possibilities/implementations?
>
> Thanks, and see you soon
> Pino
>
> Pino Calzo
> pi...@ca... |
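To make the "related headlines" idea above concrete for the simpler intra-language case, here is a minimal sketch that scores two headlines by word overlap (Jaccard similarity). This is a generic baseline chosen for illustration, not something the thread says Isobel does; the class name, the tokenisation rule, and the idea of a cut-off threshold are all assumptions. It does nothing for the inter-language case Manuel raises, since it only compares surface words.

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical baseline: word-overlap similarity between two headlines in the same language.
class RelatedHeadlinesSketch {

    // Lower-cases the headline, splits on anything that is not a letter or digit,
    // and keeps tokens longer than two characters.
    static Set<String> tokens(String headline) {
        Set<String> out = new HashSet<>();
        for (String token : headline.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (token.length() > 2) {
                out.add(token);
            }
        }
        return out;
    }

    // Jaccard similarity: shared tokens divided by all distinct tokens (0.0 if both are empty).
    static double jaccard(String a, String b) {
        Set<String> ta = tokens(a);
        Set<String> tb = tokens(b);
        if (ta.isEmpty() && tb.isEmpty()) {
            return 0.0;
        }
        Set<String> shared = new HashSet<>(ta);
        shared.retainAll(tb);
        Set<String> all = new HashSet<>(ta);
        all.addAll(tb);
        return (double) shared.size() / all.size();
    }

    public static void main(String[] args) {
        System.out.println(jaccard("Berlusconi swears off sex until election",
                                   "Berlusconi election pledge"));
    }
}
```

Two headlines could then be treated as related when the score exceeds some threshold; picking that threshold, and comparing millions of headlines efficiently, are exactly the parts this sketch leaves open.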
From: Manuel Z. <ml...@ya...> - 2006-01-27 10:17:43
|
Hi,

> I'm wondering if you'd be interested in an integration of the
> NewsIsFree news headline database with isobel.

It sounds really interesting! We would really appreciate any opportunity to collaborate.

> I'm not sure if this would be possible at all - but the idea is
> interesting. NewsIsFree gets thousands of new headlines per hour which
> should be analyzed and related to each other. We constantly spider 20'000
> news sources, covering over 20 languages.
>
> As far as I understood, this analysis should in theory be possible with isobel.

Isobel is actually designed to do gathering and analysis of huge collections of data.

> Technically we work with APIs in PHP. They should be talking with an
> isobel API (like sending input "analyze this" and somehow getting the
> resulting analysis back).

No problem at all, integration can be achieved through SOAP or XML-RPC.

> How fast is isobel? Would it be possible to do something like the
> above in near-realtime?

That's what Isobel is meant for. By the way, what is not clear to me is the kind of analysis you need. Of course, the time depends on the analysis task you perform on the text.

Can you explain in more detail what your requirements/expectations are?

Thanks for your interest,
Hope to hear from you soon,

Manuel Zini
The Isobel Team

___________________________________
Yahoo! Messenger with Voice: call from PC to phone at exclusive rates http://it.messenger.yahoo.com |
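For the XML-RPC route mentioned above, the "send input, get the analysis back" exchange starts with a request body like the one built below. This is a sketch under assumptions: the method name `isobel.analyze` is invented for illustration (the thread names no remote methods), and only the request payload is shown, not the HTTP transport or the response handling. The element layout follows the standard XML-RPC `methodCall` structure.

```java
// Hypothetical sketch: builds an XML-RPC request body with a single string parameter.
class XmlRpcCallSketch {

    // Escapes the characters that must not appear raw inside XML text.
    static String escape(String text) {
        return text.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    // Produces <methodCall> with one <string> parameter, as defined by XML-RPC.
    static String buildCall(String methodName, String argument) {
        return "<?xml version=\"1.0\"?>"
            + "<methodCall>"
            + "<methodName>" + escape(methodName) + "</methodName>"
            + "<params><param><value><string>" + escape(argument) + "</string></value></param></params>"
            + "</methodCall>";
    }

    public static void main(String[] args) {
        // "isobel.analyze" is an assumed method name, not one documented in this thread.
        System.out.println(buildCall("isobel.analyze", "Berlusconi swears off sex until election"));
    }
}
```

A PHP client would POST the same body with content type text/xml, so both sides only need to agree on method names and parameter types.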
From: Manuel Z. <ml...@ya...> - 2005-12-15 10:24:17
|
Isobel alpha 0.8 is the first public release of Isobel. Please check http://www.isobel-project.org

Isobel 0.8 is fully functional, but several bugs still have to be fixed. Every contribution is welcome; if you want to contribute please write to ml...@ya...

To ease the understanding of Isobel, its configuration and personalization, two demo applications are provided:

* Batch pdf and doc conversion and indexing:
  o Isobel gathers every .doc and .pdf file found in a configurable directory and indexes them.
  o You can start/stop Isobel from the admin tab of the graphical interface
  o You can query Isobel from the query tab of the graphical interface
* Isobel site gathering and indexing
  o Isobel gathers its own site and indexes every pdf and doc file.
  o You can change the site url through the configuration file
  o You can start/stop Isobel from the admin tab of the graphical interface
  o You can query Isobel from the query tab of the graphical interface |
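The gathering step of the first demo (collect every .doc and .pdf file under a configurable directory) can be sketched with the JDK's file-walking API. This is an illustration only, not Isobel's gatherer: the class name `GatherSketch`, the case-insensitive extension match, and the recursive descent into subdirectories are assumptions, and conversion and indexing are left out.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical sketch: lists the .doc and .pdf files found under a directory.
class GatherSketch {

    // Walks the directory tree and keeps regular files ending in .doc or .pdf (any letter case).
    static List<Path> gather(Path root) {
        try (Stream<Path> paths = Files.walk(root)) {
            return paths
                .filter(Files::isRegularFile)
                .filter(p -> {
                    String name = p.getFileName().toString().toLowerCase(Locale.ROOT);
                    return name.endsWith(".doc") || name.endsWith(".pdf");
                })
                .sorted()
                .collect(Collectors.toList());
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Builds a small temporary tree and returns the names of the files that were gathered.
    static List<String> demo() {
        try {
            Path root = Files.createTempDirectory("gather-demo");
            Files.createDirectories(root.resolve("sub"));
            Files.createFile(root.resolve("a.doc"));
            Files.createFile(root.resolve("notes.txt"));
            Files.createFile(root.resolve("sub").resolve("b.PDF"));
            return gather(root).stream()
                .map(p -> p.getFileName().toString())
                .sorted()
                .collect(Collectors.toList());
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

In a real setup the root path would come from the configuration file rather than a temporary directory, and each gathered path would be handed on to the conversion and indexing steps.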