From: Pekka H. <pek...@ik...> - 2007-03-24 14:20:39
I just stumbled upon Aperture while surfing the net and am trying to figure out its capabilities. Perhaps someone who knows Aperture well can help a little.

I am thinking about building software for studying social communication networks by analyzing the emails a person sends and receives. The app would first capture messages (perhaps to a simple ASCII file) and then extract data from them by analyzing their headers: who is the sender, who is the receiver, whether they are in the same domain, etc.

It seems that Aperture could be used in the process, am I right? Does it have tools for extracting header information?

With best regards
Pekka Henttonen
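The header analysis described above (sender, receivers, same-domain check) can be sketched roughly as follows. This is only an illustration using Python's standard `email` package as a stand-in; it is not Aperture or JavaMail code, and the names `header_summary`, `domain_of`, and the sample message are hypothetical.

```python
from email import message_from_string
from email.utils import getaddresses


def domain_of(address):
    """Return the lower-cased part after the last '@' (empty if there is none)."""
    return address.rpartition("@")[2].lower() if "@" in address else ""


def header_summary(raw_message):
    """Extract sender, recipients, and a per-recipient same-domain flag."""
    msg = message_from_string(raw_message)
    # getaddresses() splits header values into (display name, address) pairs.
    senders = [addr for _, addr in getaddresses(msg.get_all("From", []))]
    recipients = [
        addr
        for _, addr in getaddresses(msg.get_all("To", []) + msg.get_all("Cc", []))
    ]
    sender = senders[0] if senders else ""
    return {
        "sender": sender,
        "recipients": recipients,
        "same_domain": [
            bool(domain_of(sender)) and domain_of(r) == domain_of(sender)
            for r in recipients
        ],
    }


# Hypothetical sample message, made up for this sketch.
SAMPLE = (
    "From: Alice Example <alice@example.org>\n"
    "To: Bob <bob@example.org>, carol@example.net\n"
    "Subject: hello\n"
    "\n"
    "Body text.\n"
)

summary = header_summary(SAMPLE)
print(summary)
```

Only the `From`, `To`, and `Cc` headers are read here; a real study would probably also need `Bcc`, `Reply-To`, and the date fields.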
From: Leo S. <leo...@df...> - 2007-03-26 08:10:52
Hi Pekka,

It was Pekka Henttonen who said at the right time 24.03.2007 15:20 the following words:
> I just stumbled into Aperture while surfing in the net and try to
> figure out its capabilities. Perhaps someone who knows the Aperture
> well can help a little.
>
> I am thinking about building a software to be used in the study of
> social communication networks by analyzing emails sent and received by
> a person. The app would first capture messages (perhaps to a simple
> ascii file) and then extract data from the messages by analyzing
> their headers: who is the sender, who is the receiver, whether they
> are in the same domain, etc.
>
> It seems that Aperture could be used in the process, am I right? Does
> it have tools for extracting header information?
>
Aperture could be used in the process. It does not parse header information specifically; it parses whole e-mails, including the headers, the text, and the attachments.

As far as I know, Aperture only crawls mail from IMAP servers, so you would have to move the test mails to an IMAP server. Alternatively, we could hack Aperture to also crawl MBOX files (which is possible using the JavaMail API we use internally).

If you already know exactly what you want and want it quickly, it may be better for you to use JavaMail directly; then you get the headers quite easily.

If you want to be able to do more with the data, such as storing it in a database or adding annotations using algorithms, then Aperture may be a good fit. Aperture uses RDF to represent information, so it is most useful if you are familiar with RDF (Semantic Web). Aperture can be used in combination with an RDF database: you then have all the e-mail fields in a database and can program your statistical analysis against it, using an RDF query language such as SeRQL or SPARQL.

hth
Leo

> With best regards
> Pekka Henttonen

--
____________________________________________________
DI Leo Sauermann       http://www.dfki.de/~sauermann

Deutsches Forschungszentrum fuer
Kuenstliche Intelligenz DFKI GmbH
Trippstadter Strasse 122
P.O. Box 2080           Fon:   +49 631 20575-116
D-67663 Kaiserslautern  Fax:   +49 631 20575-102
Germany                 Mail:  leo...@df...

Geschaeftsfuehrung:
Prof.Dr.Dr.h.c.mult. Wolfgang Wahlster (Vorsitzender)
Dr. Walter Olthoff
Vorsitzender des Aufsichtsrats:
Prof. Dr. h.c. Hans A. Aukes
Amtsgericht Kaiserslautern, HRB 2313
____________________________________________________
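As a rough idea of what reading headers straight from an MBOX file involves, here is a minimal sketch. It uses Python's standard `mailbox` module purely as a stand-in; it is not the JavaMail API and not how Aperture crawls mail. The function name `iter_headers` and the two sample messages are hypothetical.

```python
import mailbox
import os
import tempfile


def iter_headers(mbox_path, names=("From", "To", "Date")):
    """Yield one dict of the selected headers per message in an mbox file."""
    box = mailbox.mbox(mbox_path, create=False)
    try:
        for message in box:
            yield {name: message.get(name, "") for name in names}
    finally:
        box.close()


# Hypothetical two-message mbox content, made up for this sketch.
SAMPLE_MBOX = (
    "From alice@example.org Sat Mar 24 14:20:39 2007\n"
    "From: alice@example.org\n"
    "To: bob@example.org\n"
    "Date: Sat, 24 Mar 2007 14:20:39 +0000\n"
    "Subject: first\n"
    "\n"
    "Hello Bob.\n"
    "\n"
    "From bob@example.org Mon Mar 26 08:10:52 2007\n"
    "From: bob@example.org\n"
    "To: alice@example.org\n"
    "Date: Mon, 26 Mar 2007 08:10:52 +0000\n"
    "Subject: second\n"
    "\n"
    "Hello Alice.\n"
)

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "sample.mbox")
    with open(path, "w", newline="\n") as handle:
        handle.write(SAMPLE_MBOX)
    # Consume the generator while the temporary file still exists.
    rows = list(iter_headers(path))

for row in rows:
    print(row)
```

Working from an exported mailbox file like this avoids touching the live mail account, at the cost of analyzing a snapshot rather than a running stream of messages.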
From: Leo S. <leo...@df...> - 2007-03-27 15:40:50
Hi Pekka,

I cannot answer now; I am in a three-day review meeting.

Perhaps you want to try out Aduna AutoFocus, which handles most of these problems for you out of the box. Other Aperture developers may also want to answer, or try to reach me again next week.

Kindest regards, and sorry not to react now,
Leo

It was Pekka Henttonen who said at the right time 26.03.2007 21:16 the following words:
> From the message of 26 March 2007 at 11:10 (Leo Sauermann):
>
>> Hi Pekka,
>> As far as I know, aperture does only crawl mails from IMAP servers, so
>> you have to move the test mails to an imap server. Or we hack aperture
>> to also crawl MBOX files (which is possible using the javamail api we
>> use inside)
>
>> If you already exactly know what you want and want it quick, it may be
>> better for you to use Java Mail - then you get the headers pretty easy.
>
>> If you want to be able to do more on the data, store it in a database,
>> perhaps add annotations using algorithms, then aperture may be good for you.
>> Aperture uses RDF to represent information, if you are familiar with RDF
>> (Semantic Web) then aperture is of use. Aperture can be used in
>> combination with an RDF database, so then you have all the e-mail fields
>> in a database and can program your statistical analysis based on the
>> database, you would then use a RDF query language for this, such as
>> SERQL or SPARQL.
>
> Thanks for the answer, Leo. I have been thinking about JavaMail. There are
> several questions:
>
> 1) how to capture email messages without interfering with the email
>    usage,
> 2) how to process headers, and
> 3) how to analyze the extracted information.
>
> Ultimately the goal is to build intelligent systems for information
> management, so I find RDF an interesting alternative.
>
> Pekka
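For the third question in the quoted message (analyzing the extracted information), one simple starting point is to count how often each sender writes to each recipient, which gives the weighted edges of a communication network. The sketch below is a plain-Python illustration under that assumption; `count_edges` and the sample records are hypothetical and not part of Aperture or JavaMail.

```python
from collections import Counter


def count_edges(records):
    """Count directed (sender, recipient) pairs across message records.

    Each record is a (sender, recipients) tuple; addresses are lower-cased
    so that differently capitalized spellings count as one person.
    """
    edges = Counter()
    for sender, recipients in records:
        for recipient in recipients:
            edges[(sender.lower(), recipient.lower())] += 1
    return edges


# Hypothetical records, as they might come out of a header-extraction step.
RECORDS = [
    ("alice@example.org", ["bob@example.org", "carol@example.net"]),
    ("bob@example.org", ["alice@example.org"]),
    ("Alice@example.org", ["bob@example.org"]),
]

edges = count_edges(RECORDS)
top = edges.most_common(1)
print(top)
```

The same counts could equally be produced by a query against an RDF store, as suggested earlier in the thread; the in-memory counter is just the shortest way to show the idea.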
From: Herko t. H. <her...@ad...> - 2007-03-27 16:00:07
Hello Pekka,

> I cannot answer now, I am on a three day review meeting,
>
> Perhaps you want to try out Aduna Autofocus, which does most of the
> problems for you out of the box.

While I'm of course thrilled to see a recommendation of our product (thanks Leo!), I'm not entirely sure AutoFocus is what you are looking for in this case. Of course it is still a nice product that could be useful to you, for inspiration if nothing else. I would recommend you try the AutoFocus 4.0 Beta 2 version, which is based on Aperture and is in general much improved over the current 3.2 version.

However, I don't think AutoFocus is the full answer to your question. I believe Aperture by itself would be more appropriate for your application, as you intend to build on top of the information Aperture can extract from (among other things) e-mail messages.

Please know that Aperture was designed to be very extensible, so if a particular feature (for example, crawling MBOX files) is missing, it would be worthwhile to look into adding it to Aperture.

Cheers,

Herko ter Horst
Senior Software Engineer
--
Aduna - Guided Exploration
www.aduna-software.com
Prinses Julianaplein 14-b
3817 CS Amersfoort
The Netherlands
+31-33-4659987 (office)