Learn how easy it is to sync an existing GitHub or Google Code repo to a SourceForge project! See Demo

Close

Information Retrieval in Database

Developers
2010-02-09
2012-08-29
  • Hi,

    The search feature is a very powerful functionality of RSSOwl (the main reason
    why I was interested in it).

    I would like to develop a small information retrieval program over the data
    stored by the application for personal purposes.

    More precisely, I would like to have access to the content of the feeds and/or
    the web pages they refer in order to count frequent words and other stuff. Is
    there an easy way to get that information? I mean, to query the database and
    produce simple text or XML files to process them in a second step? I have no
    idea on how to do that. Maybe you can give me some hints.

     
  • You should be able to get this information from the Lucene index, its the
    search engine used. otherwise, the DB is db4o, you would need to write some
    Java code to access its data.

     
  • Is there some documentation on the RSSOwl database model ("tables"*, fields,
    a.s.o.)? Which project file could help me to understand how it is interfaced
    with db4o (e.g. DBManager.java)?

    • I am not familiar with object databases.
     
  • No there is no real documentation but basically anything that extends IEntity
    is part of persisted entities in the db4o database. DBManager is a good start
    for how we configure db4o for use, otherwise e verything of the package
    org.rssowl.core.persist/internal and below that.