Learn how easy it is to sync an existing GitHub or Google Code repo to a SourceForge project! See Demo

Close

#5 Apache Tika

Next Release
closed
Converters (5)
5
2011-04-05
2011-01-08
John Dickinson
No

Apache Tika™ is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. It provides command-line access to content parsers for various document formats developed for the Apache Lucene Project, notably OOXML, ODF, RTF, PDF, ePub, HTML and XML. It is also capable of parsing metadata from several audio, and image formats, as well as flv. It is available from http://tika.apache.org.

Discussion

  • John Dickinson
    John Dickinson
    2011-01-13

    • status: open --> pending
     
  • This Tracker item was closed automatically by the system. It was
    previously set to a Pending status, and the original submitter
    did not respond within 14 days (the time period specified by
    the administrator of this Tracker).

     
    • status: pending --> closed
     
  • John Dickinson
    John Dickinson
    2011-04-05

    Implemented

     
  • John Dickinson
    John Dickinson
    2011-04-05

    • assigned_to: nobody --> einarin
    • status: closed --> open
     
  • John Dickinson
    John Dickinson
    2011-04-05

    • status: open --> closed