#104 Create an experimental TikaMimeTypeIdentifier

Milestone: 1.6.0 - features
Status: closed
Owner: Antoni Mylka
Labels: None
Priority: 5
Updated: 2011-11-09
Created: 2010-08-11
Creator: Antoni Mylka
Private: No

The latest improvements in the Tika MIME type detection code look very promising. The ContainerAwareDetector class, with the normal MimeTypes as a fallback, is a potential solution to:

https://sourceforge.net/tracker/?func=detail&aid=1838840&group_id=150969&atid=779503 (distinguish between various XML filetypes)
https://sourceforge.net/tracker/?func=detail&aid=2210328&group_id=150969&atid=779503 (distinguish between various ZIP filetypes (ODT, Open XML) without knowing the file name)
... and the age-old problem of distinguishing between MS Office files without knowing the file name
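
To make the "container-aware detection with a fallback" idea concrete, here is a minimal sketch in plain Java. It does not use Tika and is not how ContainerAwareDetector is implemented. The class name ContainerSniffSketch and all of its methods are hypothetical, and the OOXML check is a simplification based on entry-name prefixes. Distinguishing XML subtypes and OLE2-based MS Office files is left out entirely.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical sketch class; not part of Tika or Aperture.
final class ContainerSniffSketch {

    static final String DOCX = "application/vnd.openxmlformats-officedocument.wordprocessingml.document";
    static final String XLSX = "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet";
    static final String PPTX = "application/vnd.openxmlformats-officedocument.presentationml.presentation";

    /** Detects a MIME type from content alone; the file name is never consulted. */
    static String detect(byte[] data) {
        // ZIP local file header signature: 'P' 'K' 0x03 0x04
        if (startsWith(data, new byte[] {'P', 'K', 3, 4})) {
            String inner = detectZipContainer(data);
            return inner != null ? inner : "application/zip";
        }
        return detectByMagic(data);
    }

    /** Container-aware step: look at the entries inside the ZIP. */
    static String detectZipContainer(byte[] data) {
        try (ZipInputStream zip = new ZipInputStream(new ByteArrayInputStream(data))) {
            for (ZipEntry e = zip.getNextEntry(); e != null; e = zip.getNextEntry()) {
                String name = e.getName();
                if (name.equals("mimetype")) {
                    // ODF packages carry their own type in a "mimetype" entry.
                    // readAllBytes() needs Java 9+ and stops at the end of this entry.
                    return new String(zip.readAllBytes(), StandardCharsets.US_ASCII).trim();
                }
                // Simplification: a real detector would read [Content_Types].xml.
                if (name.startsWith("word/")) return DOCX;
                if (name.startsWith("xl/")) return XLSX;
                if (name.startsWith("ppt/")) return PPTX;
            }
            return null; // a ZIP, but no known container layout
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
    }

    /** Fallback step: plain magic-byte checks on the first bytes. */
    static String detectByMagic(byte[] data) {
        if (startsWith(data, "%PDF-".getBytes(StandardCharsets.US_ASCII))) return "application/pdf";
        if (startsWith(data, "<?xml".getBytes(StandardCharsets.US_ASCII))) return "application/xml";
        return "application/octet-stream";
    }

    static boolean startsWith(byte[] data, byte[] prefix) {
        if (data.length < prefix.length) return false;
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) return false;
        }
        return true;
    }

    /** Demo helper: builds a one-entry ZIP in memory. */
    static byte[] zipOf(String entryName, String content) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            zip.putNextEntry(new ZipEntry(entryName));
            zip.write(content.getBytes(StandardCharsets.US_ASCII));
            zip.closeEntry();
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) {
        System.out.println(detect(zipOf("mimetype", "application/vnd.oasis.opendocument.text")));
        System.out.println(detect(zipOf("word/document.xml", "<w:document/>")));
        System.out.println(detect("<?xml version=\"1.0\"?><a/>".getBytes(StandardCharsets.US_ASCII)));
    }
}
```

The point of the sketch is the ordering: the container check runs first and only reports a type when the archive layout is recognised; everything else falls through to the generic magic-byte detector, so no file name is needed at any step.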

Discussion

  • Antoni Mylka
    2011-11-09

    This is as good as done already. Further issues with the Tika MIME type identifier can be reported as separate bugs.

     
  • Antoni Mylka
    2011-11-09

    • status: open --> closed