Monday, July 20, 2015
File identification tools, part 7: Apache Tika
File identification tools, part 7: Apache Tika. Gary McGath. Mad File Format Science Blog. July 1, 2015.
Apache Tika is a Java-based open source toolkit that can identify a wide range of file formats and extract metadata from many of them. It doesn't distinguish between variants of a format as finely as DROID does. Plugins can be added for formats that it does not support out of the box.
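To make the idea of signature-based identification a bit more concrete, here is a minimal Java sketch that matches the leading bytes of a file against a small table of known signatures and lets the caller register extra ones, loosely mirroring the plugin idea. This is not Tika's API or its implementation. The class `SignatureSketch`, its methods, and the `XFMT` format with its `application/x-example` type are all hypothetical names made up for illustration, and Tika itself relies on a far larger signature database.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only: NOT part of Apache Tika.
final class SignatureSketch {

    // One known signature: the leading "magic" bytes and the type they imply.
    private static final class Signature {
        final byte[] magic;
        final String mimeType;

        Signature(byte[] magic, String mimeType) {
            this.magic = magic.clone();
            this.mimeType = mimeType;
        }
    }

    private final List<Signature> signatures = new ArrayList<>();

    SignatureSketch() {
        // A few well-known signatures to start with.
        register("%PDF-".getBytes(StandardCharsets.US_ASCII), "application/pdf");
        register(new byte[] {(byte) 0x89, 0x50, 0x4E, 0x47}, "image/png");
        register("GIF8".getBytes(StandardCharsets.US_ASCII), "image/gif");
    }

    // Adds a signature for a format the table does not know yet.
    void register(byte[] magic, String mimeType) {
        signatures.add(new Signature(magic, mimeType));
    }

    // Returns the type of the first matching signature, or a generic fallback.
    String detect(byte[] head) {
        for (Signature s : signatures) {
            if (startsWith(head, s.magic)) {
                return s.mimeType;
            }
        }
        return "application/octet-stream";
    }

    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        SignatureSketch sketch = new SignatureSketch();
        // "XFMT" is a made-up signature standing in for a plugin-supplied format.
        sketch.register("XFMT".getBytes(StandardCharsets.US_ASCII), "application/x-example");
        byte[] head = "%PDF-1.4".getBytes(StandardCharsets.US_ASCII);
        System.out.println(sketch.detect(head));
    }
}
```

A table like this reports only a broad type per signature, which hints at why a tool built this way may tell fewer format variants apart than a registry-driven tool such as DROID.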