Apache Tika 0.5 Released, Solr Cell updated on trunk
The Apache Tika team just announced the release of Tika 0.5 (the announcement is copied below). I upgraded Apache Solr’s Tika integration (aka Solr Cell) to use the new libraries this morning. To use it, check out Apache Solr’s SVN trunk.
The Apache Lucene project is pleased to announce the release of Apache Tika 0.5. The release contents have been pushed out to the main Apache release site and the m2 ibiblio sync, so the releases should be available as soon as the mirrors get the syncs.
Apache Tika, a subproject of Apache Lucene, is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
Apache Tika 0.5 contains a number of improvements and bug fixes. Details can be found in the changes file:
http://www.apache.org/dist/lucene/tika/CHANGES-0.5.txt
Apache Tika is available in source form from the following download page:
http://www.apache.org/dyn/closer.cgi/lucene/tika/apache-tika-0.5-src.zip
Apache Tika is also available in binary form or for use using Maven 2 from the Central Maven Repositories:
http://repo1.maven.org/maven2/org/apache/tika/0.5/
http://mirrors.ibiblio.org/pub/mirrors/maven2/org/apache/tika/0.5/
In the initial 48 hours, the release may not be available on all mirrors.
When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site:
http://www.apache.org/dist/lucene/tika/KEYS-0.5.txt
For more information on Apache Tika, visit the project home page:
http://lucene.apache.org/tika