Posts Tagged: Nutch

The Apache Lucene Ecosystem: My View of 2010

After a week off to enjoy time with my family, I thought I would kick off the last week of 2010 with a look back at the year as it…

[REFRESH] Using Nutch with Solr

In preparation for my upcoming talk on Apache Hadoop and Search, I thought I would try using Nutch (the project from which Hadoop originated) to index some content into Solr. …

The Apache Lucene Ecosystem: My view of 2009

It’s that time of year, so I thought I would take a look back at the year that was for the Lucene Ecosystem and maybe look ahead just a little…

Apache Nutch 1.0 released

Apache Nutch, a subproject of Apache Lucene, is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and…