Articles: SearchHub

Blog

Apache Solr 4 Enterprise Search Server: RAW

As a developer building an app then you know how important a good search experience is. Apache Solr, built on...
Read Now
Blog

Apache Zookeeper Tutorial for Distributed Apps

By Sean Mackrory, Cloudera Inc. It’s widely accepted that you should never design or implement your own cryptographic algorithms but...
Read Now
Blog

A Short Introduction to Indexing / Search using Lucene

At times I find I need an indexing tool to do something akin to an embedded database.  This is an...
Read Now
Blog

Starfish: A Hadoop Performance Tuning Tool

This post discusses Starfish a Self-tuning System For Big Data Analytics. It's an open source project hosted at GitHub: https://github.com/jwlent55/Starfish Starfish enables Hadoop...
Read Now
Blog

Creating Custom Solr Request Params

Recently, someone on the #solr IRC channel asked me a question about using multiple filter queries in a request, while...
Read Now
Blog

High throughput indexing in Solr

Sometimes you need to index a bunch of documents really, really fast. Even with Solr 4.0 and soft commits, if...
Read Now
Blog

DuckDuckGo – 1 Million Deep Searches A Day And Growing

Who knew! The search engine DuckDuckGo runs on Solr and easily handles 1 million deep searches a day. Based on...
Read Now
Blog

Taming Text is released!

A new exciting book just published from Manning, with the catchy title Taming Text, by Grant S. Ingersoll (fellow Apache...
Read Now
Blog

Article: Apache Solr – Slow queries and frequent terms

Interesting article that presents findings that demonstrate the effect of term frequency and number of frequent terms in a single...
Read Now
Blog

Lucene/Solr 4 – Management & Administrative User Interface

Interested in Lucene/Solr 4? Want to see how the User Interface stacks up? For many enterprise IT customers, while they...
Read Now
Blog

Article: HPCC: An Open Source Big Data Competitor to Hadoop

This article discusses an Open Source alternative to Hadoop: HPCC (High-Performance Computing Cluster). HPCC was developed by LexisNexis Risk Solutions and is released...
Read Now
Blog

Solr4 Documentation

The Solr 4 reference guide is now live in the Lucidworks and SearchHub Documentation Center. New topics covered include: Major...
Read Now