Apache Lucene, Apache Solr, Open Source, SearchHub

Actual mileage may vary

by Lucidworks
March 19, 2010

A few weeks after the announcement from Microsoft that FAST is no longer to be available on Linux/Unix, interesting stories continue to pop up about use of Lucene and Solr in its place. Most recently, a benchmark from Technology Services Group, an open source content management solutions consultancy and integration shop based out of Chicago. In a blog post earlier this week, they describe a proof of concept for a large pharmaceutical client, benchmarking search on 156,000 documents in an external data source indexed by Lucene. The search application was part of a larger CMS solution centered around EMC documentum.

Lucene/HPI [the TSG Documentum Lucene-based solution] and the external repository was found to be considerably quicker that the existing FAST/Webtop implementation on most queries.

Specific results:

Query FAST/Webtop Lucene/HPI

1200 Results 90 seconds 3 seconds

8 Results 5 seconds 3 seconds

10 Results 8 seconds 4 seconds

76 Results 10 seconds 5 seconds

5100 Results 72 seconds 5 seconds

65 Results 6 seconds 3 seconds

Simple configuration of the Lucene index did a better job of returning a more complete search result set than the standard FAST/webtop configuration. Examples included additional documents that were logical derivatives of the initial search word. For example – a search for “exception report” could return “exceptions report” or “exception reports”. The proof of concept data set also included German documents and Lucene demonstrated multilingual stemming capability.

Better than 10x reduction sure sounds sweet. Now, with any benchmark, the devil is in the details: lies, damned lies, and benchmarks. They’re tougher to construct objectively than a sweet set of outputs might imply. And so for me, the real punchline is in a different set of numbers:

The flexibility of Lucene to index both the metdata and full-text values allowed the client to avoid adding an additional Oracle database to their external cache for attribute storage.

One less check to Oracle — that’s real money.

About Lucidworks

LEARN MORE

Contact us today to learn how Lucidworks can help your team create powerful search and discovery applications for your customers and employees.

Fusion Platform Overview

Fusion Platform Pricing

AI Hub

Lucidworks Features and capabilities (all Included)

Product Discovery

Searchandising

Site Search

Workplace Search

Ingest Data and Capture Signals

Employee Search Experience

Customer Service and Case Resolution

AI and Large Language Models

Solutions

Commerce

Customer Service

Knowledge Management

Industries

Retail

Government and Public Sector

Healthcare

B2B Commerce and Distribution

B2B Manufacturing

Financial Services

EXPLORE OUR CONTENT

Ebooks & Reports

Blog

Videos

Press

Resources

About Lucidworks

Documentation

Careers

LucidAcademy

Contact Us

Technical Support

Actual mileage may vary

About Lucidworks

LEARN MORE

Query	FAST/Webtop	Lucene/HPI
1200 Results	90 seconds	3 seconds
8 Results	5 seconds	3 seconds
10 Results	8 seconds	4 seconds
76 Results	10 seconds	5 seconds
5100 Results	72 seconds	5 seconds
65 Results	6 seconds	3 seconds

Fusion Platform Overview

Fusion Platform Pricing

AI Hub

Lucidworks Features and capabilities (all Included)

Product Discovery

Searchandising

Site Search

Workplace Search

Ingest Data and Capture Signals

Employee Search Experience

Customer Service and Case Resolution

AI and Large Language Models

Solutions

Commerce

Customer Service

Knowledge Management

Industries

Retail

Government and Public Sector

Healthcare

B2B Commerce and Distribution

B2B Manufacturing

Financial Services

EXPLORE OUR CONTENT

Ebooks & Reports

Blog

Videos

Press

Resources

About Lucidworks

Documentation

Careers

LucidAcademy

Contact Us

Technical Support

About Lucidworks

Related Articles

LEARN MORE