Approaching Join Index in Apache Lucene

As we countdown to the annual Lucene/Solr Revolution conference in Austin this October, we’re highlighting talks and sessions from past conferences. Today, we’re highlighting Mikhail Khludnev’s session on joins and block-joins in Lucene.

Lucene works great with independent text documents, but real life problems often require to handle relations between documents. Aside from several workarounds, like term encodings, field collapsing or term positions, we have two mainstream approaches to handle document relations: join and block-join. Both have their downsides. Join lacks performance, while block-join makes is really expensive to handle index updates, since it requires to wipe a whole block of related documents.

This session presents an attempt to apply join index, borrowed from RDBMS world, for addressing drawbacks of the both join approaches currently present in Lucene. We will look into the idea per se, possible implementation approaches, and review the benchmarking results.

Mikhail has years of experience building backend systems for retail industry. His interests span from general systems architecture, API design and performance engineering all the way to testing approaches. For last few years he works on eCommerce search platform extending Lucene and Solr, contributes back to community, spokes at Lucene Revolution and other conferences.

http://www.slideshare.net/lucidworks/approaching-join-index-presented-by-mikhail-khludnev-grid-dynamics

lucenerevolution-avatarJoin us at Lucene/Solr Revolution 2015, the biggest open source conference dedicated to Apache Lucene/Solr on October 13-16, 2015 in Austin, Texas. Come meet and network with the thought leaders building and deploying Lucene/Solr open source search technology. Full details and registration…

Share the knowledge

You Might Also Like

The New SEO: How to Make Your Products Discoverable by AI Assistants

Quick Take: AI assistants are increasingly performing product research on behalf of...

Read More

AI Product Discovery vs. Traditional Search in B2B Manufacturing and Distribution

In the high-stakes world of B2B manufacturing and distribution, the "findability" of...

Read More

Is Your Product Catalog Ready for AI Buyers?

AI assistants are increasingly acting as buyers on customers' behalf. Instead of...

Read More

Quick Links