Approaching Join Index in Apache Lucene

As we countdown to the annual Lucene/Solr Revolution conference in Austin this October, we’re highlighting talks and sessions from past conferences. Today, we’re highlighting Mikhail Khludnev’s session on joins and block-joins in Lucene.

Lucene works great with independent text documents, but real life problems often require to handle relations between documents. Aside from several workarounds, like term encodings, field collapsing or term positions, we have two mainstream approaches to handle document relations: join and block-join. Both have their downsides. Join lacks performance, while block-join makes is really expensive to handle index updates, since it requires to wipe a whole block of related documents.

This session presents an attempt to apply join index, borrowed from RDBMS world, for addressing drawbacks of the both join approaches currently present in Lucene. We will look into the idea per se, possible implementation approaches, and review the benchmarking results.

Mikhail has years of experience building backend systems for retail industry. His interests span from general systems architecture, API design and performance engineering all the way to testing approaches. For last few years he works on eCommerce search platform extending Lucene and Solr, contributes back to community, spokes at Lucene Revolution and other conferences.

http://www.slideshare.net/lucidworks/approaching-join-index-presented-by-mikhail-khludnev-grid-dynamics

lucenerevolution-avatarJoin us at Lucene/Solr Revolution 2015, the biggest open source conference dedicated to Apache Lucene/Solr on October 13-16, 2015 in Austin, Texas. Come meet and network with the thought leaders building and deploying Lucene/Solr open source search technology. Full details and registration…

Share the knowledge

You Might Also Like

Lucidworks Named a Leader: What This Means for Search, AI—and Your Business

Lucidworks’ recognition as a Leader signals that enterprise search and AI now...

Read More

MCP vs. ACP: What’s the Difference, and When Should Each Be Used?

Artificial intelligence is changing how people interact with data, products, and content....

Read More

The Future of Digital Commerce with ACP: From Static Catalogs to Agent Negotiations

For decades, digital commerce has been built around a familiar concept: the...

Read More

Quick Links