Search Results for: document/f05d9e47783f6fe8/do_we_agree_on_our_rtc_way_of_working_was_welcome_ryan_mckinley

Getting Started with Lucene Setup

document in the collection and is a tuple of words (actually they are weights for the words.) The input query q is also mapped into this space based on its…

Tags:

Content Extraction with Tika

so that the parsed source document does not need to be loaded into memory all at once but only as it is needed. Ultimately, however, the amount of resources consumed…

Tags:

Interview with Ian Holsman of Relegence (AOL)

just all documents. You can also filter them on different things and stuff like that. Obviously it was a big hit in the financial services stuff. So we could actually…

Tags: ,

Scaling Lucene and Solr

simple, full blown solution that can scale to billions of documents. In a distributed configuration, one server ‘shard’ will get a query request and then search itself, as well as…

Tags:

Solr Cloud Document Routing

Overview Solr Cloud document routing was released in Solr 4.1. This feature expanded upon the simple hash based routing that was available in Solr 4.0 by introducing a new…

Tags: , , ,

Options to tune document’s relevance in Solr

Working at Lucid Imagination a customer once asked me about how they could modify the score of the documents in Solr in order to get most relevant results higher…

Tags: , ,

pipeline_preview_5a

When the mapping gets tough, the tough use JavaScript

as the first stage of the pipeline, we see the two input documents. The output of the Javascript stage shows that for the first document, the conditional statement doc.hasField(“body”) evaluates…

Debugging Search Application Relevance Issues

sole means of testing. The collection is not free or open. Doing well in TREC doesn’t necessarily translate to doing well in real-life. Online Ratings Let users rate documents using…

Tags:

Optimizing Findability in Lucene and Solr

a common query is the best result, then make it the best result. Don’t waste one iota of your time thinking about why a document occurs in position two versus…

Tags:

Exploring Lucene's Indexing Code: Part 2

our Document, break up the text into terms, and then build the above referenced structures. Next, we start by exploring the Lucene indexing chain that is kicked off with addDocument……