Information: we all love to make more of it, but it sure piles up. Forbes Magazine Online blogger Quentin Hardy takes on making sense of it in a nice post about Solr/Lucene and Lucid Imagination. Some soundbites:
Our civilization may pride itself on the amount of information we create – more than every conversation, ever, in a couple of years; enough to jack The Library of Congress to the Moon on data-packed CD-Roms, take your superlative – but we’ve also made a holy mess of it. It is mostly, as they say, “unstructured,” meaning as random as your last 25 emails, your tweets, and all those spreadsheets and documents piling up at work. That’s why Solr/Lucene and Lucid Imagination are names you need to know in tech.
That’s because of the phenomenal amount of data that lies behind firewalls, inside corporate and private databases, or is otherwise unseen by the Web crawlers. “The U.S. Department of Energy’s database is probably larger than the Internet, just by itself,” says Marc Krellenstein, Lucid’s Chief Technical Officer. “There is a far bigger amount of material out there that has to be searched, all the time.”
“Enterprise search is almost the opposite of Web search,” says Lucid chief executive Eric Gries. “For them, scale and performance matter most, while relevance comes third. In an enterprise, we’d better hit a query right.”
Read the full article here.