This article discusses an Open Source alternative to Hadoop: HPCC (High-Performance Computing Cluster). HPCC was developed by LexisNexis Risk Solutions and is released under an Apache 2.0 license and is available in a free community edition and an enterprise edition. The HPCC platform includes system configurations to support both parallel batch data processing (Thor) and high-performance online query applications using indexed data files (Roxie). The HPCC platform also includes a data-centric declarative programming language for parallel data processing called ECL.

There are four key differentiators between HPCC and Hadoop platforms. You can read more about them here. Of the four differentiators, one of the biggest is that HPCC doesn’t rely on MapReduce.

About Lucidworks

Read more from this author


Contact us today to learn how Lucidworks can help your team create powerful search and discovery applications for your customers and employees.