HPCC Systems® (www.hpccsystems.com) from LexisNexis® Risk Solutions is a proven, open source solution for Big Data insights that can be implemented by businesses of all sizes. With HPCC Systems, developers can design applications with Big Data at their core, enabling businesses to better analyze and understand data at scale and to reach results and decisions faster. HPCC Systems offers a consistent data-centric programming language, two processing platforms and a single, complete end-to-end architecture for efficient processing. Read our blog (http://hpccsystems.com/blog), or connect with us on Twitter (@hpccsystems), Facebook (https://www.facebook.com/hpccsystems) and LinkedIn (http://www.linkedin.com/company/hpcc-systems).
HPCC Systems is available on AWS and can be configured through the Instant Cloud Solution.
Features
- End-to-end configuration for big data processing and analytics
- Standard hardware, operating system and protocols
- High redundancy and availability
- Practical tools and extensions
- Efficient programming with the declarative, modular, extensible Enterprise Control Language (ECL)
- Optimized distributed file systems (DFS)
- Massive scalability and performance
- Completely free to the open source community
- Want to learn more? https://hpccsystems.com/about#Features
License
Apache License V2.0
User Reviews
- This has so far been the most fun and satisfying experience I have had when it comes to adopting a system for your everyday work. The tagline says it all, 'Simple, Fast, Scalable'. I have worked on datasets with billions of records in computing scenarios as complex as would make a Hadoop developer cringe. Granted that it uses a declarative language, a paradigm which is not the most familiar to programmers in 2013, and the system has its own non-standard quirks, it is still a platform which makes you feel at ease.
- I give this a five star rating! My personal experience has been that I was able to download the HPCC VM image and get started with loading and transforming data in a few minutes. Thanks to their central deployment tool, setting up a 3 node Ubuntu cluster was really simple. The inherent parallelism and data flow nature of the powerful ECL language removes the worry about trying to parallelize my jobs, as was the case in my experience with Hadoop MapReduce. In fact, I have to say ECL is somewhat similar to SQL from the perspective both are declarative data programming languages. So if you are a good SQL developer, ECL should be a breeze to understand and use. It is a mature platform and provides for a data delivery engine together with a data transformation and linking system. The main advantages over other alternatives are the real-time delivery of data queries and the extremely powerful ECL language programming model.
Additional Project Details
Database Environment
Other network-based DBMS
Add-ons & Plugins
- Eclipse
- JDBC Driver
- R Integration
- Pentaho Kettle Plugins
Platform Extensions
- Machine Learning