Top 16 Hadoop Technology Companies

Hadoop, a platform developed by The Apache Software Foundation, is a popular open-source Big Data platform for distributed processing of large datasets across clusters of computers. Each system in Apache Hadoop acts as a storage device and as a computation platform. It is one of the most widely used platforms for developers to build Big Data solutions. It offers easy scalability options from a single system to thousands of machines and uses commodity hardware, which reduces costs for organizations.

Due to the increasing need for Big Data solutions and many other growth drivers, the Global Hadoop Market is expected to reach US$2.11 billion by 2018, growing at a CAGR of 42.04 percent during the forecast period 2013-2018. 

Since last year, the Hadoop market vendor landscape has shifted, yielding new competitors. TechNavio analysts have pinpointed the top 16 hadoop technology companies expected to contribute to this fast-growing market:

Hadoop: Amazon Web Services

“Amazon Elastic MapReduce provides a managed, easy to use analytics platform built around the powerful Hadoop framework. Focus on your map/reduce queries and take advantage of the broad ecosystem of Hadoop tools, while deploying to a high scale, secure infrastructure platform.”

 

Hadoop: Cloudera

“Cloudera develops open-source software for a world dependent on Big Data. With Cloudera, businesses and other organizations can now interact with the world’s largest data sets at the speed of thought — and ask bigger questions in the pursuit of discovering something incredible.”

 

Hadoop: EMC2

“Pivotal HD features native integration of EMC’s industry leading Greenplum® massively parallel processing (MPP) database with Apache Hadoop—the most cost-effective and flexible open source Big Data platform ever developed. The new EMC Greenplum-developed HAWQ™ technology brings 10 years of large scale data management research and development to Hadoop and delivers more than 100X performance improvements when compared to existing SQL-like services on top of Hadoop , making Pivotal HD the single most powerful Hadoop distribution in the industry.”

 

Hadoop: Hortonworks

“At Hortonworks, we believe that Hadoop is an enterprise viable data platform and that the most effective path to its delivery is within the open community. To this end, we build, distribute and support a 100% open source distribution of Apache Hadoop that is truly enterprise grade and follow these three key principles:  identify and introduce enterprise requirements into the public domain, work with the community to advance and incubate open source projects, and apply Enterprise Rigor to deliver the most stable and reliable distribution”

 

Hadoop: IBM

“IBM InfoSphere BigInsights makes it simpler for people to use Hadoop and build big data applications. It enhances this open source technology to withstand the demands of your enterprise, adding administrative, discovery, development, provisioning, and security features, along with best-in-class analytical capabilities from IBM Research. The result is that you get a more developer and user-friendly solution for complex, large scale analytics.”

 

Hadoop: MAPR

“MapR delivers on the promise of Hadoop with a proven, enterprise-grade platform that supports a broad set of mission-critical and real-time production uses. MapR brings unprecedented dependability, ease-of-use and world-record speed to Hadoop, NoSQL, database and streaming applications in one unified Big Data platform. MapR is used across financial services, retail, media, healthcare, manufacturing, telecommunications and government organizations as well as by leading Fortune 100 and Web 2.0 companies.” 

 

Hadoop: Microsoft

“Quickly build a Hadoop cluster in minutes when you need it, and delete it when your work is done. Choose the right cluster size to optimize for time to insight or cost. Seamlessly integrate HDInsight into your existing analysis workflows with Windows Azure PowerShell and Windows Azure Command-Line Interface.”

 

Hadoop: Datameer

“Datameer’s Big Data analytics application for Hadoop ensures the fastest time to discovering insights in any data. Anyone can use Datameer’s wizard-based data integration, iterative point-and-click analytics, and drag-and-drop visualizations to find the insights that matter to drive their business forward. Founded by Hadoop veterans in 2009, Datameer scales from a laptop to thousands of nodes and is available for all major Hadoop distributions”

 

Hadoop: Hadapt

“Hadapt’s flagship product is the Adaptive Analytical Platform, which brings a native implementation of SQL to the Apache Hadoop open-source project. By combining the robust and scalable architecture of Hadoop with a hybrid storage layer that incorporates a relational data store, Hadapt allows interactive SQL-based analysis of massive data sets. Hadapt 2.0 delivers the industry’s first interactive applications on Hadoop, via Hadapt Interactive Query; the Hadapt Development Kit™ (HDK) for custom analytics; and integration with Tableau Software.”

 

Hadoop: Adello

“With AdCTRL, Adello developed cutting-edge technology and tested and proven proprietary algorithms for realtime analytics, user-identification and decisioning. Combining various tested methods for device-recognition and running realtime-analytics with proprietary algorithms, AdCTRL is probably the first solution to reliably target cross-device. With the ubiquity of new devices the advantage of targeting audiences is obvious and has become a necessity. We’re glad to provide a solution with AdCTRL.”

 

Hadoop: Karmasphere

“Karmasphere is designed for teams of analysts to explore and analyze Big Data on Hadoop, and to discover business insights about their customers that can be applied to all points of customer engagement. Installed on a physical or virtual Linux server and accessed via industry standard web browsers, Karmasphere is an intuitive, self-service environment for maximizing the value of any and all available data.”

 

Hadoop: NG DATA

“Lily is a data management platform combining planet-sized data storage, indexing and search with on-line, real-time usage tracking, audience analytics and content recommendations. Lily unifies Apache HBase, Hadoop and Solr into a comprehensively integrated, interactive data platform with easy-to-use access APIs, a high-level data model and schema language, flexible, real-time indexing and the expressive search power of Apache Solr. Best of all, Lily is open source – allowing anyone to explore and learn what Lily can do.”

 

Hadoop: Oracle

“Oracle Loader for Hadoop (OLH), part of Oracle Big Data Connectors, is a MapReduce utility to optimize data loading from Hadoop into Oracle Database. OLH sorts, partitions, and converts data into Oracle Database formats on Hadoop, and loads the converted data into the database. By preprocessing the data to be loaded as a Hadoop job on a Hadoop cluster, Oracle Loader for Hadoop reduces the CPU and IO utilization on the database. Oracle Loader for Hadoop has online and offline options.”

 

Hadoop: Pentaho

“Pentaho’s visual development tools drastically reduce the time to design, develop and deploy Hadoop analytics solutions by as much as 15x, compared to traditional custom coding and ETL approaches. Pentaho provides a powerful visual user interface for ingesting and manipulating data within Hadoop, and makes it easy to enrich Hadoop data with reference data from other sources.”

 

Hadoop: Teradata

“The Teradata Portfolio for Hadoop together with Strategic Consulting Services for Hadoop are a one-stop-shop strategy that includes ready-to-deploy Hadoop appliances, advanced ecosystem integration  and a complete set of client services to sharply accelerate production and time to value. Optimized hardware, high-speed connectors, enhanced software usability features, and Teradata’s world-class service and support are all combined in one integrated package. Unlock new data sources with enterprise-class Hadoop.”

 

Hadoop: Zettaset

“We develop enterprise software that is transparent and compatible with open source Hadoop distributions, and augments them by providing the capabilities that enterprises expect and need for their critical data center deployments…now. Our Orchestrator™ software suite protects Hadoop clusters with hardened security and high availability, and streamlines Hadoop installation and infrastructure management.