Hadoop: The Engine for Powering Big Data
Hadoop is the engine powering the Big Data era, an unstoppable force boasting massive investments and a rich ecosystem. But this is only the beginning: Hadoop has the potential to reach beyond Big Data and become the Foundation for Change, catalyzing new levels of business productivity and transformation. Hadoop will become the Foundation for Change. Apache Hadoop has become the dominant platform for Big Data analytics in recent years, thanks to its flexibility, reliability, scalability, and ability to suit the needs of developers, web startups, and enterprise IT. A fast and economic way to leverage the massive amounts of data produced by new sources such as social media, mobile sensors, social media, and Internet of Things devices, Hadoop has become the preferred platform for storage and analytics of large unstructured datasets. Originally developed in 2003 by data scientists at Yahoo!, Hadoop was quickly embraced by the open source community, as well as consumer-facing Internet giants such as Google and Facebook. In recent years, Hadoop has been embraced by enterprises who similarly need to gain actionable insight from Big Data produced by new data sources, technology innovations, cloud services, and business opportunities. IDC has predicted the Hadoop software market will be worth $813 million by 2016. Hadoop is a game changer for enterprises, transforming the economics of large-scale data analytics. It eliminates data silos, and reduces the need to migrate data between storage and analytics software, providing businesses with a more holistic view of their customers and operations, leading to quicker and more effective business insights. Its extensibility and numerous integrations can power a new generation of data-aware business applications. The software’s “refreshingly unique approach to data management is transforming how companies store, process, analyze and share big data,” according to Forrester analyst Mike Gualtieri. “Forrester believes that Hadoop will become must-have infrastructure for large enterprises.” For enterprises using proprietary data solutions and staff familiar with SQL analytics tools, transitioning to Hadoop can be challenging, despite its many advantages. Integration with existing infrastructure can present a major hurdle. To this end, Pivotal offers its enterprise-grade Hadoop distribution Pivotal HD as either a standalone product or part of the Pivotal Big Data Suite. Pivotal HD builds upon Hadoop’s strong foundation by adding features that enhance enterprise adoption and use of the platform. It enables the Business Data Lake, allowing businesses to bring their existing analytics tools to their data. Pivotal HD is the Foundation for the Business Data Lake delivering the World’s Most Advanced Real-Time Analytics Platform through GemFire XD, and the most extensive set of Advanced Analytical Toolsets through HAWQ, MADlib, OpenMPI, GraphLab and even Spring XD. Featuring HAWQ, the world’s fastest SQL query engine on Hadoop, Pivotal HD accelerates data analytics projects, leverages existing skillsets, and significantly expands Hadoop’s capabilities. Pivotal GemFire brings real time analytics to Hadoop, enabling businesses to process and make critical business decisions immediately. While leveraging Hadoop’s proven benefits, Pivotal HD adds features that ease adoption, increase productivity, and provide robust management tools. It supports leading data science tools such as MADlib, GraphLab (OpenMPI), and User-Defined Functions, adding support for popular languages such as R, Java, and Python. Pivotal HD also integrates with Spring ecosystem projects such as Spring XD, easing the development of data-driven applications and services. Allowing enterprises to collect and leverage both structured and unstructured data types, Pivotal HD enables a flexible, fault-tolerant, and scalable Business Data Lake. Pivotal’s engineers, many of whom were integral to Hadoop’s development and evolution, have built an enterprise-grade Hadoop distribution. Visit http://www.pivotal.io/big-data/pivotal-hd to learn more about Hadoop for the Enterprise.