A unified platform for BI to AI

Tanzu Greenplum is a data warehouse, analytics and AI platform that allows you to unify all your data, transforming it into actionable insights and maintaining a single source of truth.

Talk to an expert

Speed and scale

Faster time to insight due to in-database analytics and AI. Query and data ingestion for petabyte-size data sets.

Productivity

Diverse data types on a single platform (structured, semi-structured, unstructured, vector, or geospatial graph) wherever the data is located.

Flexibility

Deployment on any infrastructure type with optimizations for bare metal, public cloud and vSphere-based private cloud.

Resilience

Based on OSS Postgres. A time-tested and proven platform with features including redundant components, remote disaster recovery, enhanced security, 24x7 enterprise support.

Architecture


Greenplum architecture diagram

Features


Supporting icon

Cloud-agnostic for flexible deployment

Greenplum is available on leading public cloud marketplaces—Amazon Web Services (AWS), Microsoft Azure and Google Cloud Platform (GCP)—with “bring your own license” (BYOL) and hourly consumption models. It’s also available for VCF and OpenStack private clouds. Best of all, it’s the same Greenplum version and the same tools across all clouds for a consistent experience.

Supporting icon

Value and performance in an appliance-like experience

Dell Greenplum Reference Architecture is the most performant way to run Tanzu Greenplum in an on-premises deployment. It’s a VMware-certified and supported blueprint for Dell hardware configurations that replace proprietary appliances. Users can also deploy Greenplum on HP- and Cisco-certified configurations, as well as their own commodity hardware.

Supporting icon

Analytics from business intelligence to artificial intelligence

Machine learning, deep learning, graph, text and statistical methods are all provided in one scale-out MPP database. Get expanded text search capabilities, supporting both lexical and AI-powered semantic searches, and high speed and feature-rich geospatial querying. Extensive support for R and Python analytical libraries, as well as Keras and Tensorflow.

Supporting icon

Easily handled streaming data and cloud data

Greenplum includes integration with the messaging and streaming ecosystem, such as RabbitMQ. Together with improved low-latency writes, Greenplum provides fast event processing for streaming use cases.

Supporting icon

Maximized uptime and protected data integrity

Greenplum has features for high availability, intelligent fault detection and fast online differential recovery, as well as full and incremental backup and disaster recovery. Security and authentication features address enterprise policy and regulatory requirements.

Supporting icon

Industry-leading performance

With its unique, cost-based query optimizer designed for large-scale data workloads, Greenplum scales interactive and batch-mode analytics to large datasets in the petabytes without degrading query performance and throughput.

Supporting icon

Massively parallel, highly concurrent architecture

Greenplum features a shared-nothing architecture that automates parallel processing of data and queries and petabyte-scale data ingestion. Its cost-based query optimizer (GPORCA) was developed specifically to address advanced analytics, creating query plans that execute complex joins at breakthrough performance on large data volumes.

Supporting icon

Enhanced data federation with PXF

The Platform Extension Framework (PXF) in Greenplum has undergone improvements, enabling superior data federation. Businesses can now query datasets in Amazon Simple Storage Service (S3) object stores, Hadoop Distributed File System (HDFS) and other relational databases via JDBC. It leverages the Foreign Data Wrapper API from PostgreSQL to access remote data sources in parallel, offering an abstracted data model for managing security and statistics about the remote data for query optimizations.

Supporting icon

Multiple index types supported

Greenplum supports a broad spectrum of index types, including B-tree, Hash, Bitmap, Block Range Index, text indices, geospatial indices and AI vector indices. This feature optimizes data retrieval and query performance.

Use Cases

Enterprise analytics and AI

With support for advanced algorithms such as multi-layer perceptron and convolutional neural networks in Apache MADlib, users can begin to tackle cutting edge use cases in speech recognition, image recognition, machine translation and computer vision. With optional support for REST APIs, you can train, test and deploy in a single language (SQL), reducing the occurrence of errors when putting models into production at scale.

Flexible deployment on-premises or in the cloud

Move your analytics workloads to the platform of your choice under the terms and in the timeframe you choose. Deploy on private, sovereign, or public clouds (like AWS, Microsoft Azure, or GCP) or on-premises with Greenplum Building Blocks (GBB). Have the freedom to select the best platform for each project and workload based on ease of use, performance and total cost of ownership (TCO).

Enterprise data warehouse modernization and replatforming

Replatform legacy enterprise data warehouses (EDWs) to replace expensive, proprietary databases. Modernize with a reliable multi-cloud platform for analytics offering the full range of data warehouse functionality that your enterprise demands. Gain the power of an MPP system in conjunction with proven technology to reduce the cost and complexity of application migration.

Vector management for RAG processing

Efficient vector management is at the core of RAG processing and Tanzu Greenplum offers a robust solution. With its high-performance capabilities, Greenplum streamlines the handling and analysis of vectors, ensuring data accuracy and speed, making it an indispensable tool for optimizing RAG processing workflows.

Powering IoT applications

By seamlessly ingesting and analyzing vast streams of IoT data, Tanzu Greenplum empowers businesses to make real-time, data-driven decisions. Whether it's predictive maintenance, smart city management, or supply chain optimization, Tanzu Greenplum's high performance and scalability excel in IoT applications.