Dataiku can be deployed on-premises or in the cloud (e.g. AWS, Azure, etc) and connect via JDBC to Tanzu® Greenplum deployments. Dataiku users can then connect to, load, transform and query data tables stored within VMware Tanzu Greenplum.
To facilitate visual development, data engineers can create custom SQL Recipes in Dataiku to invoke in-database analytics functions of VMware Tanzu Greenplum such as those for data preparation and machine learning in Apache MADlib, for geospatial analysis in PostGIS, and text analytics in GPText. This allows data science teams to leverage the MPP architecture of VMware Tanzu Greenplum to process terabyte and petabyte sized data sets in parallel for faster results.