This topic describes the Greenplum Database 6 platform and operating system software requirements.

Parent topic: Installing and Upgrading Greenplum

Operating Systems

Greenplum Database 6 runs on the following operating system platforms:

  • Red Hat Enterprise Linux 64-bit 8.7 or later
  • Oracle Linux 64-bit 8, using the Red Hat Compatible Kernel (RHCK)
  • Rocky Linux 8
Important

A kernel issue in Red Hat Enterprise Linux 8.5 and 8.6 can cause I/O freezes and synchronization problems with XFS filesystems. This issue is fixed in RHEL 8.7. See RHEL8: xfs_buf deadlock between inode deletion and block allocation.

Note Do not install anti-virus software on Greenplum Database hosts as the software might cause extra CPU and IO load that interferes with Greenplum Database operations.

Greenplum Database server supports TLS version 1.2 on RHEL/CentOS systems, and TLS version 1.3 on Ubuntu systems.

Software Dependencies

Greenplum Database 7 requires the following software packages on RHEL systems. The packages are installed automatically as dependencies when you install the Greenplum RPM package):

  • apr
  • apr-util
  • bash
  • bzip2
  • curl
  • krb5
  • libcurl
  • libevent
  • libxml2
  • libyaml
  • zlib
  • openldap
  • openssh-client
  • openssl
  • openssl-libs (RHEL7/Centos7)
  • perl
  • readline
  • rsync
  • R
  • sed (used by gpinitsystem)
  • tar
  • zip

VMware Greenplum Database 7 client software requires these operating system packages:

  • apr
  • apr-util
  • libyaml
  • libevent

Greenplum Database 7 uses Python 2.7.12, which is included with the product installation (and not installed as a package dependency).

Important

SSL is supported only on the Greenplum Database coordinator host system. It cannot be used on the segment host systems.

For all Greenplum Database host systems, if SELinux is enabled in Enforcing mode then the Greenplum process and users can operate successfully in the default Unconfined context. If increased confinement is required, then you must configure SELinux contexts, policies, and domains based on your security requirements, and test your configuration to ensure there is no functionality or performance impact to Greenplum Database. Similarly, you should either deactivate or configure firewall software as needed to allow communication between Greenplum hosts. See Deactivate or Configure SELinux.

Java

Greenplum Databased 7 supports these Java versions for PL/Java and PXF:

  • Open JDK 8 or Open JDK 11, available from AdoptOpenJDK
  • Oracle JDK 8 or Oracle JDK 11

Hardware and Network

The following table lists minimum recommended specifications for hardware servers intended to support Greenplum Database on Linux systems in a production environment. All host servers in your Greenplum Database system must have the same hardware and software configuration. Greenplum also provides hardware build guides for its certified hardware platforms. Work with a Greenplum Systems Engineer to review your anticipated environment to ensure an appropriate hardware configuration for Greenplum Database.

Minimum Hardware Requirements
Minimum CPU Any x86_64 compatible CPU
Minimum Memory 16 GB RAM per server
Disk Space Requirements
  • 150MB per host for Greenplum installation
  • Approximately 300MB per segment instance for metadata
  • Cap disk capacity at 70% full to accommodate temporary files and prevent performance degradation
Network Requirements 10 Gigabit Ethernet within the array

NIC bonding is recommended when multiple interfaces are present

Greenplum Database can use either IPV4 or IPV6 protocols.

Hyperthreading

Resource Groups - one of the key Greenplum Database features - can control transaction concurrency, CPU and memory resources, workload isolation, and dynamic bursting.

When using resource groups to control resource allocation on Intel based systems, consider switching off Hyper-Threading (HT) in the server BIOS (for Intel cores the default is ON). Switching off HT might cause a small throughput reduction (less than 15%), but can achieve greater isolation between resource groups, and higher query performance with lower concurrency workloads.

VMware Greenplum on DCA Systems

You must run VMware Greenplum version 6.9 or later on Dell EMC DCA systems, with software version 4.2.0.0 and later.

Storage

The only file system supported for running Greenplum Database is the XFS file system. All other file systems are explicitly not supported by VMware.

Greenplum Database is supported on network or shared storage if the shared storage is presented as a block device to the servers running Greenplum Database and the XFS file system is mounted on the block device. Network file systems are not supported. When using network or shared storage, Greenplum Database mirroring must be used in the same way as with local storage, and no modifications may be made to the mirroring scheme or the recovery scheme of the segments.

Other features of the shared storage such as de-duplication and/or replication are not directly supported by Greenplum Database, but may be used with support of the storage vendor as long as they do not interfere with the expected operation of Greenplum Database at the discretion of VMware.

Greenplum Database can be deployed to virtualized systems only if the storage is presented as block devices and the XFS file system is mounted for the storage of the segment directories.

Greenplum Database is supported on Amazon Web Services (AWS) servers using either Amazon instance store (Amazon uses the volume names ephemeral[0-23]) or Amazon Elastic Block Store (Amazon EBS) storage. If using Amazon EBS storage the storage should be RAID of Amazon EBS volumes and mounted with the XFS file system for it to be a supported configuration.

Data Domain Boost (VMware Greenplum)

VMware Greenplum 6 supports Data Domain Boost for backup on Red Hat Enterprise Linux. This table lists the versions of Data Domain Boost SDK and DDOS supported by VMware Greenplum 6.

VMware Greenplum Data Domain Boost DDOS
6.x 3.3 6.1 (all versions), 6.0 (all versions)
Note

In addition to the DDOS versions listed in the previous table, VMware Greenplum supports all minor patch releases (fourth digit releases) later than the certified version.

VMware Greenplum Tools and Extensions Compatibility

Client Tools

VMware releases a Clients tool package on various platforms that can be used to access Greenplum Database from a client system. The Greenplum 7 Clients tool package is supported on the following platforms:

  • Red Hat Enterprise Linux x86_64 8.x (RHEL 8)
  • Oracle Linux 64-bit 8, using the Red Hat Compatible Kernel (RHCK)
  • Rocky Linux 8
  • Windows 10 (64-bit)
  • Windows 8 (64-bit)
  • Windows Server 2012 (64-bit)
  • Windows Server 2012 R2 (64-bit)
  • Windows Server 2008 R2 (64-bit)

The Greenplum 7 Clients package includes the client and loader programs plus database/role/language commands and the Greenplum Streaming Server command utilities. Refer to Greenplum Client and Loader Tools Package for installation and usage details of the Greenplum 7 Client tools.

Extensions

This table lists the versions of the Greenplum Extensions that are compatible with this release of Greenplum Database 6.

Greenplum Extensions Compatibility
Component Package Version Additional Information
PL/Java 2.0.4 Supports Java 8 and 11.
Python Data Science Module Package 2.0.2  
PL/R 3.0.3 (CentOS) R 3.3.3

(Ubuntu) You install R 3.5.1+.

R Data Science Library Package 2.0.2  
PL/Container 2.1.2  
PL/Container Image for R 2.1.2 R 3.6.3
PL/Container Images for Python 2.1.2 Python 2.7.12

Python 3.7

PL/Container Beta 3.0.0-beta  
PL/Container Beta Image for R 3.0.0-beta R 3.4.4
GreenplumR 1.1.0 Supports R 3.6+.
MADlib Machine Learning 1.19, 1.18, 1.17, 1.16 Support matrix at MADlib FAQ.
PostGIS Spatial and Geographic Objects 2.5.4+pivotal.7.build.1, 2.1.5+pivotal.3.build.3  

For information about the Oracle Compatibility Functions, see Oracle Compatibility Functions.

These Greenplum Database extensions are installed with Greenplum Database

  • Fuzzy String Match Extension
  • PL/Python Extension
  • pgcrypto Extension

Data Connectors

  • Greenplum Platform Extension Framework (PXF) - PXF provides access to Hadoop, object store, and SQL external data stores. Refer to Accessing External Data with PXF in the Greenplum Database Administrator Guide for PXF configuration and usage information.

    Note

    VMware Greenplum Database versions starting with 6.19.0 no longer bundle a version of PXF. You can install PXF in your Greenplum cluster by installing the independent distribution of PXF as described in the PXF documentation.

  • Greenplum Streaming Server v1.5.3 - The VMware Greenplum Streaming Server is an ETL tool that provides high speed, parallel data transfer from Informatica, Kafka, Apache NiFi and custom client data sources to a VMware Greenplum cluster. Refer to the VMware Greenplum Streaming Server Documentation for more information about this feature.
  • Greenplum Streaming Server Kafka integration - The Kafka integration provides high speed, parallel data transfer from a Kafka cluster to a Greenplum Database cluster for batch and streaming ETL operations. It requires Kafka version 0.11 or newer for exactly-once delivery assurance. Refer to the VMware Greenplum Streaming Server Documentation for more information about this feature.
  • Greenplum Connector for Apache Spark v1.6.2 - The VMware Greenplum Connector for Apache Spark supports high speed, parallel data transfer between Greenplum and an Apache Spark cluster using Spark’s Scala API.
  • Greenplum Connector for Apache NiFi v1.0.0 - The VMware Greenplum Connector for Apache NiFi enables you to set up a NiFi dataflow to load record-oriented data from any source into Greenplum Database.
  • Greenplum Informatica Connector v1.0.5 - The VMware Greenplum Connector for Informatica supports high speed data transfer from an Informatica PowerCenter cluster to a VMware Greenplum cluster for batch and streaming ETL operations.
  • Progress DataDirect JDBC Drivers v5.1.4+275, v6.0.0+181 - The Progress DataDirect JDBC drivers are compliant with the Type 4 architecture, but provide advanced features that define them as Type 5 drivers.
  • Progress DataDirect ODBC Drivers v7.1.6+7.16.389 - The Progress DataDirect ODBC drivers enable third party applications to connect via a common interface to the VMware Greenplum system.
  • R2B X-LOG v5.x and v6.x - Real-time data replication solution that achieves high-speed database replication through the use of Redo Log Capturing method.
Note

Greenplum 5.x clients (gpload, gpfdist) are supported with Greenplum 6.x Server and Informatica PowerCenter and PowerExchange 10.4.

VMware Greenplum 6 does not support the ODBC driver for Cognos Analytics V11.

Connecting to IBM Cognos software with an ODBC driver is not supported. Greenplum Database supports connecting to IBM Cognos software with the DataDirect JDBC driver for VMware Greenplum. This driver is available as a download from VMware Tanzu Network.

VMware Greenplum Text

VMware Greenplum 6.0 through 6.4 are compatible with VMware Greenplum Text 3.3.1 through 3.4.1. VMware Greenplum 6.5 and later are compatible with VMware Greenplum Text 3.4.2 and later. See the Greenplum Text documentation for additional compatibility information.

Greenplum Command Center

VMware Greenplum 6.15 is compatible only with VMware Greenplum Command Center 6.4.0 and later. See the Greenplum Command Center documentation for additional compatibility information.

Hadoop Distributions

Greenplum Database provides access to HDFS with the Greenplum Platform Extension Framework (PXF).

PXF can use Cloudera, Hortonworks Data Platform, MapR, and generic Apache Hadoop distributions. PXF bundles all of the JAR files on which it depends, including the following Hadoop libraries:

PXF Version Hadoop Version Hive Server Version HBase Server Version
6.x, 5.15.x, 5.14.0, 5.13.0, 5.12.0, 5.11.1, 5.10.1 2.x, 3.1+ 1.x, 2.x, 3.1+ 1.3.2
5.8.2 2.x 1.x 1.3.2
5.8.1 2.x 1.x 1.3.2
Note

If you plan to access JSON format data stored in a Cloudera Hadoop cluster, PXF requires a Cloudera version 5.8 or later Hadoop distribution.

check-circle-line exclamation-circle-line close-line
Scroll to top icon