Your Greenplum Database deployment consists of a coordinator host, a standby coordinator host, and multiple segment hosts. After you configure the Greenplum Platform Extension Framework (PXF), you start a single PXF JVM process (PXF Service) on each Greenplum Database host.
PXF provides connectors to Hadoop, Hive, HBase, object stores, network file systems, and external SQL data stores. You must configure PXF to support the connectors that you plan to use.
To configure PXF, you must:
Install Java 8 or 11 on each Greenplum Database host as described in Installing Java for PXF. If your
JAVA_HOME is different from
/usr/java/default, you must inform PXF of the $JAVA_HOME setting by specifying its value in the
pxf-env.sh configuration file.
$PXF_BASE/conf/pxf-env.sh file on the Greenplum coordinator host.
gpadmin@coordinator$ vi /usr/local/pxf-gp6/conf/pxf-env.sh
JAVA_HOME setting in the
pxf-env.sh file, uncomment if necessary, and set it to your
$JAVA_HOME value. For example:
Register the PXF extension with Greenplum Database (see pxf cluster register). Run this command after your first installation of a PXF version 6.x, and/or after you upgrade your Greenplum Database installation:
gpadmin@coordinator$ pxf cluster register
If you plan to use the Hadoop, Hive, or HBase PXF connectors, you must perform the configuration procedure described in Configuring PXF Hadoop Connectors.
If you plan to use the PXF connectors to access the Azure, Google Cloud Storage, MinIO, or S3 object store(s), you must perform the configuration procedure described in Configuring Connectors to Azure, Google Cloud Storage, MinIO, and S3 Object Stores.
If you plan to use the PXF JDBC Connector to access an external SQL database, perform the configuration procedure described in Configuring the JDBC Connector.
If you plan to use PXF to access a network file system, perform the configuration procedure described in Configuring a PXF Network File System Server.
After making any configuration changes, synchronize the PXF configuration to all hosts in the cluster.
gpadmin@coordinator$ pxf cluster sync
After synchronizing PXF configuration changes, Start PXF.