Iceberg Connector

Overview

The Iceberg connector allows querying data stored in Iceberg tables.

Note

It is recommended to use Iceberg 0.9.0 or later.

Configuration

To configure the Iceberg connector, create a catalog properties file etc/catalog/iceberg.properties with the following contents, replacing the properties as appropriate:

  1. connector.name=iceberg
  2. hive.metastore.uri=hostname:port
  3. iceberg.catalog.type=hive

Configuration Properties

The following configuration properties are available:

Property Name

Description

hive.metastore.uri

The URI(s) of the Hive metastore.

iceberg.file-format

The storage file format for Iceberg tables.

iceberg.compression-codec

The compression codec to use when writing files.

iceberg.catalog.type

The catalog type for Iceberg tables.

iceberg.catalog.warehouse

The catalog warehouse root path for Iceberg tables.

iceberg.catalog.cached-catalog-num

The number of Iceberg catalogs to cache.

iceberg.hadoop.config.resources

The path(s) for Hadoop configuration resources.

hive.metastore.uri

The URI(s) of the Hive metastore to connect to using the Thrift protocol. If multiple URIs are provided, the first URI is used by default and the rest of the URIs are fallback metastores. This property is required. Example: thrift://192.0.2.3:9083 or thrift://192.0.2.3:9083,thrift://192.0.2.4:9083

iceberg.file-format

The storage file format for Iceberg tables. The available values are PARQUET and ORC.

The default is PARQUET.

iceberg.compression-codec

The compression codec to use when writing files. The available values are NONE, SNAPPY, GZIP, LZ4, and ZSTD.

The default is GZIP.

iceberg.catalog.type

The catalog type for Iceberg tables. The available values are hive and hadoop, corresponding to the catalogs in the Iceberg.

The default is hive.

iceberg.catalog.warehouse

The catalog warehouse root path for Iceberg tables. Example: hdfs://nn:8020/warehouse/path.

This property is required if the iceberg.catalog.type is hadoop. Otherwise, it will be ignored.

iceberg.catalog.cached-catalog-num

The number of Iceberg catalogs to cache.

The default is 10. This property is required if the iceberg.catalog.type is hadoop. Otherwise, it will be ignored.

iceberg.hadoop.config.resources

The path(s) for Hadoop configuration resources. Example: /etc/hadoop/conf/core-site.xml.

This property is required if the iceberg.catalog.type is hadoop. Otherwise, it will be ignored.