Using the Hudi File Format

Impala supports Read Optimized Queries on Hudi tables as an experimental feature. Read Optimized Queries treat the latest table snapshot as the commit or compaction action, expose only base/columnar files in latest file slices, and guarantee the same columnar query performance as a non-Hudi columnar table.

Table 1. Hudi Format Support in Impala
File TypeFormatCompression CodecsImpala Can CREATE?Impala Can INSERT?
HudiStructuredSnappy, gzip, zstd, lz4; currently Snappy by defaultYes, support for Read Optimized Queries is experimental.No. Create an external table in Impala. Set the table location to the Hudi table directory. Alternatively, create the Hudi table in Hive.

Parent topic: How Impala Works with Hadoop File Formats