11.47. Release 0.183

General Changes

  • Fix planning failure for queries that use GROUPING and contain aggregation expressions that require implicit coercions.
  • Fix planning failure for queries that contain a non-equi left join that is semantically equivalent to an inner join.
  • Fix issue where a query's reported memory may be higher than actual usage when an aggregation is followed by other non-trivial work in the same stage. This can lead to failures due to the query memory limit, or lower cluster throughput due to perceived insufficient memory.
  • Fix query failure for CHAR functions trim(), rtrim(), and substr() when the return value would have trailing spaces under VARCHAR semantics.
  • Fix formatting in EXPLAIN ANALYZE output.
  • Improve error message when a query contains an unsupported form of correlated subquery.
  • Improve performance of CAST(json_parse(…) AS …).
  • Add map_from_entries() and map_entries() functions.
  • Change spilling for aggregations to only occur when the cluster runs out of memory.
  • Remove the experimental.operator-memory-limit-before-spill config property and the operator_memory_limit_before_spill session property.
  • Allow configuring the amount of memory that can be used for merging spilled aggregation data from disk using the experimental.aggregation-operator-unspill-memory-limit config property or the aggregation_operator_unspill_memory_limit session property.
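
The two new map functions convert between a map and an array of key/value rows. The following Python sketch only models that conversion outside the engine: a dict stands in for a map, a list of tuples stands in for an array of rows, and the helper names simply mirror the SQL function names. Rejecting duplicate keys is an assumption of this sketch, not behavior stated in these notes.

```python
from typing import Dict, Hashable, List, Tuple, TypeVar

K = TypeVar("K", bound=Hashable)
V = TypeVar("V")


def map_entries(m: Dict[K, V]) -> List[Tuple[K, V]]:
    """Model of map_entries(): turn a map into a list of (key, value) entries."""
    return list(m.items())


def map_from_entries(entries: List[Tuple[K, V]]) -> Dict[K, V]:
    """Model of map_from_entries(): build a map from (key, value) entries."""
    result: Dict[K, V] = {}
    for key, value in entries:
        # Assumption of this sketch: a repeated key is treated as an error.
        if key in result:
            raise ValueError(f"duplicate key: {key!r}")
        result[key] = value
    return result
```

In this model the two helpers are inverses of each other for inputs without repeated keys, so `map_from_entries(map_entries(m))` gives back `m`.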

Web UI Changes

  • Add output rows, output size, written rows, and written size to the query detail page.

Hive Changes

  • Work around ORC-222, which results in invalid summary statistics in ORC or DWRF files when the input data contains invalid string data. Previously, this would usually cause the query to fail, but in rare cases it could cause wrong results by incorrectly skipping data based on the invalid statistics.
  • Fix issue where reported memory is lower than actual usage for table columns containing string values read from ORC or DWRF files. This can lead to high GC overhead or out-of-memory crash.
  • Improve error message for small ORC files that are completely corrupt or not actually ORC.
  • Add predicate pushdown for the hidden column "$path".
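
The benefit of pushing a predicate on "$path" down to the scan is that files whose path cannot match are never opened. The Python sketch below illustrates that idea only; how the Hive connector implements it is not described in these notes, and the function name, the in-memory `files` mapping, and the `opened` list are all hypothetical.

```python
from typing import Callable, Dict, Iterator, List, Tuple


def scan_with_path_pushdown(
    files: Dict[str, List[int]],
    path_predicate: Callable[[str], bool],
    opened: List[str],
) -> Iterator[Tuple[str, int]]:
    """Yield (path, value) rows, skipping whole files whose path fails the predicate."""
    for path, values in files.items():
        if not path_predicate(path):
            # The predicate is evaluated on the path alone,
            # so the file's rows are never read.
            continue
        opened.append(path)  # record which files were actually read
        for value in values:
            yield (path, value)
```

Without pushdown, the same filter would be applied row by row after every file had already been read.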

TPCH Changes

  • Add column statistics for schemas tiny and sf1.

TPCDS Changes

  • Add column statistics for schemas tiny and sf1.

SPI Changes

  • Map columns or values represented with ArrayBlock and InterleavedBlock are no longer supported. They must be represented as MapBlock or SingleMapBlock.
  • Extend column statistics with minimum and maximum values.
  • Replace nullsCount with nullsFraction in column statistics.
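
A connector that previously tracked an absolute null count needs a row count to derive a fraction. The Python sketch below shows one way that conversion could look, assuming the fraction is defined as null count divided by row count. The `ColumnStats` class and the `nulls_fraction` helper are hypothetical stand-ins, not the SPI types, and returning 0.0 for an empty table is a choice made for this sketch.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ColumnStats:
    """Hypothetical stand-in for column statistics; not the SPI class."""
    nulls_fraction: float
    min_value: Optional[float] = None
    max_value: Optional[float] = None


def nulls_fraction(nulls_count: int, row_count: int) -> float:
    """Convert an absolute null count into a fraction of all rows."""
    if row_count < 0 or nulls_count < 0 or nulls_count > row_count:
        raise ValueError("nulls_count must be between 0 and row_count")
    if row_count == 0:
        # Assumption of this sketch: an empty table has no nulls.
        return 0.0
    return nulls_count / row_count
```

For example, `ColumnStats(nulls_fraction(25, 100), min_value=0, max_value=9)` describes a column in which a quarter of the rows are null and the non-null values range from 0 to 9.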