5.23. TPCH Connector

The TPCH connector provides a set of schemas to support the TPCBenchmark™ H (TPC-H). TPC-H is a database benchmark used to measure theperformance of highly-complex decision support databases.

This connector can also be used to test the capabilities and querysyntax of Presto without configuring access to an external datasource. When you query a TPCH schema, the connector generates thedata on the fly using a deterministic algorithm.

Configuration

To configure the TPCH connector, create a catalog properties fileetc/catalog/tpch.properties with the following contents:

  1. connector.name=tpch

TPCH Schemas

The TPCH connector supplies several schemas:

  1. SHOW SCHEMAS FROM tpch;

  1. Schema

information_schema sf1 sf100 sf1000 sf10000 sf100000 sf300 sf3000 sf30000 tiny(11 rows)

Ignore the standard schema information_schema which exists in everycatalog and is not directly provided by the TPCH connector.

Every TPCH schema provides the same set of tables. Some tables areidentical in all schemas. Other tables vary based on the _scale factor_which is determined based on the schema name. For example, the schemasf1 corresponds to scale factor 1 and the schema sf300corresponds to scale factor 300. The TPCH connector provides aninfinite number of schemas for any scale factor, not just the few commonones listed by SHOW SCHEMAS. The tiny schema is an alias for scalefactor 0.01, which is a very small data set useful for testing.