5.22. TPCH Connector

The TPCH connector provides a set of schemas to support the TPCBenchmark™ H (TPC-H). TPC-H is a database benchmark used to measure theperformance of highly-complex decision support databases.

This connector can also be used to test the capabilities and querysyntax of Presto without configuring access to an external datasource. When you query a TPCH schema, the connector generates thedata on the fly using a deterministic algorithm.

Configuration

To configure the TPCH connector, create a catalog properties fileetc/catalog/tpch.properties with the following contents:

  1. connector.name=tpch

TPCH Schemas

The TPCH connector supplies several schemas:

  1. SHOW SCHEMAS FROM tpch;

  1. Schema

information_schema
sf1
sf100
sf1000
sf10000
sf100000
sf300
sf3000
sf30000
tiny
(11 rows)

Ignore the standard schema information_schema which exists in everycatalog and is not directly provided by the TPCH connector.

Every TPCH schema provides the same set of tables. Some tables areidentical in all schemas. Other tables vary based on the _scale factor_which is determined based on the schema name. For example, the schemasf1 corresponds to scale factor 1 and the schema sf300corresponds to scale factor 300. The TPCH connector provides aninfinite number of schemas for any scale factor, not just the few commonones listed by SHOW SCHEMAS. The tiny schema is an alias for scalefactor 0.01, which is a very small data set useful for testing.