5.22. TPCDS Connector

The TPCDS connector provides a set of schemas to support the TPCBenchmark™ DS (TPC-DS). TPC-DS is a database benchmark used to measure theperformance of complex decision support databases.

This connector can also be used to test the capabilities and querysyntax of Presto without configuring access to an external datasource. When you query a TPCDS schema, the connector generates thedata on the fly using a deterministic algorithm.

Configuration

To configure the TPCDS connector, create a catalog properties fileetc/catalog/tpcds.properties with the following contents:

  1. connector.name=tpcds

TPCDS Schemas

The TPCDS connector supplies several schemas:

  1. SHOW SCHEMAS FROM tpcds;

  1. Schema

information_schema sf1 sf10 sf100 sf1000 sf10000 sf100000 sf300 sf3000 sf30000 tiny(11 rows)

Ignore the standard schema information_schema which exists in everycatalog and is not directly provided by the TPCDS connector.

Every TPCDS schema provides the same set of tables. Some tables areidentical in all schemas. The scale factor of the tables in a particularschema is determined from the schema name. For example, the schemasf1 corresponds to scale factor 1 and the schema sf300corresponds to scale factor 300. Every unit in the scale factorcorresponds to a gigabyte of data. For example, for scale factor 300,a total of 300 gigabytes will be generated. The tiny schema is analias for scale factor 0.01, which is a very small data set useful fortesting.