gpmt gpstatscheck

This tool can be used to verify if all tables involved in a query have optimal statistics. When a query is running slower than expected, it is possible that outdated or invalid statistics on the tables involved in the query are causing the slowness. This can happen if new data has been loaded into the table but analyze was never run, so the database is using wrong statistics when generating the query plan.

Usage

  1. gpmt gpstatscheck -f <query-file>
  2. [-p <port>] [-d <database>]

Options

-f

File containing the query to analyze.

-p

Database port.

-d

Database where the query is being run.

Examples

Run the query in query1.sql in database Postgres and check for missing stats.

  1. gpmt gpstatscheck -f query1.sql -d postgres

If invalid statistics are detected the tool will generate a script listing the suggested commands to run. For example:

  1. $ cat gpstatscheck_20160926_134946.sql
  2. ANALYZE public.nums;

You can then run the provided script against the affected database:

  1. $ psql -p 5432 -d postgres -f gpstatscheck_20160926_134946.sql