Determining AWS Redshift Spectrum Spend
01 June 2018
I’ve enjoyed using AWS Redshift Spectrum for quick-and-easy analysis of
fairly large-ish data sets using one of my favorite tools: SQL. However, as far as I could tell, there’s no way to see a
breakdown of Amazon Web Services Redshift Spectrum usage for your most recent billing cycle. You can, however, pull this
information from SVL_S3QUERY_SUMMARY
.
Assuming that the pricing is still $5 per terabyte scanned, you can run this query.
SELECT
SUBSTRING(querytxt, 1, 20) AS query_snippet,
s3.starttime,
s3_scanned_bytes,
(s3_scanned_bytes / (10^12) * 5) AS cost
FROM SVL_S3QUERY_SUMMARY s3 JOIN STL_QUERY q ON s3.query = q.query ORDER BY starttime DESC;
This will return a table something like this:
+-----------------+----------------------------+--------------------+----------------+
| query_snippet | starttime | s3_scanned_bytes | cost |
|-----------------+----------------------------+--------------------+----------------|
| select foo, bar,| 2018-06-01 19:32:11.362227 | 209502504899 | 1.0475125245 |
| select foo, bar,| 2018-06-01 19:08:41.76035 | 209502504899 | 1.0475125245 |
| select foo, bar,| 2018-06-01 03:26:47.284144 | 347280195825 | 1.73640097912 |
| select foo, bar,| 2018-05-31 21:34:25.677985 | 209502504899 | 1.0475125245 |
| select foo, bar,| 2018-05-31 21:27:14.332106 | 209502504899 | 1.0475125245 |
+-----------------+----------------------------+--------------------+----------------+
Was this tip useful? Leave a comment and let me know!