SQL query optimization hints
Project description
## Introduction This API will help you optimize your sql queries for better performance.
## How to use Create an optimizer object specific to a single (database, schema) pair e.g. optimizer = get_optimizer(“presto”, schema)
Get optimization hints using optimize_query(query) e.g. presto_op.optimize_query(query)
- Initial Optimization Checks
Using approximate algorithms (approx_distinct() instead of COUNT(DISTINCT …))
Selecting the columns the user wants explicitly, rather than using (SELECT *)
Filtering on partitioned columns
Try to extract nested subqueries using a WITH clause.
Suggest filtering on most effective columns, by parquet-file ordering
Eliminate filtering overhead for partitioned columns
- Other Stuff
Replace UNION with UNION ALL if duplicates do not need to be removed
Aggregate a series of LIKE clauses into one regexp_like expression
Push down a complex join condition into a sub query
Specify GROUP BY targets with numbers for expressions
### Testing To run unit tests, run py.test (or py.test -s to see stdout) in the tests directory.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file user-query-optimizer-0.1.3.tar.gz
.
File metadata
- Download URL: user-query-optimizer-0.1.3.tar.gz
- Upload date:
- Size: 12.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: Python-urllib/2.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6edf745ba98b12d75d588dd5dccee15a89b285ba816bf2645050f3a3aface87a |
|
MD5 | bc28d0959ad1b30bd43525c2b32db9e5 |
|
BLAKE2b-256 | fcdc088a2a7d6bb81d1a573d3947be9da3fc739999e08db4ff950caeabe1b2e9 |