GitHub

jrlasak/databricks_optimization_techniques

Delta Lake Optimization Project: Hands‑on lab to explore partitioning, Z‑Ordering, compaction (manual & auto), Liquid Clustering, and VACUUM using a synthetic sales dataset in Databricks. Includes a step‑by‑step notebook to measure file scans, bytes read, and query performance for each optimization.

Project

Owner
jrlasak
City
Warszawa
Language
Python

Ranking metrics

Snapshot:

Stars
19
New stars
2

GitHub badge

Polish Repo badge [![Polish Repo badge](https://maciej-ciemborowicz.eu/polish-open-source-rank/badges/repositories/github/jrlasak/databricks_optimization_techniques.svg)](https://maciej-ciemborowicz.eu/polish-open-source-rank/latest)