Clustered Tables

Clustering or Bucketing i a data organization tecnique (like partition) for decomposing large datasets into more manageable ports.

Fundamental of it is that when table is bucketed in a column, then all the records with same column name will go to the same bucket.

Example: salaries DB

Bucketing on salary column.

A bucket will contain people with same salary (purple box) and nowhere else

Note: a bucket can contain multiple salaries values (orange box))

"alt"

Do and Don'ts

==section 8==