[Spark] support accepting clusterBy from both v1 and v2 dataframe writers #7060

Open
mwc360 wants to merge 1 commit into delta-io:master from mwc360:mcole_fix_clusterBy_4823

mwc360 commented Jun 19, 2026

Contributor

Closes #4823

Which Delta project/connector is this regarding?

  • Spark
  • Standalone
  • Flink
  • Kernel
  • Other (fill in here)

Description

Resolves #4823: mapping `clusterBy` from the DataFrame writers was not supported. This change accepts `clusterBy` from both the v1 and v2 DataFrame writers.
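For context, the two writer paths in question look roughly like this from PySpark. This is a minimal sketch, not code from the patch: the helper names `write_clustered_v1` and `write_clustered_v2` are hypothetical, and it assumes PySpark 4.0 or later, where `clusterBy` exists on both `DataFrameWriter` and `DataFrameWriterV2`, with the Delta Lake connector configured on the session.

```python
def write_clustered_v1(df, table_name, *cluster_cols):
    """v1 path (hypothetical helper): DataFrameWriter, reached via df.write."""
    (
        df.write
        .format("delta")
        .clusterBy(*cluster_cols)   # clustering columns requested on the v1 writer
        .saveAsTable(table_name)
    )


def write_clustered_v2(df, table_name, *cluster_cols):
    """v2 path (hypothetical helper): DataFrameWriterV2, reached via df.writeTo."""
    (
        df.writeTo(table_name)
        .using("delta")
        .clusterBy(*cluster_cols)   # clustering columns requested on the v2 writer
        .create()
    )
```

Either helper would be called with an existing DataFrame, for example `write_clustered_v1(df, "events", "event_date")`, where the table and column names are placeholders.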

How was this patch tested?

Added new tests that verify `clusterBy` enables Liquid Clustering on a newly created table, and that `clusterBy` is allowed on non-create operations as long as the clustering keys match those of the existing table.
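The rule those tests exercise can be sketched in plain Python. This is an illustration only, not the patch's implementation: `check_cluster_by` is a hypothetical name, and treating a "match" as an exact, order-sensitive comparison of column names is an assumption.

```python
def check_cluster_by(existing_keys, requested_keys, is_create):
    """Return the clustering keys to use, or raise if they conflict.

    Hypothetical helper. Assumption: keys "match" only when the column
    names are identical and in the same order.
    """
    requested = list(requested_keys)
    if is_create:
        # New table: the requested keys become the table's clustering keys.
        return requested
    existing = list(existing_keys)
    if requested != existing:
        # Non-create operation (e.g. append): reject a different key set.
        raise ValueError(
            f"clusterBy columns {requested} do not match "
            f"the table's clustering columns {existing}"
        )
    return existing
```

Under these assumptions, an append with the same keys passes through unchanged, while an append that names different keys fails fast instead of silently altering the table's clustering.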

Does this PR introduce any user-facing changes?

No, only new support.

mwc360 commented Jun 23, 2026

Contributor Author

@timothyw553 - could you trigger CI on this one? This is another gap that would be nice to address.

Development

Successfully merging this pull request may close these issues.

[BUG] Pyspark 4.0 clusterBy raises strange errors Partitioning by expressions