You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[SPARK-37627][SQL] Add sorted column in BucketTransform
### What changes were proposed in this pull request?
In V1, we can create table with sorted bucket like the following:
```
sql("CREATE TABLE tbl(a INT, b INT) USING parquet " +
"CLUSTERED BY (a) SORTED BY (b) INTO 5 BUCKETS")
```
However, creating table with sorted bucket in V2 failed with Exception
`org.apache.spark.sql.AnalysisException: Cannot convert bucketing with sort columns to a transform.`
### Why are the changes needed?
This PR adds sorted column in BucketTransform so we can create table in V2 with sorted bucket
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
new UT
Closesapache#34879 from huaxingao/sortedBucket.
Authored-by: Huaxin Gao <[email protected]>
Signed-off-by: Wenchen Fan <[email protected]>
0 commit comments