Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spark: DVs + Positional Deletes #11675

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

nastra
Copy link
Contributor

@nastra nastra commented Nov 29, 2024

this is part of #11122

@nastra nastra force-pushed the position-deletes-table branch 5 times, most recently from a52f450 to 221a485 Compare December 6, 2024 08:17
@nastra nastra force-pushed the position-deletes-table branch 2 times, most recently from 283145d to 36e65b2 Compare December 9, 2024 09:51
@github-actions github-actions bot added the docs label Dec 9, 2024
@nastra nastra force-pushed the position-deletes-table branch 3 times, most recently from 782aa62 to ffb3824 Compare December 16, 2024 11:15
@nastra nastra force-pushed the position-deletes-table branch from ffb3824 to b9adfba Compare December 16, 2024 11:31
Copy link

This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the [email protected] list. Thank you for your contributions.

@github-actions github-actions bot added the stale label Jan 17, 2025
Schema positionDeleteRowSchema = positionDeleteRowSchema();
StructType deleteSparkType = deleteSparkType();
StructType deleteSparkTypeWithoutRow = deleteSparkTypeWithoutRow();
if (TableUtil.formatVersion(underlyingTable(table)) >= 3) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think minor compaction for DVs would look quite different from the existing logic. At the very least, we don't want to keep separate jobs per partition. I'd say we should throw an exception right now if the action is called on V3 tables until we figure out a good way to perform minor compaction for DVs.

@github-actions github-actions bot removed the stale label Jan 22, 2025
@nastra nastra marked this pull request as draft January 22, 2025 08:23
@nastra nastra force-pushed the position-deletes-table branch from b9adfba to f9f58e8 Compare January 24, 2025 10:06
@nastra nastra changed the title Core, Spark: Include content offset/size in PositionDeletesTable Spark: DVs + Positional Deletes Jan 24, 2025
@nastra nastra force-pushed the position-deletes-table branch from f9f58e8 to 27766cb Compare January 24, 2025 10:07
@nastra nastra force-pushed the position-deletes-table branch from 27766cb to 67d45e8 Compare January 30, 2025 11:28
@nastra nastra closed this Jan 30, 2025
@nastra nastra reopened this Jan 30, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants