Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Conditional overwrite into Delta Lake Table #3853

Open
datanikkthegreek opened this issue Feb 24, 2025 · 1 comment
Open

Conditional overwrite into Delta Lake Table #3853

datanikkthegreek opened this issue Feb 24, 2025 · 1 comment
Assignees
Labels
enhancement New feature or request p2 (backlog) Nice to have features

Comments

@datanikkthegreek
Copy link

Is your feature request related to a problem?

No

Describe the solution you'd like

Currently, the delta lake implementation in daft allows only a complete overwrite but not a conditional overwrite based on a predicate.

Delta on Spark supports this already for some time: https://delta.io/blog/delta-lake-replacewhere/

I also see that the rust delta lake implementation supports the "predicate" options https://delta-io.github.io/delta-rs/api/delta_writer/#deltalake.write_deltalake.

Have you considered this besides implementing delete, update and merge?

Describe alternatives you've considered

The current alternative would be transforming into a pyarrow and then use the python deltalake library

Additional Context

No response

Would you like to implement a fix?

No

@datanikkthegreek datanikkthegreek added enhancement New feature or request needs triage labels Feb 24, 2025
@datanikkthegreek datanikkthegreek changed the title Conditional overwrite Conditional overwrite into Delta Lake Table Feb 24, 2025
@universalmind303
Copy link
Contributor

@datanikkthegreek that seems like it'd be a great addition that should be relatively easy to add.

The only concern I have is that the predicate takes in a SQL string?? that may not be compatible with our SQL dialect. (IIRC, delta-io uses datafusion internally).

@universalmind303 universalmind303 added p2 (backlog) Nice to have features and removed needs triage labels Feb 26, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request p2 (backlog) Nice to have features
Projects
None yet
Development

No branches or pull requests

2 participants