Replace parallel condition/result vectors with single CaseWhen vector in Expr::Case #1733

lovasoa · 2025-02-20T16:31:14Z

The primary motivation for this change is to fix the visitor traversal order for CASE expressions. In SQL, a CASE expression has a fixed syntactic order (e.g., `CASE a WHEN 1 THEN 2 WHEN 3 THEN 4 ELSE 5 END`), and with this change AST visitors process its nodes in the order they appear in the source. The previous implementation, using separate `conditions` and `results` vectors, visited all conditions first and then all results, which didn't match the source order. The new `CaseWhen` structure ensures visitors process expressions in the correct order: `a,1,2,3,4,5`.
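To make the ordering difference concrete, here is a minimal sketch using a toy AST. The types `Expr`, `OldCase`, `NewCase`, and the helpers `visit_old` / `visit_new` are simplified stand-ins written for illustration; they are not sqlparser's actual definitions or its visitor API. It assumes a visitor that walks fields in declaration order.

```rust
// Toy model only: simplified stand-ins, not sqlparser's real AST or visitor.
#[derive(Debug)]
enum Expr {
    Ident(String),
    Number(i64),
}

// Old shape: conditions and results live in two parallel vectors.
struct OldCase {
    operand: Option<Expr>,
    conditions: Vec<Expr>,
    results: Vec<Expr>,
    else_result: Option<Expr>,
}

// New shape: each WHEN condition travels with its THEN result.
struct CaseWhen {
    condition: Expr,
    result: Expr,
}

struct NewCase {
    operand: Option<Expr>,
    conditions: Vec<CaseWhen>,
    else_result: Option<Expr>,
}

// Turn a leaf expression into a label so visit order can be compared.
fn label(e: &Expr) -> String {
    match e {
        Expr::Ident(s) => s.clone(),
        Expr::Number(n) => n.to_string(),
    }
}

// Field-by-field traversal: every condition is seen before any result.
fn visit_old(c: &OldCase) -> Vec<String> {
    c.operand
        .iter()
        .chain(c.conditions.iter())
        .chain(c.results.iter())
        .chain(c.else_result.iter())
        .map(label)
        .collect()
}

// Pairwise traversal: each condition is followed by its own result.
fn visit_new(c: &NewCase) -> Vec<String> {
    let mut out: Vec<String> = c.operand.iter().map(label).collect();
    for w in &c.conditions {
        out.push(label(&w.condition));
        out.push(label(&w.result));
    }
    out.extend(c.else_result.iter().map(label));
    out
}
```

For `CASE a WHEN 1 THEN 2 WHEN 3 THEN 4 ELSE 5 END`, this toy `visit_old` yields `a,1,3,2,4,5`, while `visit_new` yields the source order `a,1,2,3,4,5`.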

A secondary benefit is making invalid states unrepresentable in the type system. The previous implementation using parallel vectors (`conditions` and `results`) made it possible to create invalid CASE expressions where the number of conditions didn't match the number of results. When this happened, the `Display` implementation would silently drop elements from the longer list, potentially masking bugs. The new `CaseWhen` struct couples each condition with its result, making it impossible to create such mismatched states.
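The silent-drop behaviour can be sketched as follows. This is a toy model with strings standing in for expressions, and it assumes the old rendering paired the two vectors with something like `zip`; `OldCase`, `NewCase`, `render_old`, and `render_new` are hypothetical names, not sqlparser's real `Display` code.

```rust
// Toy model only: plain strings stand in for expressions.
struct OldCase {
    conditions: Vec<String>,
    results: Vec<String>,
}

struct CaseWhen {
    condition: String,
    result: String,
}

struct NewCase {
    conditions: Vec<CaseWhen>,
}

// `zip` stops at the shorter vector, so surplus entries vanish without an error.
fn render_old(c: &OldCase) -> String {
    let mut s = String::from("CASE");
    for (cond, res) in c.conditions.iter().zip(c.results.iter()) {
        s.push_str(&format!(" WHEN {} THEN {}", cond, res));
    }
    s.push_str(" END");
    s
}

// With paired fields there is no second vector that could get out of step.
fn render_new(c: &NewCase) -> String {
    let mut s = String::from("CASE");
    for w in &c.conditions {
        s.push_str(&format!(" WHEN {} THEN {}", w.condition, w.result));
    }
    s.push_str(" END");
    s
}
```

In this sketch, an `OldCase` with two conditions but only one result renders a single WHEN branch and gives no hint that the second condition was lost. A `NewCase` cannot be constructed in that state at all, because every `CaseWhen` must carry both fields.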

While this is a breaking change to the AST structure, sqlparser has a history of making such changes when they improve correctness. I don't expect much downstream breakage, and the benefits of correct visitor ordering and type safety are significant, so I think the trade-off is worthwhile.

lovasoa · 2025-02-20T16:38:59Z

fixes sqlpage/SQLPage#818

iffyio

LGTM! Thanks @lovasoa!
cc @alamb

lovasoa force-pushed the case_representation branch 2 times, most recently from 0503160 to afd8e6e Compare February 20, 2025 16:38

lovasoa force-pushed the case_representation branch from afd8e6e to 6b00c3b Compare February 20, 2025 16:39

lovasoa force-pushed the case_representation branch from 6b00c3b to 04732cf Compare February 21, 2025 16:45

iffyio approved these changes Feb 22, 2025

iffyio merged commit 72312ba into apache:main Feb 22, 2025
9 checks passed

lovasoa deleted the case_representation branch February 22, 2025 13:08
