Prefer more equal signs before a break when splitting chained assignments #4010

henriholopainen · 2023-10-31T16:17:25Z

Description

This PR makes rhs processing prefer more equal signs before breaking the line.

Checklist - did you ...

Add an entry in CHANGES.md if necessary?
Add / update tests if necessary?

tests/data/cases/preview_prefer_rhs_split.py

github-actions · 2023-10-31T16:40:21Z

diff-shades results comparing this PR (4ad7c39) to main (be336bb). The full diff is available in the logs under the "Generate HTML diff report" step.

╭─────────────────────── Summary ────────────────────────╮
│ 2 projects & 19 files changed / 176 changes [+86/-90]  │
│                                                        │
│ ... out of 2 522 686 lines, 11 793 files & 23 projects │
╰────────────────────────────────────────────────────────╯

Differences found.

What is this? | Workflow run | diff-shades documentation

…on error

MichaReiser · 2023-11-02T06:35:35Z

Thanks

The following diff seems interesting

         elif constraint is None:
-            self.constraint_target = (
-                self.inferred_target_elements
-            ) = self.inferred_target_whereclause = None
+            self.constraint_target = self.inferred_target_elements = (
+                self.inferred_target_whereclause
+            ) = None

But I guess there's not much we can do about (we need the parentheses because it's python)

src/black/linegen.py

JelleZijlstra · 2023-11-03T04:35:44Z

I'm not quite sure about this. Looking at the diff-shades output, there's a few good ones:

-                            collected_attributes[name] = column_copies[
-                                obj
-                            ] = ret
+                            collected_attributes[name] = column_copies[obj] = (
+                                ret
+                            )

It's nicer to keep the subscript on one line.

-        self.selectable = (
-            self.persist_selectable
-        ) = self.local_table = selectable
+        self.selectable = self.persist_selectable = self.local_table = (
+            selectable
+        )

It's better to keep all the assignment targets next to each other, instead of randomly enclosing one in parentheses.

But there's also a lot of cases like the one @MichaReiser highlighted, where we just change which of the LHS assignment targets get parenthesized. In those cases, I don't think either formatting is clearly better; it looks ugly either way. I'd rather have Black not make a change there, because every time we change the formatting of existing code, that's disruptive to users. I'd want to make changes only if we believe the new formatting is better. Therefore, I'd prefer to stick with the current formatting in those cases.

Perhaps the rule could be that we split around the RHS if possible, and if that doesn't work, we preserve the current behavior and split around the second assignment target from the left.

henriholopainen · 2023-11-03T10:07:24Z

I'm not quite sure about this. Looking at the diff-shades output, there's a few good ones:
-                            collected_attributes[name] = column_copies[
-                                obj
-                            ] = ret
+                            collected_attributes[name] = column_copies[obj] = (
+                                ret
+                            )
It's nicer to keep the subscript on one line.
-        self.selectable = (
-            self.persist_selectable
-        ) = self.local_table = selectable
+        self.selectable = self.persist_selectable = self.local_table = (
+            selectable
+        )
It's better to keep all the assignment targets next to each other, instead of randomly enclosing one in parentheses.

But there's also a lot of cases like the one @MichaReiser highlighted, where we just change which of the LHS assignment targets get parenthesized. In those cases, I don't think either formatting is clearly better; it looks ugly either way. I'd rather have Black not make a change there, because every time we change the formatting of existing code, that's disruptive to users. I'd want to make changes only if we believe the new formatting is better. Therefore, I'd prefer to stick with the current formatting in those cases.

Perhaps the rule could be that we split around the RHS if possible, and if that doesn't work, we preserve the current behavior and split around the second assignment target from the left.

For me even @MichaReiser's example is better with the new formatting. Somehow it feels more natural when as many assignments as possible end up formatted as early as possible. Consider for example this (rather extreme) example:

self.selectable = self.persist_selectable = self.local_table = self.foo_bar = self.foo = self.longlonglong_key = self.another_long_long_long_key = self.yet_again_a_long_key = value

On current main it becomes:

self.selectable = (
    self.persist_selectable
) = (
    self.local_table
) = (
    self.foo_bar
) = (
    self.foo
) = (
    self.longlonglong_key
) = self.another_long_long_long_key = self.yet_again_a_long_key = value

and with this PR:

self.selectable = self.persist_selectable = self.local_table = self.foo_bar = (
    self.foo
) = self.longlonglong_key = self.another_long_long_long_key = (
    self.yet_again_a_long_key
) = value

I don't think a hybrid model would make sense here:

self.selectable = (
    self.persist_selectable
) = (
    self.local_table
) = (
    self.foo_bar
) = (
    self.foo
) =  self.longlonglong_key = self.another_long_long_long_key = self.yet_again_a_long_key = (
    value
)

I think there is some inherent value to consistency and easy to follow rules. Thus I feel like we should stick with either splitting early or late, but not make it depend on the case, and in general it seems splitting late >= splitting early. And while I do appreciate avoiding introducing unnecessary diffs, this is quite the corner case and doesn't touch that many lines of code.

All that being said, as a contributor I trust your view on what is good and desired. Also, even though intuitively I feel consistency should weigh in cases where two formatting options are both ugly, I'm not fully sure if it should be enough to justify changing the formatting.

JelleZijlstra · 2023-11-08T04:23:20Z

@hauntsaninja do you have any opinion here? I think this is mostly an improvement but I'm a bit hesitant to make style changes in existing code that aren't clear improvements.

hauntsaninja

I think it's an improvement! Splitting the one in the middle is not usually a choice a human would make.
Moreover, the change here isn't super common or controversial, so I think churn costs aren't too high.

JelleZijlstra · 2023-11-10T00:34:45Z

Sounds good! Unfortunately diff-shades failed again, I'll retry to see if it succeeds.

JelleZijlstra

Let's do it then, I'd like to see if I can get diff-shades to work first though.

JelleZijlstra · 2023-11-23T03:11:35Z

I think the diff-shades errors miraculously fixed themselves.

henriholopainen added 6 commits October 31, 2023 17:45

Add test case

6257416

Refactor: extract logic variables for debuggability/readability

ee5ac82

Add more complex test case

dde4513

Prefer more equal signs on lhs for chained assignments

4ea215b

Add changelog entry

ab5d18f

Update PR number

aa1dc6a

JelleZijlstra reviewed Oct 31, 2023

View reviewed changes

tests/data/cases/preview_prefer_rhs_split.py Show resolved Hide resolved

henriholopainen added 4 commits October 31, 2023 19:19

More test cases

c798816

Small refactor for easier to read logic

fa6cc4f

For chained assignments, don't fail splitting but use previous split …

b4f9466

…on error

Introduce Line.is_chained_assignment

3e10628

henriholopainen requested a review from JelleZijlstra October 31, 2023 17:36

henriholopainen mentioned this pull request Oct 31, 2023

Preview: Improved line breaks with trailing end of line comments #4006

Closed

JelleZijlstra reviewed Nov 3, 2023

View reviewed changes

src/black/linegen.py Outdated Show resolved Hide resolved

Wording

8c1749e

henriholopainen and others added 3 commits November 3, 2023 12:09

Merge branch 'main' into issue_4007

5e5e264

Merge branch 'main' into issue_4007

cb95398

Merge branch 'main' into issue_4007

a51f9b7

Merge branch 'main' into issue_4007

ec4d260

hauntsaninja approved these changes Nov 8, 2023

View reviewed changes

henriholopainen and others added 2 commits November 18, 2023 15:36

Merge branch 'main' into issue_4007

320cbb9

Merge branch 'main' into issue_4007

7d9c3ec

JelleZijlstra approved these changes Nov 18, 2023

View reviewed changes

Merge branch 'main' into issue_4007

f3cadbd

Merge branch 'main' into issue_4007

4ad7c39

JelleZijlstra merged commit fb5e5d2 into psf:main Nov 23, 2023
46 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prefer more equal signs before a break when splitting chained assignments #4010

Prefer more equal signs before a break when splitting chained assignments #4010

henriholopainen commented Oct 31, 2023 •

edited

github-actions bot commented Oct 31, 2023 •

edited

MichaReiser commented Nov 2, 2023 •

edited

JelleZijlstra commented Nov 3, 2023

henriholopainen commented Nov 3, 2023

JelleZijlstra commented Nov 8, 2023

hauntsaninja left a comment

JelleZijlstra commented Nov 10, 2023

JelleZijlstra left a comment

JelleZijlstra commented Nov 23, 2023

Prefer more equal signs before a break when splitting chained assignments #4010

Prefer more equal signs before a break when splitting chained assignments #4010

Conversation

henriholopainen commented Oct 31, 2023 • edited

Description

Checklist - did you ...

github-actions bot commented Oct 31, 2023 • edited

MichaReiser commented Nov 2, 2023 • edited

JelleZijlstra commented Nov 3, 2023

henriholopainen commented Nov 3, 2023

JelleZijlstra commented Nov 8, 2023

hauntsaninja left a comment

Choose a reason for hiding this comment

JelleZijlstra commented Nov 10, 2023

JelleZijlstra left a comment

Choose a reason for hiding this comment

JelleZijlstra commented Nov 23, 2023

henriholopainen commented Oct 31, 2023 •

edited

github-actions bot commented Oct 31, 2023 •

edited

MichaReiser commented Nov 2, 2023 •

edited