perf!: don't look for indent/dedent if comment is at current indent level #244

llllvvuu · 2023-09-22T05:43:22Z

Whether an indent/dedent token is generated for a comment depends on
whether the indent/dedent persists after the comment, i.e. whether the
next non-comment line is still at the new indentation. Checking this
takes O(comment_length). Repeating the check for every comment line
takes O(comment_length * comment_lines), which is quadratic in the size
of the comment block.
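
To make the quadratic bound concrete, here is a toy cost model in Python
(not the scanner's actual C code). It assumes that a check started at one
comment line reads that comment and every comment after it before reaching
the next line of code. `chars_scanned_naive` is a hypothetical helper name
used only for this illustration.

```python
def chars_scanned_naive(comment_lines):
    """Count characters read if every comment line scans to the end of the block.

    Assumption: a check started at comment i reads comment i and every
    comment after it before it can see the next line of code.
    """
    total = 0
    for i in range(len(comment_lines)):
        for later in comment_lines[i:]:
            total += len(later) + 1  # +1 for the newline
    return total


block = ["# comment"] * 1000
doubled = ["# comment"] * 2000

# Doubling the number of comment lines roughly quadruples the work.
ratio = chars_scanned_naive(doubled) / chars_scanned_naive(block)
print(f"{ratio:.2f}")
```

Under this model the total grows with the square of the number of comment
lines, which matches the O(comment_length * comment_lines) estimate.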

An optimization was made in a901729 which skips this check when there
is no possible indent/dedent, such as in the following:

# comment 1...
# comment 1000
print("foo")

However, the check cannot be skipped in the following case:

class Foo:
    def foo():
        print("bar")
    # comment 1...
    # comment 1000
    def bar():
        print("foo")

which comes up when commenting out code.

This PR optimizes this case by skipping the check if the comment is at the
current indent length. In the above example, # comment 1 looks ahead
to def bar() and changes the indent length to 4, after which comments
2-1000 do not look ahead, since they are also at indent length 4.

This PR does not optimize the following case:

class Foo:
    def foo():
        print("bar")
    # comment 1...
    # comment 1000
        print("foo")

but it does optimize:

class Foo:
    def foo():
        print("bar")
        # comment 1...
        # comment 1000
        print("foo")
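
The three `class Foo` examples above can be compared with a small Python
sketch. It is a simplified model of the rule described in this PR, not the
scanner's C implementation: it tracks a single "current indent length",
treats a look-ahead as "find the next non-comment line and adopt its
indent", and counts how many comments trigger one. The names
`count_lookaheads` and `example` are hypothetical, and
`skip_at_current_indent=False` simply means "always look ahead"; it does
not model the earlier a901729 optimization.

```python
def count_lookaheads(lines, skip_at_current_indent=True):
    """Count how many comment lines need to look ahead to the next code line."""
    current_indent = 0
    lookaheads = 0
    for i, line in enumerate(lines):
        text = line.lstrip(" ")
        if not text:
            continue  # blank lines do not affect this model
        indent = len(line) - len(text)
        if not text.startswith("#"):
            current_indent = indent  # a code line sets the current indent
            continue
        if skip_at_current_indent and indent == current_indent:
            continue  # comment is at the current indent: no look-ahead
        lookaheads += 1
        # Look ahead past the remaining comments to the next code line.
        for later in lines[i + 1:]:
            later_text = later.lstrip(" ")
            if later_text and not later_text.startswith("#"):
                current_indent = len(later) - len(later_text)
                break
    return lookaheads


def example(comment_indent, tail):
    """Build one of the class Foo examples with 1000 comment lines."""
    head = ["class Foo:", "    def foo():", '        print("bar")']
    comments = [" " * comment_indent + f"# comment {n}" for n in range(1, 1001)]
    return head + comments + tail


commented_out = example(4, ["    def bar():", '        print("foo")'])
not_optimized = example(4, ['        print("foo")'])
same_level = example(8, ['        print("foo")'])

for name, lines in [("commented_out", commented_out),
                    ("not_optimized", not_optimized),
                    ("same_level", same_level)]:
    print(name, count_lookaheads(lines))
```

In this model the commented-out-code case needs one look-ahead, the
unoptimized case still needs one per comment, and the same-level case
needs none.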

BREAKING CHANGE: In the following scenario, we previously generated an
indent token before comment 1; we now generate it before comment 2.
This is more consistent with how dedents are handled.

def foo():
# comment 1
   # comment 2
   print("bar")

Related: nvim-treesitter/nvim-treesitter#4839

ahlinc · 2023-09-27T08:19:41Z

What's interesting is that before commit 188b6b06 there seems to have been no such problem with comments, so maybe it is worth looking for an alternative solution and trying to remove comments from externals.

llllvvuu force-pushed the wip/double_comment_2 branch 2 times, most recently from 73ab0ae to b452dd3 on September 22, 2023 05:45

This was referenced Sep 22, 2023

Slow performance on mid-sided Python file when typing just after a comment block nvim-treesitter/nvim-treesitter#4839 (Open)

WIP: Remove redundant indentation check of comment #243 (Closed)

llllvvuu changed the title ~~WIP!: alternate solution for redundant comment scan~~ WIP!: simpler solution for redundant comment scan Sep 22, 2023

llllvvuu mentioned this pull request Sep 22, 2023

feature: expose current byte/char in TSLexer for external scanner tree-sitter/tree-sitter#2644 (Closed)

llllvvuu force-pushed the wip/double_comment_2 branch from b452dd3 to 851bcf6 on September 22, 2023 22:28

llllvvuu changed the title ~~WIP!: simpler solution for redundant comment scan~~ perf!: don't look for indent/dedent if comment is at current indent level Sep 22, 2023

llllvvuu force-pushed the wip/double_comment_2 branch 2 times, most recently from b3e8db5 to 34108b4 on September 22, 2023 22:32

llllvvuu marked this pull request as ready for review September 22, 2023 22:33

llllvvuu force-pushed the wip/double_comment_2 branch 2 times, most recently from c042932 to 9ff33c8 on September 22, 2023 22:38

llllvvuu force-pushed the wip/double_comment_2 branch from 9ff33c8 to 3511972 on September 22, 2023 22:52

amaanq force-pushed the master branch from 777ecdb to 06958d6 on November 15, 2023 07:42
