Speed up ESLint #16962

Closed
mdjermanovic opened this issue Mar 5, 2023 · 11 comments
Labels

  • accepted: There is consensus among the team that this change meets the criteria for inclusion
  • archived due to age: This issue has been archived; please open a new issue for any further discussion
  • core: Relates to ESLint's core APIs and features

Comments

@mdjermanovic
Member

This is an issue to define tasks to improve ESLint performance per the recommendations from the "Speeding up the JavaScript ecosystem - eslint" blog post:

https://marvinh.dev/blog/speeding-up-javascript-ecosystem-part-3/

I was able to extract five recommendations from the blog post that relate to eslint core or eslint dependencies. Please add more if I missed something.

  1. Token store's utils.search should use a binary search algorithm.

    We could implement our own, or find a library. In fact, this used to be a binary search before Chore: Remove lodash #14287. The performance impact of switching to Array#findIndex was discussed in Chore: Remove lodash #14287 (comment), but at the time performance tests did not show significant differences. Regardless, I think we should reintroduce binary search here.

  2. Refactor the code to avoid calling the mentioned utils.search / instantiating BackwardTokenCommentCursor millions of times.

    This suggestion requires further analysis. I'm not sure if the premise that we can avoid this because "we should know exactly where we are" applies here because BackwardTokenCommentCursor is used by methods that take an arbitrary node/token, such as SourceCode#getTokensBefore.

  3. Several points on improving esquery performance.

    This has already been implemented in Optimize hot code paths estools/esquery#134. Our Single File and Multi Files performance tests show ~8% overall performance improvement. 🚀

  4. Fast path for simple selectors ("Bailing out early" section in the blog post).

    The suggestion is to handle the simplest selectors in the form of "NodeType" manually, without using esquery. We could definitely give this a try.

  5. "Rethinking selectors" section in the blog post.

    I'm not sure what the recommendation is here. Is it to drop declarative selectors in favor of JS functions that would be provided by rules?
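
For reference, a from-scratch binary search along the lines of item 1 might look like the sketch below. The token shape (`range: [start, end]`) matches ESLint ASTs, but this is only an illustration, not the token store's actual code.

```javascript
// Sketch: locate the index of the first token whose range starts at or
// after `offset`, using binary search instead of a linear scan such as
// Array.prototype.findIndex. Tokens are assumed sorted by range start,
// as in an ESLint SourceCode token list.
function search(tokens, offset) {
    let lo = 0;
    let hi = tokens.length;

    while (lo < hi) {
        const mid = (lo + hi) >>> 1; // unsigned shift avoids overflow on huge arrays

        if (tokens[mid].range[0] < offset) {
            lo = mid + 1;
        } else {
            hi = mid;
        }
    }

    // Unlike findIndex, this returns tokens.length (not -1) when no
    // token starts at or after the offset.
    return lo;
}
```

For example, with tokens starting at offsets 0, 4, and 8, `search(tokens, 5)` returns 2, the same index a linear `findIndex` scan would produce, but in O(log n) comparisons.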

@mdjermanovic mdjermanovic added the core Relates to ESLint's core APIs and features label Mar 5, 2023
@kecrily
Member

kecrily commented Mar 6, 2023

I would like to invite the author of the article to participate in the discussion, ping @marvinhagemeister

@marvinhagemeister

Thanks for the ping. Most of the improvements relating to esquery can indeed be found in the linked PR estools/esquery#134 .

  1. Switching to a binary search is an easy win, albeit a small one.
  2. The problem with BackwardTokenCommentCursor is that it always starts the search from the beginning, if I recall correctly, similar to utils.search. Given that many files are composed of many tokens, that leads to a lot of misses.
  3. Yup, most things were included in that PR.
  4. I think adding a special path here for NodeType selectors is still worth it. Those were the most common ones I encountered.
  5. The main recommendation in that section of the blog post is to drop esquery in favor of JS functions. It's arguably a strong breaking change, so I'm not sure how feasible that is.

@nzakas
Member

nzakas commented Mar 7, 2023

Here's what I think the task list is, so we can just check these off as we go:

  • Update TokenStore utils.search() to use a binary search algorithm
  • Try to avoid BackwardTokenCommentCursor and utils.search()
  • Bail out early when matching simple node types instead of using esquery in NodeEventGenerator

The last recommendation, to drop selector syntax, would be a significant breaking change and not one we could introduce any time soon. I think a better approach is likely to investigate creating a tool that can take a rule that uses query strings and regenerate it so that it doesn't use query strings...or maybe something simpler like a tool that you can pass a bunch of query strings to and it will generate a rule scaffold for you. In any event, I think going the way of creating a tool to generate more-performant JS code instead of having selectors in the final JS would be a much more palatable choice.
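
As a rough illustration of the "bail out early" item in the task list above, a matcher factory could special-case selectors that are just a node type name. The regex and the `createMatcher`/`fallbackMatch` names are made up for this sketch; ESLint's NodeEventGenerator and esquery APIs differ.

```javascript
// Sketch: handle plain node-type selectors (e.g. "Identifier") with a
// string comparison, falling back to a full selector engine otherwise.
// A selector that is a bare ESTree node type name matches this pattern.
const SIMPLE_NODE_TYPE = /^[A-Z][A-Za-z]*$/u;

function createMatcher(selector, fallbackMatch) {
    if (SIMPLE_NODE_TYPE.test(selector)) {
        // Fast path: no selector parsing or engine dispatch needed.
        return node => node.type === selector;
    }

    // Slow path: delegate to a full engine such as esquery.
    return node => fallbackMatch(node, selector);
}
```

With such a split, the common case of a bare node-type selector reduces per-node work to a single property comparison.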

@mdjermanovic mdjermanovic added the accepted There is consensus among the team that this change meets the criteria for inclusion label Mar 8, 2023
nzakas added a commit that referenced this issue Mar 24, 2023
@nzakas
Member

nzakas commented Mar 24, 2023

I took a stab at the fast path for query selectors and didn't see any significant performance improvement, either in our standard perf test or just running ESLint on our own codebase.
#17019

@sam3k
Contributor

sam3k commented Mar 29, 2023

> Thanks for the ping. Most of the improvements relating to esquery can indeed be found in the linked PR estools/esquery#134 .
>
>   1. Switching to a binary search is an easy win, albeit a small one.
>   2. The problem with BackwardTokenCommentCursor is that it always starts the search from the beginning if I recall correctly, similar to utils.search. Given that many files are composed of many tokens, that leads to a lot of misses
>   3. Yup, most things were included in that PR
>   4. I think adding a special path here for NodeType selectors is still worth it. Those were the most common ones I encountered
>   5. Main recommendation there is to drop esquery in favor of JS functions in that section of the blog post. It's arguably a strong breaking change, so not sure how feasible that is.

These are some great performance improvements:

We microbenchmarked the effect of each of these changes, but to get a better idea of their real-life impact I tested the eslint-plugin-unicorn ESLint plugin on the codebase at my workplace. The plugin relies heavily on selectors. Linting times were as follows:

Before enabling eslint-plugin-unicorn: 14s (TypeScript codebase with type-aware linting rules enabled, and esquery optimizations didn't have any real impact here)

After enabling eslint-plugin-unicorn's recommended config without these optimizations: 23s

After enabling eslint-plugin-unicorn's recommended config with these optimizations: 18s

Cumulatively these optimizations cut down the overhead added by eslint-plugin-unicorn by more than 50%, at least in our setup. The biggest bang for the buck came from hoisting constants (2s reduction) and avoiding for-of transpilation (2s reduction).
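
The "hoisting constants" win mentioned above is the classic pattern of moving allocations out of hot per-node callbacks. A generic before/after sketch (not actual eslint-plugin-unicorn or esquery code):

```javascript
// Before: the Set is rebuilt on every call, so a rule visiting
// thousands of nodes pays the allocation cost thousands of times.
function isBannedSlow(node) {
    const banned = new Set(["eval", "arguments"]);
    return banned.has(node.name);
}

// After: the constant is hoisted to module scope and built once,
// then shared across all calls.
const BANNED = new Set(["eval", "arguments"]);

function isBannedFast(node) {
    return BANNED.has(node.name);
}
```

Both functions return the same results; only the allocation behavior differs, which is exactly the kind of change that shows up in hot paths rather than in microbenchmarks of a single call.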

@fasttime
Member

fasttime commented Apr 7, 2023

I went ahead and reimplemented utils.search with a binary search algorithm in #17066. The unit tests in tests/lib/source-code/token-store.js only use 10 tokens, which is not enough to make a noticeable difference in performance, and also not a realistic scenario for a typical use case. But in the perf tests, the overall impact of the function seems too negligible to be measured. Any ideas how this could be tested?

@mdjermanovic
Member Author

> I went ahead and reimplemented utils.search with a binary search algorithm in #17066. The unit tests in tests/lib/source-code/token-store.js only use 10 tokens, which is not enough to make a noticeable difference in performance, and also not a realistic scenario for a typical use case. But in the perf tests, the overall impact of the function seems too negligible to be measured. Any ideas how this could be tested?

That part of the blog post mentions JSDoc rules that ESLint uses when linting its own codebase (npm run lint). Maybe you could try this: set the TIMING=all env variable, then compare the times for jsdoc/* rules.

@fasttime
Member

fasttime commented Apr 7, 2023

> That part of the blog post mentions JSDoc rules that ESLint uses when linting its own codebase (npm run lint). Maybe you could try this: set TIMING=all env variable, then compare times for jsdoc/* rules.

Thanks @mdjermanovic, this method shows a clear performance improvement for jsdoc/* rules when the binary search is used. I've updated my PR to include these results.

@kkimdev

kkimdev commented Sep 7, 2023

I'm not familiar with the internals of ESLint, but I'm just wondering: can we execute different rules in parallel on different cores (or processes)? If this is feasible, it seems like easy, low-hanging fruit for speeding up ESLint.

@nzakas
Member

nzakas commented Sep 7, 2023

@kkimdev there's a long discussion about what can potentially be parallelized here:
#3565

Doing it per rule would likely slow things down, because a run sometimes involves thousands of rules and most of the time we're dealing with 4-8 cores.

@nzakas
Member

nzakas commented Sep 7, 2023

Closing this issue as the planned work has been completed.

@nzakas nzakas closed this as completed Sep 7, 2023
@eslint-github-bot eslint-github-bot bot locked and limited conversation to collaborators Mar 6, 2024
@eslint-github-bot eslint-github-bot bot added the archived due to age This issue has been archived; please open a new issue for any further discussion label Mar 6, 2024