Performance - Drastically improve worst case regex performance #51

adam-arthur · 2023-01-16T10:16:18Z

Problem:
Very poor performance when testing certain regexes due to unnecessary conversion to/from string/array.

Impact:
Some of the reporters in Vitest that depend on this code are extremely slow. (See: vitest-dev/vitest#2602)

Notes:
While time complexity of both operations is theoretically the same, seems the VM is optimizing string specific operations much better than the conversion back/forth.

Future:
In this case can run a single global regex on the entire string up front rather than re-allocating a new substring at each index. But for now this is simple and solves the issue.

adam-arthur · 2023-01-16T10:17:10Z

Please see for context:
vitest-dev/vitest#2602

adam-arthur · 2023-01-16T10:25:02Z

CC @sindresorhus

sindresorhus · 2023-01-17T08:15:55Z

index.js

@@ -42,7 +42,7 @@ const wrapWord = (rows, word, columns) => {

 		if (ESCAPES.has(character)) {
 			isInsideEscape = true;
-			isInsideLinkEscape = characters.slice(index + 1).join('').startsWith(ANSI_ESCAPE_LINK);
+			isInsideLinkEscape = word.slice(index + 1).startsWith(ANSI_ESCAPE_LINK);


.slice() operates on code units while the [...word] splits on codepoints. That's a subtle difference.

> '🌍'.slice(0, 2).length < 2 > [...'🌍'].length < 1

Good catch, not too familiar with unicode/string particulars.

Will add a test to this effect and update PR

Converting from array to string repeatedly in a loop leads to very poor performance in some contexts.

adam-arthur · 2023-01-18T03:18:16Z

Updated PR to respect unicode character lengths

sindresorhus · 2023-01-20T08:55:35Z

Will add a test to this effect and update PR

Missing the test

moosemanf · 2023-05-04T14:36:03Z

I would argue the test is "missing" bc basically every other test confirms the correct behaviour, isn't it? @sindresorhus

gtm-nayan · 2023-05-27T14:05:58Z

My profiler shows me that a significant amount of time is also spent in the ansiRegex() call inside strip-ansi. stripAnsi is being called for every "word" and each time it creates a new regex object. Might want to looks there as well, creating the regex in the module scope and then resetting its lastIndex before each use sounds like a fairly low hanging fruit.

gtm-nayan · 2023-06-02T00:24:42Z

Completely tangential but @adam-arthur can you please tell me what tool you're using for the per-line timings in #51 (comment) ?

sindresorhus · 2023-10-27T14:34:46Z

Bump :)

adam-arthur mentioned this pull request Jan 16, 2023

Severe performance regression when reporter with terminal output is enabled vitest-dev/vitest#2602

Closed

6 tasks

sindresorhus reviewed Jan 17, 2023

View reviewed changes

Performance - Drastically improve worst case regex performance

2cbdcae

Converting from array to string repeatedly in a loop leads to very poor performance in some contexts.

adam-arthur force-pushed the perf branch from fb7027c to 2cbdcae Compare January 18, 2023 03:15

sindresorhus added 2 commits January 20, 2023 15:52

Update index.js

158ea17

Update index.js

312bc5a

gtm-nayan mentioned this pull request May 27, 2023

perf: only create regex once chalk/strip-ansi#49

Merged

sindresorhus merged commit d989bc4 into chalk:main Oct 28, 2023

AriPerkkio mentioned this pull request Oct 29, 2023

perf: update log-update v9 vitest-dev/vitest#4390

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance - Drastically improve worst case regex performance #51

Performance - Drastically improve worst case regex performance #51

adam-arthur commented Jan 16, 2023 •

edited

adam-arthur commented Jan 16, 2023 •

edited

adam-arthur commented Jan 16, 2023

sindresorhus Jan 17, 2023

adam-te Jan 17, 2023

adam-arthur commented Jan 18, 2023 •

edited

sindresorhus commented Jan 20, 2023

moosemanf commented May 4, 2023

gtm-nayan commented May 27, 2023 •

edited

gtm-nayan commented Jun 2, 2023

sindresorhus commented Oct 27, 2023

Performance - Drastically improve worst case regex performance #51

Performance - Drastically improve worst case regex performance #51

Conversation

adam-arthur commented Jan 16, 2023 • edited

adam-arthur commented Jan 16, 2023 • edited

adam-arthur commented Jan 16, 2023

sindresorhus Jan 17, 2023

Choose a reason for hiding this comment

adam-te Jan 17, 2023

Choose a reason for hiding this comment

adam-arthur commented Jan 18, 2023 • edited

sindresorhus commented Jan 20, 2023

moosemanf commented May 4, 2023

gtm-nayan commented May 27, 2023 • edited

gtm-nayan commented Jun 2, 2023

sindresorhus commented Oct 27, 2023

adam-arthur commented Jan 16, 2023 •

edited

adam-arthur commented Jan 16, 2023 •

edited

adam-arthur commented Jan 18, 2023 •

edited

gtm-nayan commented May 27, 2023 •

edited