
Perf improvements - avoid persisting haste map / processing files when not changed. #8153

Merged

Conversation

scotthovestadt
Contributor

Summary

At Facebook, in the common situation where you've changed a couple of files in the largest haste map, this PR cuts roughly 25% off the startup time. In less common situations where you're working on a smaller haste map, the improvement is roughly 60%.

The improvement is gained by:

  • Not re-serializing and writing the haste map to disk when it was loaded from disk and then not changed.
  • Not re-creating the map and mocks portions of the haste map from scratch on startup when we know exactly which files changed. Instead, only the changed files are re-processed (see the sketch after this list).
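
To make those two bullets concrete, here is a minimal hypothetical sketch in TypeScript (invented names and types, not the actual jest-haste-map implementation): a dirty flag guards the serialize-and-write step, and the watcher's list of changed files drives incremental re-processing.

// Hypothetical sketch of the two optimizations; not the actual jest-haste-map code.
import {writeFileSync} from 'fs';
import {serialize} from 'v8';

type InternalHasteMap = {
  files: Map<string, unknown>;
  map: Map<string, unknown>;
  mocks: Map<string, string>;
};

class HasteMapSketch {
  // True only when the in-memory map diverges from what is on disk.
  private _cacheDirty = false;

  // 1) Skip serializing and writing the cache if it was loaded from disk and never changed.
  persist(cachePath: string, hasteMap: InternalHasteMap): void {
    if (!this._cacheDirty) {
      return; // nothing changed since the cache was read: skip the expensive write
    }
    writeFileSync(cachePath, serialize(hasteMap));
    this._cacheDirty = false;
  }

  // 2) When we know exactly which files changed, re-process only those files
  //    instead of rebuilding `map` and `mocks` from scratch.
  applyChanges(hasteMap: InternalHasteMap, changedFiles: Set<string>): void {
    for (const filePath of changedFiles) {
      this._processFile(hasteMap, filePath);
    }
    if (changedFiles.size > 0) {
      this._cacheDirty = true;
    }
  }

  private _processFile(hasteMap: InternalHasteMap, filePath: string): void {
    // Extract module/mock metadata for a single file and update the maps in place.
    hasteMap.files.set(filePath, {});
  }
}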

I've benchmarked the startup time by:

  1. Setting up a single test
  2. Running once to prime the cache
  3. Changing a test file
  4. Running the test again with --skipFilter (measuring at this point via time)

I've been a bit conservative: when files are deleted, I still do a full re-process (same as the current behavior), though I may improve that later. Deleting a file is much less common than editing one, and I wanted to keep the code as simple as possible initially.

In cases where Watchman isn't being used or is freshly started, there is no difference.

I'm always a little suspicious when something relatively simple yields such a large performance improvement, so please help by casting a very critical eye on this PR and all assumptions that I made.

Test plan

  • All tests pass.
  • Tested manually in multiple situations.
  • No change in behavior without watchman.
  • Manually verified the cache file is updated appropriately in a variety of situations.

@SimenB
Member

SimenB commented Mar 19, 2019

Woah, crazy numbers. Love it!

Is it possible to add a test for this? And update the changelog 🙂

});
const __hasteMapForTest =
  (process.env.NODE_ENV === 'test' && hasteMap) || null;
return this._watch(hasteMap).then(() => ({
Contributor


Just a quick observation: since this._buildPromise is an async function, is there a need to call then on this._watch? Shouldn't this be an await followed by returning the object?

Member


This code didn't use async/await in the past, which is probably why it looks this way. It can be changed to await.

Contributor Author


I'll change to await. Thanks for the feedback.
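
For readers following along, the suggested change amounts to something like the following sketch, based on the quoted lines above. The surrounding class, the return shape, and the field names are invented here; the real return value lives in jest-haste-map and the exact final diff may differ.

// Hypothetical sketch of the await rewrite; names and return shape are invented.
type BuildResult = {
  hasteFS: unknown;
  moduleMap: unknown;
  __hasteMapForTest: unknown;
};

class HasteMapBuildSketch {
  private async _watch(hasteMap: unknown): Promise<void> {
    // start the file watcher (stubbed out here)
  }

  async build(hasteMap: unknown): Promise<BuildResult> {
    const __hasteMapForTest =
      (process.env.NODE_ENV === 'test' && hasteMap) || null;
    // Before: return this._watch(hasteMap).then(() => ({...}));
    // After: the enclosing function is already async, so await and return directly.
    await this._watch(hasteMap);
    return {hasteFS: null, moduleMap: null, __hasteMapForTest};
  }
}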

Member

@cpojer cpojer left a comment


Nice work, can't believe how big the speedup is for Haste Map reconciliation!

Contributor

@rubennorte rubennorte left a comment


This looks great! Thanks for working on this.

Did you measure the impact of this change with a clean cache? It'd be great to have the time to build the haste map specifically (which will be more precise than the full time to run jest) with a cold and a warm cache, and in different scenarios (no files changed, some files modified, some files removed, etc.).

@scotthovestadt
Contributor Author

@rubennorte This PR does not impact the cold-cache run time at all. What I've done here is basically add code paths that only fire when the cache is warm, to avoid doing unnecessary work. When the cache is cold, or files were deleted, the PR behaves essentially the same as what's on master currently.

After this PR, the main remaining costs in cache generation are reading and deserializing the cache, and serializing and writing it back to disk; these account for about 30% of the current start time and almost the entirety of cache generation time when the cache is warm.

I have another PR coming that refactors the serialization with the same theme as this PR: we should only need to write what changed. I expect it to cut cache generation time (when warm) by about 70%, based on early measurements!
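
That follow-up isn't shown here, but one possible way to realize the stated idea of "only write what changed" is sketched below. This is purely a hypothetical illustration with invented names (shards, persistChangedShards, the .cache file layout), not the actual follow-up PR.

// Hypothetical illustration of "only write what changed"; not the actual follow-up PR.
import {writeFileSync} from 'fs';
import {serialize} from 'v8';

type CacheShard = Map<string, unknown>;

// Split the cache into shards and persist only the shards whose contents
// changed, leaving the rest on disk untouched.
function persistChangedShards(
  shards: Map<string, CacheShard>,
  dirtyShardIds: Set<string>,
  cacheDir: string,
): void {
  for (const shardId of dirtyShardIds) {
    const shard = shards.get(shardId);
    if (shard !== undefined) {
      writeFileSync(`${cacheDir}/${shardId}.cache`, serialize(shard));
    }
  }
  dirtyShardIds.clear();
}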

  hasteMap.map = map;
  hasteMap.mocks = mocks;
  return hasteMap;
} catch (error) {
Member


could do

} finally {
  this._cleanup();
}

instead of the catch (and remove it from the happy path as well). Not sure if it's better or not?

Contributor Author


Since the catch actually throws the error, it wouldn't make it to the finally block in this case. But yeah, the code duplication (even just calling the method) annoys me a little too. I don't see a way around it, though.
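
For context, a minimal sketch of the shape under discussion, with hypothetical names (the real jest-haste-map code differs): cleanup runs on the happy path and again in the catch, which re-throws the error, and that duplicated call is the annoyance mentioned above.

// Hypothetical sketch of the shape under discussion; not the actual jest-haste-map source.
type HasteMapData = {map: Map<string, unknown>; mocks: Map<string, string>};

class HasteMapCleanupSketch {
  private _cleanup(): void {
    // release watchers, temporary state, etc.
  }

  private async _processFiles(): Promise<HasteMapData> {
    return {map: new Map(), mocks: new Map()};
  }

  async build(hasteMap: HasteMapData): Promise<HasteMapData> {
    try {
      const {map, mocks} = await this._processFiles();
      this._cleanup(); // happy-path cleanup
      hasteMap.map = map;
      hasteMap.mocks = mocks;
      return hasteMap;
    } catch (error) {
      this._cleanup(); // duplicated cleanup before re-throwing
      throw error;
    }
  }
}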

@github-actions

This pull request has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
Please note this issue tracker is not a help forum. We recommend using StackOverflow or our discord channel for questions.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators May 11, 2021