Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug?]: Aliased installs with node_modules are many times larger than the unaliased versions #3951

Closed
1 task
jpm4 opened this issue Jan 7, 2022 · 3 comments
Labels
bug Something isn't working stale Issues that didn't get attention waiting for feedback Will autoclose in a while unless more data are provided

Comments

@jpm4
Copy link

jpm4 commented Jan 7, 2022

Self-service

  • I'd be willing to implement a fix

Describe the bug

On an attempted upgrade to 3.2.0-rc.10 in a large monorepo, our node_modules size went from ~1.8G to ~2.4G. The most obvious examples of installs that increased in size were aliased ones - the most extreme case I've seen is webpack@4.46.0, from 2.3M -> 260M.

There is a large "self reference" directory (node_modules/webpack4/node_modules/webpack) as the deleted test here would normally test for: https://github.com/yarnpkg/berry/pull/3631/files#diff-ca05954739eec8e45eefa4b801ae3909a5069348db95ca5fb1c7be9ff57cf54aL344 (We must not create self-reference directory 'node_modules/no-deps2/node_modules/no-deps'). That PR is relevant to us for the issue it fixes, v3.1.1 breaks.

To reproduce

{
  "packageManager": "yarn@3.2.0-rc.10",
  "dependencies": {
    "webpack": "^4.46.0",
    "webpack4": "npm:webpack@^4.46.0"
  }
}

Observe large size difference between webpack and webpack4. This can also be observed with separate installs, which would be more realistic anyway. I'd guess this would work with many/all packages; another one I tried at random was selenium-webdriver.

Environment

System:
    OS: macOS 11.6.2
    CPU: (8) x64 Intel(R) Core(TM) i7-4870HQ CPU @ 2.50GHz
  Binaries:
    Node: 12.22.8 - /path/to/node
    Yarn: 3.2.0-rc.10 - /path/to/yarn
    npm: 6.14.15 - ~/.nvm/versions/node/v12.22.8/bin/npm

Additional context

No response

@jpm4 jpm4 added the bug Something isn't working label Jan 7, 2022
@paul-soporan
Copy link
Member

Creating self-reference directories for aliases when using the NM linker is required to fulfill the require promise (if we didn't, webpack4 wouldn't be able to require itself as webpack), the fact that we didn't do it before was a correctness issue.

The solution is to use nmMode: hardlinks-local (which will be the new default starting from 4.x) so that the files in the self-reference directory are hardlinked rather than duplicated.

@paul-soporan paul-soporan added the waiting for feedback Will autoclose in a while unless more data are provided label Jan 7, 2022
@jpm4
Copy link
Author

jpm4 commented Jan 8, 2022

Nice thanks for that tip! On 3.2.0-rc.10 that brings the node_modules size back down to what it was or even slightly lower. The node_modules/aliased_webpack4 dir in our project (not in the minimal repro) is still much bigger than before (~77MB), but maybe that doesn't matter if the entirety of node_modules is smaller?

This also looks to have sped up installs a ton, from 2m -> 1m with node_modules deleted, or 6m -> 3m with .yarn/cache and install-state deleted as well.

According to #3591 there's no real downside to this approach that you've seen?

@yarnbot
Copy link
Collaborator

yarnbot commented Feb 7, 2022

Hi! 👋

This issue looks stale, and doesn't feature the reproducible label - which implies that you didn't provide a working reproduction using Sherlock. As a result, it'll be closed in a few days unless a maintainer explicitly vouches for it or you edit your first post to include a formal reproduction (you can use the playground for that).

Note that we require Sherlock reproductions for long-lived issues (rather than standalone git repositories or similar) because we're a small team. Sherlock gives us the ability to check which bugs are still affecting the master branch at any given point, and decreases the amount of code we need to run on our own machines (thus leading to faster bug resolutions). It helps us help you! 😃

If you absolutely cannot reproduce a bug on Sherlock (for example because it's a Windows-only issue), a maintainer will have to manually add the upholded label. Thanks for helping us triaging our repository! 🌟

@yarnbot yarnbot added the stale Issues that didn't get attention label Feb 7, 2022
@yarnbot yarnbot closed this as completed Feb 12, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working stale Issues that didn't get attention waiting for feedback Will autoclose in a while unless more data are provided
Projects
None yet
Development

No branches or pull requests

3 participants