GitHub action #42

thomaseizinger · 2021-07-26T02:14:26Z

It would be nice to have a GitHub action that runs dylint without having to manually manage the installation and caching.

smoelius · 2021-07-26T08:42:56Z

Hi, @thomaseizinger. Admittedly, I don't know a lot about GitHub actions. Could you say a little bit more about what you are imagining?

Does something analogous exist for Clippy?

thomaseizinger · 2021-07-28T04:34:26Z

Sure! Sorry for being very brief in the initial issue!

Does something analogous exist for Clippy?

Yes. It is pretty easy to run clippy using GitHub actions.

Rust is pre-installed in the container that the actions run in, so if you are not concerned with running against the latest stable Rust, a step like this will fail the CI checks on any emitted warnings:

- run: cargo clippy -- -D warnings

A specific version of Rust can very easily be installed via: https://github.com/actions-rs/toolchain

Could you say a little bit more about what you are imagining?

I want to run dylint as part of my CI workflow but installing it every time takes up quite a bit of time. Caching can obviously be utilized but that requires some setup as well. In an ideal world, a GitHub action would be provided that manages the installation / download / caching for me so I can simply say:

- uses: trailofbits/dylint-action@v1
  with:
    args: -- -D warnings

This would assume we would provide an action in a repository dylint-action in the trailofbits organization.

In an ideal world, https://github.com/actions-rs/install could be used to solve this in a more generic manner but development there has stalled unfortunately :(
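Short of a dedicated action, the repeated installation cost can be reduced by skipping the install whenever the binary is already on PATH, for example because ~/.cargo/bin was restored from a cache. A minimal sketch, assuming a hypothetical helper named install_if_missing; the install command in the usage comment is an assumption and should be checked against Dylint's README:

```shell
# Hypothetical helper: run the given install command only when the named
# binary is not already on PATH (e.g. restored from a CI cache).
install_if_missing() {
    binary=$1
    shift
    if command -v "$binary" >/dev/null 2>&1; then
        echo "$binary already present, skipping install"
    else
        "$@"
    fi
}

# Possible use in a workflow `run:` step (install command is an assumption):
# install_if_missing cargo-dylint cargo install cargo-dylint dylint-link
```

This only avoids rebuilding the dylint binary itself; it does nothing for drivers or lint libraries.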

smoelius · 2021-07-29T10:44:43Z

This would assume we would provide an action in a repository dylint-action in the trailofbits organization.

This sounds reasonable to me. I'll ask around to see if anyone here has the bandwidth and necessary skills to tackle this. I'll also attach the help wanted label to this issue, as I think we'd be open to an external contribution.

As a note to myself:

Caching can obviously be utilized but that requires some setup as well.

I assume you are referring to this: https://github.com/actions/cache/blob/main/examples.md#rust---cargo I haven't tried it (though I probably will).

Also, as an aside, we do run Dylint from within a GitHub action for a separate project: https://github.com/trailofbits/test-fuzz/blob/6a532b5f37c5b9a6737f99d437d4a183fa560f4d/.github/workflows/ci.yml#L29-L32

Personally, I find the time it requires to be tolerable. But I can see the points you've made.

thomaseizinger · 2021-07-30T22:53:26Z

If we attach binaries to releases, we could also download them instead of building them from scratch every time, mitigating the need for caching.

smoelius · 2021-07-31T15:27:54Z

If we attach binaries to releases, we could also download them instead of building them from scratch every time, mitigating the need for caching.

Hmm, I'm not sure that I want to take on that responsibility.

Can I take it that this would solve a particular need of yours? What platform would you require a binary for?

thomaseizinger · 2021-07-31T22:01:58Z

If we attach binaries to releases, we could also download them instead of building them from scratch every time, mitigating the need for caching.

Hmm, I'm not sure that I want to take on that responsibility.

Can I take it that this would solve a particular need of yours? What platform would you require a binary for?

If we could download a binary, the GitHub action wouldn't need to cache it. I would likely run the workflows only on ubuntu-latest.

smoelius · 2021-08-01T17:44:34Z

I could maybe be convinced to publish ubuntu-latest binaries, but I am not sure that would completely address the problem. If you install Dylint from scratch and run it, much of the time will be spent on building drivers, and possibly building lint libraries.

I experimented with using actions/cache to cache Dylint artifacts and the results were good!

Here specifically is what I tried (https://github.com/trailofbits/test-fuzz/blob/2cac66d43ffe0889430927fb838c37f906fae3c4/.github/workflows/ci.yml#L12-L28):

    - name: Dylint versions
      run: cargo search dylint | sort | tee dylint_versions

    - uses: actions/cache@v2
      with:
        path: |
          ~/.cargo/bin/
          ~/.cargo/registry/index/
          ~/.cargo/registry/cache/
          ~/.cargo/git/db/
          ~/.dylint_drivers/
          ~/.rustup/toolchains/
          target/dylint/
        key: ${{ runner.os }}-dylint-${{ hashFiles('dylint_versions') }}

This took a 21-minute job down to about six minutes!

Note that the project where this was tested runs only one lint library, and it comes from the Dylint repository itself. If the project ran other lint libraries, their versions should be accounted for in the cache key.
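One way to account for other lint libraries, sketched under assumptions: keep their versions in a separate listing and merge it into the dylint_versions file before the cache step, so the existing hashFiles('dylint_versions') key changes whenever any of them changes. The helper name merge_version_lists and the file extra_lint_versions.txt are hypothetical.

```shell
# Hypothetical helper: merge several version listings into one sorted,
# de-duplicated file. The output file must not be one of the inputs.
merge_version_lists() {
    out_file=$1
    shift
    sort -u "$@" > "$out_file"
}

# Possible use in a workflow `run:` step (file names are assumptions):
# cargo search dylint > dylint_search.txt
# merge_version_lists dylint_versions dylint_search.txt extra_lint_versions.txt
```

Sorting keeps the file, and therefore the cache key, stable regardless of the order in which the listings are produced.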

So, I agree with you, having a "Dylint action" would be most convenient. But, short of that, this seems to be a pretty good solution.

What do you think, @thomaseizinger?

thomaseizinger · 2021-08-02T03:17:27Z

For normal caching in Rust projects, I usually use https://github.com/Swatinem/rust-cache because it makes it effectively a one-liner which is pretty nice.

I didn't actually realize that it might be worthwhile to cache the built libraries and drivers of dylint as well. Initially I was only thinking of the dylint binary itself.

Do I understand correctly that dylint stores things outside of the target directory? Can I ask why? If it were to store things within target, then we could make use of existing caching infrastructure like https://github.com/Swatinem/rust-cache.

In regards to versions, would it somehow be possible to add dylint libraries as dev-dependencies to a project and therefore have the exact version tracked in Cargo.lock?

smoelius · 2021-08-02T10:45:05Z

For normal caching in Rust projects, I usually use https://github.com/Swatinem/rust-cache because it makes it effectively a one-liner which is pretty nice.

That is nice!

Do I understand correctly that dylint stores things outside of the target directory? Can I ask why?

The drivers (essentially wrappers around the rust compiler) are stored in ~/.dylint_drivers by default. The only thing that distinguishes them is the version of the compiler they wrap. So it makes sense to share them across projects.

If it were to store things within target, then we could make use of existing caching infrastructure like https://github.com/Swatinem/rust-cache.

This should be achievable by setting the DYLINT_DRIVER_PATH environment variable (though an apparent bug means it only works on the first try):

mkdir -p "$PWD/target/dylint_drivers"
DYLINT_DRIVER_PATH="$PWD/target/dylint_drivers" cargo dylint --all --workspace

Admittedly, we might want a more ergonomic solution than having to set DYLINT_DRIVER_PATH.
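In a GitHub Actions workflow, one slightly more ergonomic option is to set the variable once for all later steps by appending it to the file named by GITHUB_ENV. A minimal sketch, assuming a hypothetical helper named persist_driver_path; it takes the env file as an argument so it can be tried outside CI:

```shell
# Hypothetical helper: create the driver directory under target/ and record
# DYLINT_DRIVER_PATH in an env file. In GitHub Actions the env file would be
# "$GITHUB_ENV", which makes the variable visible to subsequent steps.
persist_driver_path() {
    workspace_root=$1
    env_file=$2
    driver_dir="$workspace_root/target/dylint_drivers"
    mkdir -p "$driver_dir"    # no error if the directory already exists
    printf 'DYLINT_DRIVER_PATH=%s\n' "$driver_dir" >> "$env_file"
}

# Possible use in a workflow `run:` step:
# persist_driver_path "$PWD" "$GITHUB_ENV"
```

Whether a given caching action then picks up target/dylint_drivers depends on that action's own rules, so that part would need to be checked separately.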

In regards to versions, would it somehow be possible to add dylint libraries as dev-dependencies to a project and therefore have the exact version tracked in Cargo.lock?

I don't think so, or at least I can't immediately see how.

The main problem is that all members of a workspace are expected to be compiled with the same compiler version. But a Dylint library is pinned to a specific compiler version, which isn't necessarily the same as that of the project it's run against.

I can see the benefits of what you're suggesting. Maybe there's some "trick" we could use to make it work, but I can't see it right now.

thomaseizinger · 2021-08-03T06:37:45Z

The drivers (essentially wrappers around the rust compiler) are stored in ~/.dylint_drivers by default. The only thing that distinguishes them is the version of the compiler they wrap. So it makes sense to share them across projects.

As soon as you change your compiler version, I believe rustc compiles all of your dependencies again. From that perspective, isn't a dylint driver really just another dependency?

If it were to store things within target, then we could make use of existing caching infrastructure like Swatinem/rust-cache.

This should be achievable by setting the DYLINT_DRIVER_PATH environment variable (though an apparent bug means it only works on the first try):

On my personal computer, I am setting target-dir to a directory under ~/.cache via build.target-dir in ~/.cargo/config.toml. This serves two purposes:

Nuking the target directory of all my different Rust projects in one go.
Re-using build artifacts of dependencies across projects.

If dylint were to store its drivers under CARGO_TARGET_DIR, this reuse could happen without dylint needing to do anything by itself I believe :)
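With the knobs mentioned so far, a similar effect can be approximated by hand: point CARGO_TARGET_DIR at a shared directory and place DYLINT_DRIVER_PATH inside it. A minimal sketch, assuming a hypothetical helper use_shared_target_dir and an arbitrary directory name cargo-target; Dylint itself does not do this automatically:

```shell
# Hypothetical helper: use one shared target directory under the user's cache
# directory and keep Dylint's drivers inside it. "cargo-target" is an
# arbitrary, assumed directory name.
use_shared_target_dir() {
    cache_home=${XDG_CACHE_HOME:-$HOME/.cache}
    CARGO_TARGET_DIR="$cache_home/cargo-target"
    DYLINT_DRIVER_PATH="$CARGO_TARGET_DIR/dylint_drivers"
    mkdir -p "$DYLINT_DRIVER_PATH"
    export CARGO_TARGET_DIR DYLINT_DRIVER_PATH
}

# Possible use in a shell profile or CI step:
# use_shared_target_dir
# cargo dylint --all --workspace
```

Deleting the shared directory then removes build artifacts and drivers in one go.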

The main problem is that all members of a workspace are expected to be compiled with the same compiler version. But a Dylint library is pinned to a specific compiler version, which isn't necessarily the same as that of the project it's run against.

I am not sure I completely follow. When and where is this pinning happening?

smoelius · 2021-08-03T10:49:55Z

Let me answer your last question first:

I am not sure I completely follow. When and where is this pinning happening?

Essentially, when you build the library. A library gets a name like:

libtry_io_result@nightly-2021-06-03-x86_64-unknown-linux-gnu.so
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

The reason is that the compiler APIs that lints use are unstable and could change at any time. Dylint needs to know which version of the Rust compiler a lint library uses, as this determines the driver needed to run the library.
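To illustrate the naming scheme, the toolchain can be recovered from such a file name with plain parameter expansion. This is only a sketch of the idea, not how Dylint itself does it; the helper name toolchain_of_library is hypothetical, and only the .so form shown above is handled:

```shell
# Hypothetical helper: extract the toolchain from a library file name of the
# form lib<name>@<toolchain>.so (other platforms' extensions are not handled).
toolchain_of_library() {
    file_name=${1##*/}              # drop any leading directory
    without_ext=${file_name%.so}    # drop the .so extension
    printf '%s\n' "${without_ext#*@}"   # keep everything after the first '@'
}

toolchain_of_library 'libtry_io_result@nightly-2021-06-03-x86_64-unknown-linux-gnu.so'
# prints: nightly-2021-06-03-x86_64-unknown-linux-gnu
```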

Back to your first question:

As soon as you change your compiler version, I believe rustc compiles all of your dependencies again. From that perspective, isn't a dylint driver really just another dependency?

The problem is the compiler version that a lint library uses isn't necessarily the same as what an author wants to use to build their package, e.g., for production.

For example, an author might want to build their package using (stable) 1.44.0-x86_64-unknown-linux-gnu. But they might want to lint that package using a library that compiles using nightly-2021-03-11-x86_64-unknown-linux-gnu. We'd like to give them that flexibility.

Occasionally, this causes problems. It can happen that an author's package can't be compiled with the compiler that a lint library uses. Fortunately, I've only bumped into this a handful of times.

On my personal computer, I am setting target-dir to a directory under ~/.cache via build.target-dir in ~/.cargo/config.toml. This serves two purposes:

Nuking the target directory of all my different Rust projects in one go.

Re-using build artifacts of dependencies across projects.

That's an interesting idea. If I'm understanding correctly, you're sharing a target directory across projects.

If dylint were to store its drivers under CARGO_TARGET_DIR, this reuse could happen without dylint needing to do anything by itself I believe :)

That might make sense for users who use a shared target directory, as you've suggested. But I don't know how common that is. Given that a workspace's target directory appears in its root by default, I think Dylint's current behavior makes sense, as it allows drivers to be reused across different workspaces.

thomaseizinger · 2021-08-03T13:14:02Z

Essentially, when you build the library. A library gets a name like:
libtry_io_result@nightly-2021-06-03-x86_64-unknown-linux-gnu.so
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
The reason is that the compiler APIs that lints use are unstable and could change at any time. Dylint needs to know which version of the Rust compiler a lint library uses, as this determines the driver needed to run the library.

Back to your first question:

As soon as you change your compiler version, I believe rustc compiles all of your dependencies again. From that perspective, isn't a dylint driver really just another dependency?

The problem is the compiler version that a lint library uses isn't necessarily the same as what an author wants to use to build their package, e.g., for production.

Right, that makes a lot of sense. Thank you for explaining!

That's an interesting idea. If I'm understanding correctly, you're sharing a target directory across projects.

Yes, correct.

That might make sense for users who use a shared target directory, as you've suggested. But I don't know how common that is. Given that a workspace's target directory appears in its root by default, I think Dylint's current behavior makes sense, as it allows drivers to be reused across different workspaces.

It does make sense. I would probably just not have gone through the effort of building that optimization into dylint itself. If people are willing to accept rebuilds of the same dependencies (like syn, etc.) across multiple workspaces, they are likely also fine with dylint having to rebuild drivers per workspace. If they are not, they can use a shared target directory.

Overall, this was very educational, thank you!
I think setting dylint's driver path and using https://github.com/Swatinem/rust-cache is going to be my way forward :)

smoelius · 2021-08-04T10:32:48Z

Right, that makes a lot of sense. Thank you for explaining!

No problem. It's helpful for me to talk through it now and then. :)

If they are not, they can use a shared target directory.

Is using a shared target directory common?

Separately and for my own reference, I wondered whether Cargo offered a way to store the download cache in the target directory (because the download cache is shared across workspaces kind of like how Dylint drivers are). This was the only relevant piece of information I could find: https://stackoverflow.com/questions/45222791/is-it-possible-to-install-cargo-dependencies-in-the-same-directory-as-my-project

And back to this topic:

In regards to versions, would it somehow be possible to add dylint libraries as dev-dependencies to a project and therefore have the exact version tracked in Cargo.lock?

I think I may have some ideas here. I am going to start playing with this once I am confident I have #54 fixed.

thomaseizinger · 2021-08-04T10:40:20Z

I think the reason downloads are shared is that the source code is always the same, regardless of the cargo/Rust version.

I am not sure how common a shared target directory is but I prefer it simply for the ease of freeing up storage :)

smoelius added the "help wanted" label on Jul 29, 2021
