feat(bundler): extract, update, artifacts #3058

rarkins · 2019-01-12T06:46:21Z

This PR adds extract, update and artifacts functions to put Bundler support into an "alpha" support stage. Some known gotchas:

Extracting and updating multi-part ranges, e.g. gem "uglifier", ">= 1.3.0", "< 1.4.0", require: false
Dynamic switching between Bundler versions. If Bundler 2.0 is meant to achieve this, it doesn't seem to be working. We'll probably solve it by using multiple Docker images
Updating the Gemfile.lock requires any referenced gemspec files to be present, so this only works with gitFs mode right now

rarkins · 2019-01-12T06:49:55Z

Re: locating/writing gemspec files before lock file generation, does anyone know how exhaustively we need to check for these? Do we just need to look for gemspec and <project name>.gemspec, for example?

bjeanes · 2019-01-12T07:09:59Z

I don't recall ever running into a gem whose gemspec wasn't <name of library>.gemspec in root directory, if that's what you mean. https://guides.rubygems.org/make-your-own-gem/ doesn't mention any extra locations and the template that rubygems generates for new gems would put it there.

rarkins · 2019-01-12T07:21:46Z

Is the project name extracted from any project metadata or based purely on the parent directory name?

bjeanes · 2019-01-12T07:33:04Z

That I don't know. I wouldn't be surprised if it's the other way around and it discovers gems via **/*.gemspec and then uses the metadata from within those files to determin actual gems. I suspect the placement and naming of the gemspec file is purely convention, but I don't know for sure, unfortunately.

rarkins · 2019-01-12T08:27:27Z

I’ll copy over every gemspec file it finds just to be “safe”.

rye · 2019-01-12T15:42:27Z

@rarkins, thanks for doing this!

For what it's worth, gem build explicitly takes an argument that is the gemspec file, so I don't think there exists any requirement within rubygems itself on the file being named or placed in a given location.

After a bit more digging, I found that Bundler's handling of gemspec's is largely inherited from the Bundler::Source::Path class, which has a DEFAULT_GLOB that may be of interest: {,*,*/*}.gemspec. This means that files matching .gemspec, *.gemspec, and */*.gemspec should match that glob. This is, of course, unless the glob option is set in the bundler config? I haven't ever seen this in practice so I'd have to do some more digging to find that.

I hope this is helpful. I'd be glad to do some more research if there are any other questions that need answering.

rarkins · 2019-01-12T15:47:19Z

@rye thanks very much for the extra research! Right now the string match we use on the file list should catch every single .gemspec file. When I tested against Rails's repo I found it still failed though because there was an arbitrary inclusion of its ./RAILS_VERSION file. So even if we grab ever gemspec file, literally any other file in the repo could be referenced. Anyway, this will still be fine if we use Renovate's gitFs as a fallback, as every single file is then available.

For now I think we're fine on the gemspec topic. The big remaining one is to be able to dynamically switch Bundler versions to match what's written in the Gemfile.lock. It's not really difficult but will take some time to set up all the Docker images necessary to support for the hosted version.

rye · 2019-01-12T16:08:29Z

@rarkins, yes, that inclusion of other files in a project is actually very commonplace, since gemspecs are actually Ruby source files and are interpreted as such. One approach to gathering information from a gemspec is to do as Bundler does and actually load the gemspec using RubyGems' own code, then extract dependency information and the like from that. I would be glad to work on this approach when I have some more availability—it should be moderately straightforward though, it's just calling Gem::Specification.load and identifying which fields to pull out. A bit of poking around with pry -rrubygems should be all that is needed. (Edit: to ascertain these fields)

As far as the Bundler version switching goes, that sounds like a good plan. I'll note that Bundler 2.0 (recently launched) includes the ability to interpret prior versions of Gemfile.locks, but many projects have not switched yet and it's probably better to use a project's own Bundler version for parity. That will be an intriguing challenge, especially since Bundler 2.0+ have more strict Ruby version requirements.

rarkins · 2019-01-12T16:11:21Z

My own experience attempting to use Bundler 2.0 for both 2.x and 1.x failed. I see they’ve had teething problems and even backed off requiring Rubygems 3.0.0 now, so I figured it’s easiest to just dynamically switch versions using docker tags instead (and parsing Gemfile.lock). With this current approach I hope we don’t have the need for any advanced ruby requiring as we’ll just use the full file system if necessary plus exact Bundler version.

rye · 2019-01-12T16:51:10Z

@rarkins yep, there have been issues with the 2.0 upgrade all around.

I will note, still, that parsing the Gemfile and Gemfile.lock using Bundler's own code may be a better approach, since it's available to you and would ensure consistent results—Gemfiles themselves are also Ruby source code… (source, group and gem are part of a DSL)

There are idiosyncrasies like that you can use ' or " when wrapping strings and such that would be hard(er) to account for if you're just parsing the file as raw input. And I'm sure someone somewhere has found a reason to require from their Gemfile.

As a proof of concept, I was able to rather easily use Bundler::LockfileParser to parse my project's lockfile:

require 'bundler'

lockfile_contents = open('Gemfile.lock', 'rb', &:read)
Bundler::LockfileParser.new(lockfile_contents).dependencies
# => {"fastlane"=>Gem::Dependency.new("fastlane", Gem::Requirement.new([">= 0"]), :runtime),
# "fastlane-plugin-bugsnag"=>Gem::Dependency.new("fastlane-plugin-bugsnag", Gem::Requirement.new([">= 0"]), :runtime),
# "json"=>Gem::Dependency.new("json", Gem::Requirement.new([">= 0"]), :runtime),
# "rubocop"=>Gem::Dependency.new("rubocop", Gem::Requirement.new(["~> 0.60"]), :runtime),
# "xcodeproj"=>Gem::Dependency.new("xcodeproj", Gem::Requirement.new([">= 0"]), :runtime)}

Bundler::LockfileParser.new(lockfile_contents).dependencies.keys
# => ["fastlane", "fastlane-plugin-bugsnag", "json", "rubocop", "xcodeproj"]

Bundler::LockfileParser.new(lockfile_contents).sources.map(&:remotes).flatten.map(&:to_s)
# => ["https://rubygems.org/"]

(This API works for Bundler 2.0 and 1.17, I didn't test widely but I'd imagine it hasn't changed much.)

So, I guess my argument is that your extraction code could be made much shorter (and more general) if you did opt to use the Bundler code itself. Calling this from JS could be as simple as exec-ing ruby with -rbundler and -e a string containing a script, or you could also have a Ruby script that handles other things. The nice thing is that then you have the exact same information that Bundler has.

Thoughts?

rarkins · 2019-01-12T18:42:31Z

@rye I'm still making up my mind regarding whether to do Gemfile parsing and/or updating using Ruby or keep it using JS. I had expected to need Ruby, but for now I think it might be at 95% of use cases already. e.g. even if someone requires something from their Gemfile, it doesn't necessarily mean we need to care about it - or support it.

However I do ideally want Renovate to be "flexible so you don't need to be", so I'm opening to switching to Ruby-based parsing if there are clear benefits.

One other thing to note is that we need to be able to not only parse a file, but also to be able to update it too. i.e. it's not enough to get a parsed object if is then really hard to perform the update. You can see with the current JS-based approach that we can do it with line numbers, which makes for the easiest possible update logic.

BTW I think the lockfile may be the one case where we don't need to do it in Ruby because it's simple and not executable code.

One downside of making it Ruby-based is that we may need to not only call a child process but actually call a Docker child process - which significantly increases the latency. The reason for this is that if we are executing arbitrary Ruby code then we need to as sandboxed as possible within Docker, rather than directly calling Ruby on the same machine/VM. That is, unless we think Bundler's parsing is sandboxed enough that it couldn't allow malicious code. But although this increases latency a lot, it remains to be seen whether it matters. e.g. if a typical Ruby project takes 10s to run and we add 0.5s to that, it's really not a big deal.

I think the way forward will be like this:

Launch with this simple JS-based parsing
Wait for cases where it "doesn't work" (for example, single quotes rather than double was an example you mentioned)
Determine if they can be easily fixed in the JS parsing or not
If not, then decide whether to make the slower Ruby parsing as a fallback option (configurable), or as the only option

Thanks again for your feedback

rarkins · 2019-01-13T04:57:48Z

My latest attempt at a Bundler 2 image seems to switch back to 1.x as desired, so I'll use that for now

bjeanes · 2019-01-13T05:12:38Z

BTW I think the lockfile may be the one case where we don't need to do it in Ruby because it's simple and not executable code.

Correct.

Unfortunately, executable Gemfiles are pretty common. I've done this during Rails upgrades, for instance, so that I can dynamically change versions of some gems (e.g. gem X v1 is only compatible with Rails <=3, but v2 is ONLY compatible with Rails 4+, so there is no bridging version).

rarkins · 2019-01-13T08:20:40Z

As a general rule, I don't mind shortcomings/known limitation so long as they're gracefully wrong and also that they don't render the stuff that works useless. So I think in this case if you're pulling in versions based on Ruby logic, it simply won't update those and will only update gems that have version fields defined.

So a quick status update:

Parsing done with JS, any versions that rely on third party Ruby files won't get "renovated"
Using a Bundler 2.0 image for Docker, anyone running locally is free of course to use their own locally-installed/matching version
All *.gemspec files will be copied over prior to bundle lock. If Bundler still fails because the gemspec files reference other files, then need to opt into gitFs

rye · 2019-01-13T17:05:50Z

@rarkins, I definitely see where you're at. Thanks again—at this point, it seems like this PR is only meant to add bundler support in alpha stage, which it does quite well. I was under the impression that more projects than my own use ' as a string boundary in their Gemfiles… turns out it's just me. 😅 Looking around it should cover a lot of use cases.

Personally, I'd be more than happy to opt-in on one of my projects to see how well it works.

Just out of curiosity, would you be interested in getting Ruby-based parsing (sandboxed appropriately) instead as a contribution later on down the road? I'm now reading through the codebase to figure out how this might be done, but I'd be happy to chip that in later.

rarkins · 2019-01-13T18:57:44Z

@rye I think this approach may even be good enough for "GA", although obviously wait for feedback first. I'd definitely be happy to have a Ruby-based extract option added later to capture all use cases. Once the JS one is working and we can identify any shortcomings, being able to test/verify a Ruby-based drop-in replacement should be fairly simple. It would pretty much just be a matter of replacing the existing lib/manager/bundler/extract and update functions.

BTW we already do a Python-based parsing of setup.py files, so it wouldn't be the first time.

Anyway I think I'll merge this in the next day or two and will blog instructions on how to test/what to look for and will appreciate your feedback. Are you using the app or CLI currently?

rye · 2019-01-13T19:29:47Z

@rarkins All this sounds good. I definitely think having a full test base and feedback from more users will make any further changes this faster and higher-confidence, so releasing this as an opt-in kind of thing is the first step so some entirely bundler-based projects can start testing it out. (I have a few projects that I'll be inclined to use Renovate on with even partial Bundler support.)

Any yes, I'm using the app, didn't even know that Renovate could be run from the CLI until now, but that actually makes a lot of sense. I'm assuming I'll need to explicitly enable something in the config, but I'm definitely going to do that on StoDevX/AAO-React-Native quite quickly after this is made available to that project. (I could definitely run locally in the meantime if that would be more helpful.)

I'll be glad to provide any further feedback and help as a likely end-user of Bundler support in the long run.

renovate-bot · 2019-01-14T05:58:46Z

🎉 This PR is included in version 13.175.0 🎉

The release is available on:

Your semantic-release bot 📦🚀

bjeanes · 2019-01-15T07:48:39Z

Where would you like feedback about this feature? New issues?

I don't have anything specific yet other than that every update is in an "error" state (according to master issue) but the job logs don't have anything that sticks out to me (I searched for dep name and for "error" and came up empty handed).

rarkins · 2019-01-15T08:26:58Z

@bjeanes I'll take a look for your repo now. For future I've posted this new issue so any feedback doesn't get lost: #3073

rarkins · 2019-01-15T08:39:41Z

@bjeanes I see that Renovate has detected Gemfile and .cache/Gemfile. Should the latter be ignored?

rarkins · 2019-01-15T08:53:56Z

By the way, I don't see any errors in the logs. It looks like it doesn't see any updates. Are there any that you are expecting? And can you paste a screenshot of the master issue here, or send to support@renovatebot.com if confidential?

bjeanes · 2019-01-15T10:31:52Z

I see that Renovate has detected Gemfile and .cache/Gemfile. Should the latter be ignored?

In my case, yes, but this isn't anything common or universal about Ruby-land. I imagine there is some way to disable that that within my config.

I don't see errors either. Where I am seeing "error" is in the master issue. Renovate has since edited the issue but from the history view of the comment content:

rarkins · 2019-01-15T11:31:09Z

First I saw this:

{
 "branch":"renovate/machinist-2.x",
  "dependencies":["machinist"],
  "msg":"Skipping branch creation as not within schedule",
  "time":"2019-01-15T10:20:35.150Z"
}

And then the most recent log shows your repo exiting with:

Response code 429 (Too Many Requests)

It seems rubygems is rate limiting us, even with a very low number of requests so far. I need to look into it more.

bjeanes · 2019-01-15T22:47:31Z

Skipping branch creation as not within schedule

I forced it in master issue and then it went into error state, so likely you are looking at a log of a newer job which is back to not scheduling it.

rye · 2019-01-16T00:13:36Z

@rarkins from the documentation on the RubyGems API:

Rate Limits

To protect the RubyGems.org service from abuse, both intentionally and unintentionally, we have rate limits in place for some of our endpoints. Some endpoints may be cached by our CDN at times and therefore may allow higher request rates. The following is a general guideline for the rate limit rules.

API and website: 10 requests per second

Dependency API: 15 requests per second

Website sign up, sign in, api key, and forgot password: 100 requests per 10 minutes

Users who hit a rate limit will see HTTP 429 responses, for the remainder of the limit window. Usually this is just a few minutes.

The RubyGems.org team may occasionally blackhole user IP addresses for extreme cases to protect the platform. If you think this has happened to you, please submit a help ticket and we’ll be happy to look at it.

From what you're seeing, does it look like Renovate is hitting the 15 requests per second limit? If so, it'll be worth digging into why. Alternatively, I would wonder if it's hitting the 100 request/10 minute limit for some reason?

rarkins · 2019-01-16T08:43:26Z

@bjeanes I think we're probably hitting the 15 requests per second limit, I will need to work out the best location to limit it

rarkins added 7 commits January 10, 2019 10:31

Gemfile extract

f2261fa

add bundler fileMatch and versionScheme

d4b0d7a

fix registryUrls

9d14938

drop comment

f22cd96

add purl

cf1579f

update function

b279127

bundler artifacts

ca53e56

rarkins added the review label Jan 12, 2019

rarkins mentioned this pull request Jan 12, 2019

Support Ruby/Bundler #932

Closed

rarkins changed the title ~~feat(bundler): extract, update, artifacts~~ feat(bundler): extract, update, artifacts (WIP) Jan 12, 2019

copy gemspec files

415cfef

rarkins mentioned this pull request Jan 13, 2019

Bundler lock file maintenance #3061

Closed

rarkins added 4 commits January 13, 2019 09:26

ruby fs error

1af08e9

update snapshots

c55edb5

fix(ruby): isValid for complex ranges

4730a15

fix: handle complex ranges

69c5baf

rarkins added 7 commits January 13, 2019 10:43

Merge branch 'master' into feat/2983-bundler-extract-update

1456e10

fix(ruby): replace complex ranges

6e04919

replace complex ranges

af48e10

Merge branch 'master' into feat/2983-bundler-extract-update

40fe2ee

fix test

a0eec24

add language doc

76b5339

disable by default

02aacac

rarkins added 2 commits January 14, 2019 05:20

throw errors for every bundler failure

03b1b79

add tests

ebf2b84

rarkins changed the title ~~feat(bundler): extract, update, artifacts (WIP)~~ feat(bundler): extract, update, artifacts Jan 14, 2019

rarkins merged commit ba77d4a into master Jan 14, 2019

rarkins deleted the feat/2983-bundler-extract-update branch January 14, 2019 05:52

rarkins removed the review label Jan 14, 2019

renovate-bot added the released label Jan 14, 2019

rye mentioned this pull request Jan 16, 2019

Renovate: Enable experimental bundler support StoDevX/AAO-React-Native#3419

Merged

github-actions bot locked as resolved and limited conversation to collaborators Dec 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(bundler): extract, update, artifacts #3058

feat(bundler): extract, update, artifacts #3058

rarkins commented Jan 12, 2019 •

edited

rarkins commented Jan 12, 2019

bjeanes commented Jan 12, 2019

rarkins commented Jan 12, 2019

bjeanes commented Jan 12, 2019

rarkins commented Jan 12, 2019

rye commented Jan 12, 2019

rarkins commented Jan 12, 2019

rye commented Jan 12, 2019 •

edited

rarkins commented Jan 12, 2019

rye commented Jan 12, 2019 •

edited

rarkins commented Jan 12, 2019

rarkins commented Jan 13, 2019

bjeanes commented Jan 13, 2019 •

edited

rarkins commented Jan 13, 2019

rye commented Jan 13, 2019

rarkins commented Jan 13, 2019

rye commented Jan 13, 2019

renovate-bot commented Jan 14, 2019

bjeanes commented Jan 15, 2019

rarkins commented Jan 15, 2019

rarkins commented Jan 15, 2019

rarkins commented Jan 15, 2019

bjeanes commented Jan 15, 2019

rarkins commented Jan 15, 2019

bjeanes commented Jan 15, 2019

rye commented Jan 16, 2019

Rate Limits

rarkins commented Jan 16, 2019

feat(bundler): extract, update, artifacts #3058

feat(bundler): extract, update, artifacts #3058

Conversation

rarkins commented Jan 12, 2019 • edited

rarkins commented Jan 12, 2019

bjeanes commented Jan 12, 2019

rarkins commented Jan 12, 2019

bjeanes commented Jan 12, 2019

rarkins commented Jan 12, 2019

rye commented Jan 12, 2019

rarkins commented Jan 12, 2019

rye commented Jan 12, 2019 • edited

rarkins commented Jan 12, 2019

rye commented Jan 12, 2019 • edited

rarkins commented Jan 12, 2019

rarkins commented Jan 13, 2019

bjeanes commented Jan 13, 2019 • edited

rarkins commented Jan 13, 2019

rye commented Jan 13, 2019

rarkins commented Jan 13, 2019

rye commented Jan 13, 2019

renovate-bot commented Jan 14, 2019

bjeanes commented Jan 15, 2019

rarkins commented Jan 15, 2019

rarkins commented Jan 15, 2019

rarkins commented Jan 15, 2019

bjeanes commented Jan 15, 2019

rarkins commented Jan 15, 2019

bjeanes commented Jan 15, 2019

rye commented Jan 16, 2019

Rate Limits

rarkins commented Jan 16, 2019

rarkins commented Jan 12, 2019 •

edited

rye commented Jan 12, 2019 •

edited

rye commented Jan 12, 2019 •

edited

bjeanes commented Jan 13, 2019 •

edited