Skip to content
This repository has been archived by the owner on Jun 29, 2022. It is now read-only.

changed method of loading npm packages #4

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

MauriceButler
Copy link
Member

Change the load-npm-data functionality to used some end-points directly as the previous method was getting incorrect / truncated data because of the growth of the registry as per #3

Using an enpoint to get just package ids as per npm/npm-registry-couchapp#162

Then loading just the latest data from each module to get correct and up to date maintainers.

Also skipping the adding of repos for a bunch of unprocessable packages thus massively reducing the time to process the github requests.

Hopefully once deployed and results uploaded to https://github.com/polyhack/npm-github-data issue #3 should be resolved.

@@ -17,6 +17,7 @@ const fs = require('fs')
"allPackagesOutput" : "/path/to/allpackages.json"
, "repositoriesOutput" : "/path/to/repositories.json"
, "githubOutput" : "/path/to/githubusers.json"
, "aussieOutput" : "/path/to/aussieOutput.json"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is this correct? I don't think this is needed here

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I needed to as is used on line 87

@rvagg
Copy link
Member

rvagg commented Dec 18, 2015

good work, (I hope!), some style issues, I stopped commenting cause I think you get the idea re spaces and semis

@MauriceButler
Copy link
Member Author

Yeah sorry is very foreign to my style so I struggled a bit will try to clean up now.

@MauriceButler
Copy link
Member Author

@rvagg hopefully that is all good.

@rvagg
Copy link
Member

rvagg commented Dec 18, 2015

I'm going to have to ponder this one, one request per package is going to be a pain in the backside.

@MauriceButler
Copy link
Member Author

For any particular reason?

Not sure there is currently an api to do what we want in less requests that handles the current repository size.

@rvagg
Copy link
Member

rvagg commented Dec 18, 2015

well, one alternative is to use an npm mirror, I have one in my house with each package as a json file on disk, I could use that, I just need to think through whether I want to rely on it!

@MauriceButler
Copy link
Member Author

Ok. Let me know what you think.

How much space does a mirror need?

I could potentially host one too.

There would be a lot more data in there than we need for this system but maybe other uses.

@franciskim
Copy link

Happy to help host a mirror on my AWS if it's relatively easy

@MauriceButler
Copy link
Member Author

ping @rvagg have you had a chance to have a think about this approach as yet?

@MauriceButler
Copy link
Member Author

hey @rvagg just another gentle ping as I noticed you are online :)

@franciskim
Copy link

It would be great if #3 was fixed as I imagine there are a whole heap of developers not being listed on the site.

@SomeoneWeird
Copy link
Member

@MauriceButler I'm keen to get this in, are we still sure it works?

@MauriceButler
Copy link
Member Author

I am not sure. Has been a long time. I'll try to get time to run it up tonight and make sure it still works. If not at least in the next couple days.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants