-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
explore stargazer limit challenge (40k+) #76
Comments
The header of the response to /stargazers shows the "last" page, and beyond that there's no hope I think:
One could think that this is a great reason to run github-repo-stats before reaching the 40000 stargazers, so that one can build the full timeseries over time via incremental updates, always getting newer data while the old stargazers move out of the visible time window. However, seemingly one has to stick with seeing the first 40000 stargazers, newer ones are always in the blind spot. This is a super quick analysis, maybe I have missed something here. |
I’ve pinged Swifty to take a look here. The proactivity is much appreciated |
Oh hey @jgehrcke thank you. Is there a way then we can use the summary stat (total stargazers) and build a history from now? |
Thanks for the kind words @ntindle @Swiftyos!
Yes! Example:
I will look into adding this to github-repo-stats. This is actually kind of good news -- another good reason for running github-repo-stats periodically :). Btw, another quote I found on the limitation being baked into the GitHub API(s):
|
Just saw https://github.com/Significant-Gravitas/Auto-GPT/ starting to use github-repo-stats. They have ~150k stargazers. We can extract 40000:
This seems to be a known limitation of the API, delivering only 400 pages:
https://stackoverflow.com/questions/68910259/fetch-all-stargazers-over-time-of-a-repository
Strongly related, potentially offering a solution: https://observablehq.com/@observablehq/github-stargazer-history
@Swiftyos I hope you get this notification; we can look into extracting the 'correct' number of stargazers in your special case there. The "many stargazer challenge" has been deliberately un-addressed by me and there are obvious ideas for improvement so that the larger chunk of the stargazer timeseries does not need to be re-fetched every single time the action runs. Also see
github-repo-stats/fetch.py
Line 370 in e999cc2
@Swiftyos I saw you picked a 90 minute interval for running the action -- that is a little often for no obvious benefit! Once per day should really be good enough. Do you have any specific concerns you try to address with the 90 minute interval?
The text was updated successfully, but these errors were encountered: