
Implement pep 503 Simple Repository API for deployment #25639

Closed
kohtala opened this issue Sep 4, 2019 · 52 comments
Labels
module: binaries Anything related to official binaries that we release to users oncall: releng In support of CI and Release Engineering triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Comments

@kohtala

kohtala commented Sep 4, 2019

🚀 Feature

Add files to https://download.pytorch.org/whl/ to implement PEP 503 for pip --extra-index-url and for pipenv Pipfile extra source urls.

Motivation

pipenv install using a package configuration like torch = {file = "https://download.pytorch.org/whl/cu100/torch-1.1.0-cp36-cp36m-linux_x86_64.whl"} fails with a hash mismatch. I guess the problem is that it downloads the hashes from the package releases on PyPI, and they do not match the whl of the same name served from download.pytorch.org. Pipenv promises vulnerability checking in addition to other nice new features for managing package configuration.

Using the Simple Repository API, you could just document one command that always installs the latest release.

This is related to issue #4793.

Pitch

Rename the packages for the different CUDA versions uniquely, e.g. torch-cu100. Add a Provides-Dist metadata header to indicate that the package provides torch, so it can satisfy dependencies on torch.
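
For illustration, the core metadata of such a wheel could then look roughly like this (a hypothetical sketch; Provides-Dist comes from the core metadata spec, and the exact name and version here are made up):

Metadata-Version: 2.1
Name: torch-cu100
Version: 1.1.0
Provides-Dist: torch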

Provide the Simple Repository API at https://download.pytorch.org/whl/, listing the different torch-cuNN, torchvision-cuNN, etc. packages.

For pip it should then work to use pip install --extra-index-url https://download.pytorch.org/whl/ torch-cu100 to install the latest version of torch for CUDA 10.0.

For pipenv it should work to use configuration like

[[source]]
name = "pytorch"
url = "https://download.pytorch.org/whl/"
verify_ssl = true

[packages]
torch = {version="*", index="pytorch"}

Alternatives

We could keep the old package names and provide several different Simple Repository APIs for the different CUDA versions.

However, I do not see much need for this, as it is possible to reduce the impact of the change by keeping the package names for the CUDA version currently on PyPI and only renaming the other CUDA version packages.

Additional context

I am not experienced in setting up Python repositories, so this requires some testing to make sure it actually works.

I think renaming the packages and using Provides-Dist would also make it possible to upload all CUDA versions to PyPI.

cc @ezyang @gchanan @zou3519 @bdhirsh @seemethere @malfet @walterddr

@ailzhang ailzhang added module: binaries Anything related to official binaries that we release to users triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module labels Sep 4, 2019
@ezyang
Contributor

ezyang commented Sep 4, 2019

When I wrote up #23656 I was not aware of Provides-Dist. It sounds like a good choice; better than local version identifiers, which are not actually very good for what we are using them for. @soumith, do you have any experience with this flag?

@soumith
Member

soumith commented Sep 4, 2019

Provides-Dist actually sounds great; I didn't know about the existence of this flag either. This overall proposal sounds nice.

@kohtala are you offering to take up this work, or should we take it up?

@kohtala
Author

kohtala commented Sep 7, 2019

Thanks for the compliments :-)
I am not familiar enough with the torch release process to know where to make the changes. It would need build and release changes as well as some documentation changes. Maybe set it up at https://download.pytorch.org/simple/ (à la https://pypi.org/simple/) instead of https://download.pytorch.org/whl/, so it can be tested there first and the documentation updated once it works.
If it is all familiar to you, I'm sure you'd be much more efficient. Besides, even though I love to contribute, I have some difficulty finding the time.

@cpbotha

cpbotha commented Mar 17, 2020

Hi all, until someone has time to fix this at the source, I hacked and slashed together (using Emacs of course) an index-url you can use for pytorch: https://vxlabs.com/pypi/

I am currently using this in my poetry pyproject.toml as an additional index-url and it works like a charm.

If you drill down, you'll see that it ends up pointing to all the whl packages hosted at pytorch; it just supplies the top-level indices according to PEP 503.
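
For anyone unfamiliar with PEP 503, the layout is just two levels of plain HTML pages, roughly like this (a minimal sketch; the real pages list every hosted wheel, here I only show the cu100 wheel mentioned earlier):

<!-- root index: one link per project name -->
<!DOCTYPE html>
<html><body>
<a href="torch/">torch</a>
<a href="torchvision/">torchvision</a>
</body></html>

<!-- torch/ sub-page: one link per downloadable file -->
<!DOCTYPE html>
<html><body>
<a href="https://download.pytorch.org/whl/cu100/torch-1.1.0-cp36-cp36m-linux_x86_64.whl">torch-1.1.0-cp36-cp36m-linux_x86_64.whl</a>
</body></html>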

@ezyang
Contributor

ezyang commented Mar 17, 2020

Thanks @cpbotha. BTW, @kohtala, @seemethere and I were looking at this recently, and we noticed that provides-dist has a very scary message on the docs saying most tools in the ecosystem don't use it. Do you have any experience using it for projects?

@kohtala
Author

kohtala commented Mar 18, 2020

Hi.

No, I have not used provides-dist.

The main improvement proposed in this issue was the simple repository index that @cpbotha created as a test. Provides-Dist was an idea on top of it, to take it one step further. If Provides-Dist does not work, there would then need to be different indices for the different CUDA versions. Different CUDA versions could not be in a single index (such as PyPI).

[[source]]
name = "pytorch"
url = "https://download.pytorch.org/whl/cu100/"
verify_ssl = true

[packages]
torch = {version="==1.1.0", index="pytorch"}

Somewhere https://download.pytorch.org/whl/cu100/torch_stable.html is already generated. It just needs to be split by package, modified for PEP 503, and offered as an index on the download server.

I tried with this Pipfile and it was able to lock and install. That solves at least the pipenv problem.

[[source]]
name = "pypi"
url = "https://pypi.org/simple"
verify_ssl = true

[[source]]
name = "pytorch"
url = "https://vxlabs.com/pypi/"
verify_ssl = true

[dev-packages]

[packages]
torch = {version="==1.1.0", index="pytorch"}
torchvision = {version="==0.3.0", index="pytorch"}
fastai = "==1.0.54"

[requires]
python_version = "3.7"

@kohtala
Author

kohtala commented Mar 18, 2020

I tried to see what happens with pip. Unfortunately it seems to treat packages with the same name and version as identical and always downloads the one from PyPI. pypa/pip#5045

@cpbotha

cpbotha commented Mar 18, 2020

Just to add to @kohtala 's comment above:

I was struggling yesterday with the CUDA and GPU versions.

According to PEP 440, "torch-1.4.0+cpu" is exactly the same version as "torch-1.4.0" and "torch-1.4.0+cu92", for example. So if you start by installing +cpu, it's hard to convince pip or pipenv that you want to go back to the cu101 packages, which are the ones without a local version suffix.
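
To illustrate with the packaging library (just a quick check of the PEP 440 behaviour, nothing pytorch-specific):

from packaging.specifiers import SpecifierSet
from packaging.version import Version

# A specifier without a local version label matches every local variant (PEP 440).
spec = SpecifierSet("==1.4.0")
print(Version("1.4.0+cpu") in spec)    # True
print(Version("1.4.0+cu92") in spec)   # True
print(Version("1.4.0+cu101") in spec)  # True

So once any of the variants is installed and satisfies the requirement, the resolver has no reason to prefer a different one.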

Anyways, I don't know about provides-dist support, but @kohtala 's suggestion to have separate indices for the different hardware configurations would work reliably for many people.

Hopefully I'll get some time soon to split https://vxlabs.com/pypi/ out into cpu, cu92 and cu101.

@cpbotha

cpbotha commented Mar 18, 2020

I tried to see what happens with pip. Unfortunately it seems to treat packages with the same name and version as identical and always downloads the one from PyPI. pypa/pip#5045

Besides the local version label (+blah) I mentioned above, I am able to install directly from my index with e.g. pip install --index-url=https://vxlabs.com/pypi/ torch==1.4.0.

@kohtala
Author

kohtala commented Mar 18, 2020

Besides the local version label (+blah) I mentioned above, I am able to install directly from my index with e.g. pip install --index-url=https://vxlabs.com/pypi/ torch==1.4.0.

In that install the only index is your index, so pip won't go to PyPI. If you need something from PyPI in the same install (-r requirements.txt), then, if we are to trust the pypa/pip#5045 issue, it'll install torch from PyPI.

Anyway, the Simple Repository API would be an improvement. Unfortunately not as great an improvement as one would hope.

@cpbotha

cpbotha commented Mar 18, 2020

Besides the local version label (+blah) I mentioned above, I am able to install directly from my index with e.g. pip install --index-url=https://vxlabs.com/pypi/ torch==1.4.0.

In that install the only index is your index, so pip won't go to PyPI. If you need something from PyPI in the same install (-r requirements.txt), then, if we are to trust the pypa/pip#5045 issue, it'll install torch from PyPI.

Anyway, the Simple Repository API would be an improvement. Unfortunately not as great an improvement as one would hope.

poetry does the right thing here. You can define any number of source indices. By default it will go extra-index -> extra-index -> pypi. However, you can change the order with source config settings in the pyproject.toml.

Also, you can specify per-dependency which source should be used if you don't like the global precedence in that specific case.

Whatever the case may be, the most practical option at this stage would be for pytorch to publish official simple indices, one for each hardware config (cu92, cu100, cu101, cpu), with each containing references to all relevant packages (only torch and torchvision differ between HW configs; there are also torchtext and torchaudio).

@soumith and @ezyang -- if you could perhaps post pointers to where an intrepid contributor can start looking to code up the necessary scripts to do this as part of your processes, maybe an intrepid contributor will try. (it might be me)

@ezyang
Contributor

ezyang commented Mar 20, 2020

Thanks for offering. Here's the script that we use to create the index: https://github.com/pytorch/builder/blob/master/cron/update_s3_htmls.sh

It... kind of looks like we might be making simple indices already. So is the request to drop the local version specifier as well from the versions in that case?

@cpbotha

cpbotha commented Mar 20, 2020

Thanks for offering. Here's the script that we use to create the index: https://github.com/pytorch/builder/blob/master/cron/update_s3_htmls.sh

It... kind of looks like we might be making simple indices already. So is the request to drop the local version specifier as well from the versions in that case?

I had a quick look at the script; it looks like it's almost there (but I did not look long enough :).

Ideally, we end up with a set of simple indices, one per hardware configuration (cpu, cu92, cu101, etc.).

In each case, the top-level index only links the package names: torch, torchvision, torchaudio, torchtext, etc. -- each of those links points to another HTML page listing the full set of wheel files for that hardware configuration.
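
For what it's worth, here is a rough sketch of what the generation step could look like (my own illustration, not the actual builder script; it assumes a local directory of wheels per hardware config and writes the two-level PEP 503 layout into a subdirectory next to them):

import re
from pathlib import Path

def normalize(name):
    # PEP 503 project name normalization.
    return re.sub(r"[-_.]+", "-", name).lower()

def build_index(wheel_dir, out_dir):
    # Group wheel files by project name (first dash-separated component of the filename).
    projects = {}
    for whl in sorted(Path(wheel_dir).glob("*.whl")):
        projects.setdefault(normalize(whl.name.split("-")[0]), []).append(whl.name)

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    # Root page: one link per project.
    root_links = "".join(f'<a href="{p}/">{p}</a><br>\n' for p in sorted(projects))
    (out / "index.html").write_text(f"<!DOCTYPE html><html><body>\n{root_links}</body></html>\n")
    # One sub-page per project: one link per wheel file.
    # The relative href assumes the wheels sit two directories above each project page.
    for project, files in projects.items():
        links = "".join(f'<a href="../../{f}">{f}</a><br>\n' for f in files)
        sub = out / project
        sub.mkdir(exist_ok=True)
        (sub / "index.html").write_text(f"<!DOCTYPE html><html><body>\n{links}</body></html>\n")

# e.g. build_index("whl/cu101", "whl/cu101/simple")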

In this case, the +cu92 local version label can stay. The user can control their hardware choice by just selecting the correct index.

If I find the time in the coming days to experiment with the script, I'll let you know! If anyone else picks this up, I'm not going to stop you. :) (just please leave a comment here if you're going to try so we don't do double effort)

@ezyang
Contributor

ezyang commented Mar 20, 2020

I'm not aware of anyone touching the S3 indexes right now. You'll have to hack it up since you don't have access to the bucket, but I'm more than happy to help you deploy the update if you have a proposed change. Note that if we change the URLs in a BC-breaking way, that is going to be a lot more work!

@kousu

kousu commented Aug 15, 2020

@kohtala you can instead use --find-links:

pip install --find-links https://download.pytorch.org/whl/cpu/torch_stable.html torch

You can make this more permanent by adding it to a requirements.txt, e.g. #26340 (comment) or https://github.com/neuropoly/spinalcordtoolbox/blob/b64cad3c846fd6bd7a557688b67b80fe0b2c6dc2/requirements.txt#L26-L30

numpy==1.17.2
pandas==0.25.2
-f https://download.pytorch.org/whl/torch_stable.html
torch==1.3.1+cpu

and then pip install -r requirements.txt will do the right thing.

This doesn't seem to be compatible with making proper packages, though, as far as I can tell, because for a proper package you need to specify all your dependencies in setup.py and not in requirements.txt. So if you depend on pytorch you can't publish your package to PyPI. You'll have to, I guess, make users install from source, or maybe pip install -r https://code.example.com/you/yourpackage/requirements.txt? I'm not really sure. If #26340 happened, this would be a non-issue.

@kohtala
Author

kohtala commented Aug 19, 2020

Thanks @kousu.

Since creating this issue I moved from Pipenv to using pip-tools with a requirements.in file that just says e.g. torch @ https://download.pytorch.org/whl/cu100/torch-1.1.0-cp37-cp37m-linux_x86_64.whl. That works and does not impose on us what the Pipenv developers think is right.
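
For anyone wanting to reproduce that setup, the flow is roughly (a sketch; the wheel URL is the CUDA 10.0 / Python 3.7 one from above, adjust for your platform):

# requirements.in
torch @ https://download.pytorch.org/whl/cu100/torch-1.1.0-cp37-cp37m-linux_x86_64.whl

# then
pip-compile requirements.in    # pins everything into requirements.txt
pip-sync requirements.txt      # installs exactly the pinned set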

But in #26340 there is a nice idea of splitting the CUDA support into a package extra. Each CUDA version could have a separate extra. There would still be a need for Provides-Dist support in pip, so you could have a dependency like "I want any HW acceleration, but don't care which it is" and any CUDA (or AMD, whatever) version could satisfy it.

@caniko

caniko commented Nov 13, 2020

I really need this feature for my Python packages. I am willing to help; what is holding us back?

@ezyang
Contributor

ezyang commented Nov 13, 2020

bumping priority based on user activity

@jkyl

jkyl commented Nov 14, 2020

With a recent version of Poetry (not sure which, sorry, but the latest should do), environment markers work to the extent that I have included the following in my pyproject.toml:

torch = [
    { version = "1.6.0", markers = "sys_platform != 'win32'" },
    { url = "https://download.pytorch.org/whl/cu102/torch-1.6.0-cp36-cp36m-win_amd64.whl", markers = "python_version ~= '3.6' and sys_platform == 'win32'" },
    { url = "https://download.pytorch.org/whl/cu102/torch-1.6.0-cp37-cp37m-win_amd64.whl", markers = "python_version ~= '3.7' and sys_platform == 'win32'" },
    { url = "https://download.pytorch.org/whl/cu102/torch-1.6.0-cp38-cp38-win_amd64.whl", markers = "python_version ~= '3.8' and sys_platform == 'win32'" }
]
torchvision = [
    { version = "0.7.0", markers = "sys_platform != 'win32'" },
    { url = "https://download.pytorch.org/whl/cu102/torchvision-0.7.0-cp36-cp36m-win_amd64.whl", markers = "python_version ~= '3.6' and sys_platform == 'win32'" },
    { url = "https://download.pytorch.org/whl/cu102/torchvision-0.7.0-cp37-cp37m-win_amd64.whl", markers = "python_version ~= '3.7' and sys_platform == 'win32'" },
    { url = "https://download.pytorch.org/whl/cu102/torchvision-0.7.0-cp38-cp38-win_amd64.whl", markers = "python_version ~= '3.8' and sys_platform == 'win32'" }
]

as a workaround.

@jkyl

jkyl commented Nov 14, 2020

Which is not at all to detract from the priority of this issue! My workaround sucks and pytorch should definitely also do a simple index!

@malfet malfet self-assigned this Nov 16, 2020
@rgommers
Collaborator

rgommers commented Jun 3, 2021

For organizing the wheels, #25639 (comment) (subdirs like cpu/, cu111/, etc.) seems like a good suggestion.

The Poetry issue isn't really actionable; https://eternalphane.github.io/pytorch-pypi is just collecting all wheels in a single index, and there's no way for Poetry (or Pip, or any wheel-based tool) to do anything reasonable based only on file names with +cu111, +rocm4.0.1. Everything after + is just an opaque local identifier (see PEP 440) - it can be used for ordering, but not for selecting the desired hardware.

@malfet malfet assigned malfet and unassigned seemethere Jun 15, 2021
@malfet
Contributor

malfet commented Jun 15, 2021

Likely done by pytorch/builder@71a2b9a

@malfet
Contributor

malfet commented Jun 15, 2021

pip install --extra-index-url https://download.pytorch.org/whl/cpu/ torch should install PyTorch on CPU and
pip install --extra-index-url https://download.pytorch.org/whl/cu111/ torch should install PyTorch with CUDA-11.1 support
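
For a requirements file, the equivalent would be something like (a sketch; pick the index that matches your hardware):

--extra-index-url https://download.pytorch.org/whl/cu111/
torch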

@kohtala
Author

kohtala commented Jun 17, 2021

Seems to work. Thanks!

I found some discussion at pypa/pip#8606 about the rules for selecting packages between indices. When I tried the commands, they seemed to select the one that I wanted, but I still did not find the detailed rules to understand why. Apparently pip chooses the best match by highest version and matching tags, and only chooses between indices when they serve the same file. The file names are not the same here, and the one at pytorch.org seems to be treated by pip 21.1.2 as the better match.

@cgarciae

The issue was closed, but currently only adding https://eternalphane.github.io/pytorch-pypi/ as a source works with poetry without having to use the exact URL for the wheel (which is not very user friendly).

@rgommers
Collaborator

adding https://eternalphane.github.io/pytorch-pypi/ as a source works with poetry without having to use the exact URL for the wheel (which is not very user friendly).

Does that actually work for different CUDA/ROCm versions? If so, it'd be great to see an explanation of how the correct wheel is selected.

@cgarciae

cgarciae commented Oct 19, 2021

Hey @rgommers! No, for poetry users it's just a bit more convenient than the whole URL string, but you still have to select an exact version + hardware tag, e.g. 1.9.1+cpu.

I think my issue is that searching for the correct source is not trivial and there are a lot of abandoned repos. If possible, pytorch could provide an official service doing what https://eternalphane.github.io/pytorch-pypi/ does, to make this easier.

Edit: Or point towards https://eternalphane.github.io/pytorch-pypi/ in the installation docs if it's a trusted source.

@rgommers
Collaborator

you still have to select an exact version + hardware tag e.g. 1.9.1+cpu.

That's what I thought. I think that's worse than separate directories - if you have separate directories you can use normal version constraints like "torch >= 1.9.0, <1.10.0", while if everything is in a single dir you can't. So it'd be better to just improve the docs to make it easier to find the right URLs.
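
For example, with the per-configuration directories a normal range constraint works directly (illustration only, using the index URLs from the comment above):

pip install --extra-index-url https://download.pytorch.org/whl/cu111/ "torch>=1.9.0,<1.10.0"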

@cgarciae

Agreed.

@vikigenius
Contributor

@cgarciae I tried to use https://eternalphane.github.io/pytorch-pypi/ with poetry:

[[tool.poetry.source]]
name = "torch"
url = "https://eternalphane.github.io/pytorch-pypi/"

[[tool.poetry.source]]
name = "torchvision"
url = "https://eternalphane.github.io/pytorch-pypi/"

However, trying to install torchvision and torch together causes a failure like this:

  Because torchvision (0.11.1+cu113) depends on torch (1.10.0)
   and siamenc depends on torch (1.10.0+cu113), torchvision is forbidden.
  So, because siamenc depends on torchvision (0.11.1+cu113), version solving failed.

Is there any way I can install both torch and torchvision together properly?

@rgommers
Collaborator

siamenc is not on PyPI under that name, so not sure what it is. But this is the problem with a dependency like 1.10.0+cu113. The right way to do this is to depend on 1.10.0 (without +cu113) and point to https://download.pytorch.org/whl/cu113 as the index. Poetry should be able to do this, when given the correct index.
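
Something along these lines might work in pyproject.toml (untested - I don't use Poetry myself - and the source options differ between Poetry versions; older releases use secondary = true, newer ones use a priority setting):

[[tool.poetry.source]]
name = "pytorch-cu113"
url = "https://download.pytorch.org/whl/cu113"
secondary = true  # keep PyPI as the primary index for everything else

[tool.poetry.dependencies]
torch = { version = "1.10.0", source = "pytorch-cu113" }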

@vikigenius
Contributor

vikigenius commented Dec 11, 2021

@rgommers siamenc is just the name of my project. Poetry, I think, queries the custom repository for every package by default, and it thus throws errors like this if I add the repo like this:

[[tool.poetry.source]]
name = "torch"
url = "https://download.pytorch.org/whl/cu113"

Here is the error.

  RepositoryError

  403 Client Error: Forbidden for url: https://download.pytorch.org/whl/cu113/python-lsp-server/

  at ~/.local/share/pypoetry/venv/lib/python3.9/site-packages/poetry/repositories/legacy_repository.py:393 in _get
      389│             if response.status_code == 404:
      390│                 return
      391│             response.raise_for_status()
      392│         except requests.HTTPError as e:
    → 393│             raise RepositoryError(e)
      394│
      395│         if response.status_code in (401, 403):
      396│             self._log(
      397│                 "Authorization error accessing {url}".format(url=response.url),

I would appreciate it if you can provide me a working example of how to do it with poetry.

@rgommers
Collaborator

I don't use Poetry, so unfortunately I can't provide an example.

403 Client Error: Forbidden for url: https://download.pytorch.org/whl/cu113/python-lsp-server/

It should be using PyPI for python-lsp-server, not download.pytorch.org. You'll need to make sure the custom repository applies just to torch, not to anything else.

@Jerry2001Qu

Looks like the issue with Poetry not using PyPI should be solvable after this PR: python-poetry/poetry#908

But it is broken here: python-poetry/poetry#3855

I currently can't figure out a workaround.

@Jerry2001Qu

Downgrading to Poetry 1.0.10 might be a workaround (on top of my previous comment) as per: python-poetry/poetry#4704 (comment)

Haven't tested it because it's too much of a pain; switching to pip!

@Arcitec

Arcitec commented Feb 19, 2022

If anyone needs the correct solution for installing PyTorch via Pipenv, I have posted a guide and explanation here:

pypa/pipenv#4961 (comment)

It would be cool if the official pytorch website could list those install commands (the ones I've generated) as an option in the "roll your own selections" guide, i.e. having Pipenv as a choice next to Pip, and then showing the command style I'm using in my guide. Then projects based on Pipenv won't have to manually look up the latest versions in the repo HTML in a browser anymore.

@stephanbertl

Any update on this?

We have an internal Sonatype Nexus repository. It only supports PEP 503. It's impossible to proxy the pytorch repository with the current format.

@ezyang
Contributor

ezyang commented May 15, 2023

Please go ahead and file a new issue for these problems
