S3 support #326

gmaze · 2024-01-19T14:27:14Z

The Argo ADMT is experimenting with Amazon S3 in order to move the GDAC infrastructure into the cloud.
To prepare argopy for this, and to be able to access and test the AWS prototype server, we need to develop support for S3.
This requires:

New file store to support S3 with fsspec, based on s3fs
Update Index store to support S3

A new data fetcher will be developed in another PR.
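
As a rough illustration of the file store item above, the sketch below wraps an fsspec S3 filesystem. The names `S3StoreSketch` and `split_s3_uri` are hypothetical and are not argopy's API; the sketch assumes `fsspec` and `s3fs` are installed and that the bucket allows anonymous reads.

```python
from urllib.parse import urlparse


def split_s3_uri(uri):
    """Split 's3://bucket/key' into (bucket, key). Hypothetical helper."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"Not an S3 URI: {uri!r}")
    return parsed.netloc, parsed.path.lstrip("/")


class S3StoreSketch:
    """Hypothetical minimal file store; not argopy's implementation."""

    def __init__(self, anon=True):
        self.anon = anon
        self._fs = None  # created lazily, on first access to `fs`

    @property
    def fs(self):
        if self._fs is None:
            import fsspec  # the "s3" protocol is provided by the s3fs package
            self._fs = fsspec.filesystem("s3", anon=self.anon)
        return self._fs

    def exists(self, uri):
        bucket, key = split_s3_uri(uri)
        return self.fs.exists(f"{bucket}/{key}")

    def open(self, uri, mode="rb"):
        bucket, key = split_s3_uri(uri)
        return self.fs.open(f"{bucket}/{key}", mode)
```

Only `exists` and `open` touch the network; building the store and splitting a URI work offline, which keeps such a class easy to unit test.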

gmaze · 2024-04-22T13:58:25Z

@tcarval is there any reason for not having the gz index files on S3?
https://argo-gdac-sandbox.s3.eu-west-3.amazonaws.com/pub/index.html#pub/idx/

tcarval · 2024-05-06T16:12:34Z

> @tcarval is there any reason for not having the gz index files on S3? https://argo-gdac-sandbox.s3.eu-west-3.amazonaws.com/pub/index.html#pub/idx/

I am adding the gz indexes (the GDAC to AWS synchronization is underway).

gmaze · 2024-05-17T09:56:01Z

New IndexStore ready to work with AWS S3 core index file

from argopy import ArgoIndex
idx = ArgoIndex(host='s3://argo-gdac-sandbox/pub/idx').load()
idx.search_wmo_cyc(6903091, 1)

poke @tcarval

… on s3
- refactor argo_index_pa and argo_index_pd to use argostore super init
- index_path is now a dynamic property
- index_path set to use the gz file when found on the server at instantiation
- index property not set when used with the s3 store
- new search_s3 decorator for some search methods (to be used with s3)
- more Path usage
- minor reformatting
- fix bug in s3index to return an appropriate empty pyarrow table if no SQL response is found
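
The "use the gz file when found on the server" behaviour described in this commit message could be sketched as follows. The function `pick_index_path` and its `exists` argument are hypothetical, chosen for illustration only; argopy's actual logic may differ.

```python
def pick_index_path(root, index_file, exists):
    """Return the gzipped index path if the server has it, else the plain one.

    `exists` is any callable taking a path and returning True or False,
    for instance the `exists` method of an fsspec filesystem.
    """
    plain = f"{root.rstrip('/')}/{index_file}"
    gz = plain + ".gz"
    return gz if exists(gz) else plain
```

Passing the existence check in as a callable keeps the choice independent of the store, so the same function works for a local, HTTP or S3 backend.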

gmaze added 5 commits October 13, 2023 13:17

New stores.filesystems.s3store

f087ab8

Merge branch 'master' into gdac-amazons3

9a260a6

Merge branch 'master' into gdac-amazons3

74ce4de

s3 store

1f00edc

Update filesystems.py

092e0d3

gmaze self-assigned this Apr 15, 2024

gmaze added the enhancement, backends and performance labels Apr 15, 2024

Merge branch 'master' into gdac-amazons3

4a66958

more s3 support

1a233cd

gmaze added 12 commits May 15, 2024 13:10

Merge branch 'master' into gdac-amazons3

c526bf1

An index store using data from aws s3

0f85a28

Update test_stores_index.py

90f9b70

Create argo_index_proto_s3.py

8b65da2

- a prototype for the record, and to benchmark different designs

Update pytests.yml

97b6884

fix access to AWS creds

Update pytests.yml

100042d

fix anonymous AWS s3 calls

8ed5016

Update pytests.yml

9c42352

Use anonymous s3 store only when AWS credentials cannot be located

2fcb044

s3store docstring

34fe475

- and new checker for AWS credentials

Add boto3 dep

b6425d3

Update env files [skip-ci]

3adaa44
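
The commit "Use anonymous s3 store only when AWS credentials cannot be located" in the group above suggests a fallback along the lines of the sketch below. The function names are hypothetical and this is not the code from the PR; it assumes `boto3` is installed and treats a missing `boto3` as "no credentials".

```python
def aws_credentials_found():
    """Return True if boto3 can locate AWS credentials, False otherwise."""
    try:
        import boto3
    except ImportError:
        return False  # assumption: without boto3 we fall back to anonymous access
    return boto3.Session().get_credentials() is not None


def s3_storage_options(credentials_found=None):
    """Build keyword arguments for an s3fs filesystem (hypothetical helper)."""
    if credentials_found is None:
        credentials_found = aws_credentials_found()
    # Anonymous access is used only when no credentials were located.
    return {"anon": not credentials_found}
```

The returned dictionary could then be passed on, for example as `fsspec.filesystem("s3", **s3_storage_options())`.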

gmaze requested review from quai20 May 17, 2024 09:56

gmaze added 2 commits May 17, 2024 14:47

Update doc

6243557

add s3fs & boto3 to RTD env

378d888

gmaze added 15 commits May 17, 2024 15:02

Update checkers.py

bbc9728

minor reformatting

e9f4756

Add decorator to requirements

f6545b1

Add "decorator" to CI envs

582805c

pin decorator version

0413969

Update argo_index_proto.py

6250b9d

- add keywords/shortcuts for hosts

Update argo_index_proto_s3.py

1b9aad2

- add a decorator to fix errors raised when pyarrow is not available

Update whats-new.rst

b7b93d0

Update argo_index_proto_s3.py

8593139

fix bug for unknown AWS credentials with boto3 client

More docs

1c83c92

remove s3 stuff from log

fc209af

more doc

35df569

Update argo_index_proto_s3.py

c8e8d21

Update argo_index_proto_s3.py

d320300
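
One of the commits in the group above adds a decorator to handle errors raised when pyarrow is not available. A minimal sketch of that idea, using only the standard library, is given below. The name `requires_module` is hypothetical, and the PR itself may rely on the third-party `decorator` package rather than `functools`.

```python
import functools
import importlib.util


def requires_module(name):
    """Raise a clear ImportError if the top-level module `name` is missing."""

    def wrapper(func):
        @functools.wraps(func)
        def inner(*args, **kwargs):
            # The check runs at call time, so merely defining the function is safe.
            if importlib.util.find_spec(name) is None:
                raise ImportError(
                    f"'{func.__name__}' requires the optional dependency '{name}'"
                )
            return func(*args, **kwargs)

        return inner

    return wrapper


@requires_module("pyarrow")
def pyarrow_version():
    import pyarrow

    return pyarrow.__version__
```

Checking at call time rather than at import time lets the rest of the module load without pyarrow, and only the methods that need it fail, with an explicit message.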
