Proposal: allow assigning outputs to more than one collection. #228
Comments
I understand the need and desire to produce items that belong to multiple collections; however, I am uncertain how this would work in practice. Maybe you can help me understand?

Specifically, in your example, a Landsat 8 item matches both collections, but STAC items can have only one collection defined, correct? If so, does this mean we need to duplicate this item in the workflow output for each additional matched collection? Or does one of these collections take precedence for the workflow output (first matched?) and we only consider multiple collection matches when publishing (so we publish more items than are in the workflow output)?

I don't love either of these solutions. The former seems preferable for some reasons, but in either case you end up with multiple items representing the same (meta)data, and that seems problematic from both a data consistency and catalog management perspective.

Does this situation potentially point at a gap in the STAC spec, that supporting only a single collection per item is too limiting? Should the spec allow items to have multiple collections? Or do we need a new concept to cover such "item aliases"? I think we've discussed the idea of dynamic collections before. Would that perhaps be an idea here?
The intention is that the item would be published separately to each collection, so yes, it would be duplicated.
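As a rough illustration of what "published separately to each collection" could mean, here is a minimal sketch. The helper name `duplicate_per_collection` and the sample item are hypothetical and not part of this project's code; the only assumption taken from the thread is that a STAC item carries a single `collection` value.

```python
import copy


def duplicate_per_collection(item, collection_ids):
    """Return one independent copy of `item` per collection id.

    Hypothetical helper for illustration only: each copy gets its own
    `collection` field, and the input item is left unmodified.
    """
    copies = []
    for collection_id in collection_ids:
        dup = copy.deepcopy(item)  # deep copy so nested properties are not shared
        dup["collection"] = collection_id
        copies.append(dup)
    return copies


# Made-up item, trimmed to the fields needed for the example
item = {
    "type": "Feature",
    "id": "LC08_example",
    "properties": {"platform": "landsat-8"},
}

published = duplicate_per_collection(item, ["landsat-8", "landsat-8-legacy"])
```

Each copy shares the same `id` and metadata and differs only in `collection`, which is exactly the duplication the data consistency concern above is about.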
If the item is to be duplicated, it seems like some metadata should be injected into the one destined for the

Side note: until #226 merges, the last matching collection is the one assigned to the item. After, it will be the first match.
Currently, the `upload_options.collections` dictionary will assign the collection that has the first match. For example, a typical `collections` dict might be:

However, we have a case where we want to publish an item to multiple collections. It's likely not a common case; however, explicitly allowing multiple matches also requires a payload writer to be more exact in how they write the JSONPath expressions.
In this case we may have:
So that Landsat-8 items would be put in both the "landsat-8" and "landsat-8-legacy" collections.
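To make the difference between first-match and all-match assignment concrete, here is a minimal sketch. It is not this project's implementation: the function `matching_collections` is hypothetical, the mapping from collection name to rule is an assumption about the shape of the `collections` dict, and plain Python callables stand in for the JSONPath expressions so the example runs with the standard library only.

```python
def matching_collections(item, collections):
    """Return the names of all collections whose rule accepts `item`.

    `collections` maps a collection name to a predicate. The predicates are
    a stand-in for JSONPath expressions (an assumption for this sketch).
    Results follow the dict's insertion order.
    """
    return [name for name, rule in collections.items() if rule(item)]


def is_landsat_8(item):
    # Stand-in for a JSONPath filter on the item's platform property
    return item["properties"].get("platform") == "landsat-8"


def is_sentinel_2(item):
    return str(item["properties"].get("platform", "")).startswith("sentinel-2")


# Hypothetical rules; two collections deliberately share the same rule
collections = {
    "landsat-8": is_landsat_8,
    "landsat-8-legacy": is_landsat_8,
    "sentinel-2": is_sentinel_2,
}

item = {"id": "LC08_example", "properties": {"platform": "landsat-8"}}

all_matches = matching_collections(item, collections)
first_match = all_matches[0] if all_matches else None
```

With all-match semantics, any rule that is broader than intended silently publishes the item to an extra collection, which is why the expressions would need to be written more precisely.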