Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

percolate biosample attributes to sample table in Terra #532

Open
dpark01 opened this issue Apr 5, 2024 · 0 comments
Open

percolate biosample attributes to sample table in Terra #532

dpark01 opened this issue Apr 5, 2024 · 0 comments
Assignees

Comments

@dpark01
Copy link
Member

dpark01 commented Apr 5, 2024

It's getting to be quite limiting that we can't easily access the biosample metadata from the sample or assembly tables in Terra using the current data model that our WDLs create. We should pursue one of the following solutions (or something like it)

  1. demux_deplete populates the sample table with columns from biosample_attributes_tsv
  2. demux_deplete populates the sample table with a json object containing all the biosample attributes from only the relevant row of the tsv corresponding to this sample
  3. demux_deplete emits a tsv output file that is a slightly transformed version of the biosample_attributes_tsv, the main difference being that it contains one more column that corresponds to the sample_id of the sample table (ie, the "sanitized" sample name with dashes and underscores and removing any slashes or spaces from the real/original sample name) -- currently the original biosample_attributes_tsv only has the unsanitized / external facing sample id. Then the user can simply use terra_tsv_to_table to update the sample table themselves (this would require updating terra_tsv_to_table to accept arbitrary columns as the index column by rewriting the column header with the requisite entity: stuff on the fly)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants