Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add utilites to update Echo filters and reference genotypes #8867

Open
wants to merge 8 commits into
base: EchoCallset
Choose a base branch
from

Conversation

chrisvittal
Copy link

No description provided.

Copy link
Collaborator

@mcovarr mcovarr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, though this should target the EchoCallset branch rather than ah_var_store.

site_filtering_data : list[str]
Paths to site filtering files.
vets_filtering_data : list[str]
Paths to VETS/VQSR filtering files.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ploidy_data is missing from these docs

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks for catching this!

Comment on lines 87 to 89
hl.vds.VariantDataset(
reference_data=reference_data, variant_data=variant_data
).write(output_path, overwrite=True)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. The overwrite=True makes me nervous, though my teammates may overrule me on that 😅. This could also be a (defaulted) function parameter.
  2. It might be useful to return the VariantDataset object so that callers can introspect after these transformations.
Suggested change
hl.vds.VariantDataset(
reference_data=reference_data, variant_data=variant_data
).write(output_path, overwrite=True)
vds = hl.vds.VariantDataset(reference_data=reference_data, variant_data=variant_data)
vds.write(output_path)
return vds

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sure, i can do this.

avros : Sequence[str]
List of paths for raw site filtering data
site_path : str
Path to site filters table where, if a hail table exists, it will be read, unless ``force`` is true
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This update is backwards. The force parameter will always recreate the table.

@chrisvittal chrisvittal changed the base branch from ah_var_store to EchoCallset June 15, 2024 05:50
avros : Sequence[str]
List of paths for raw site filtering data
site_path : str
Path to site filters table where, if a hail table exists, it will be read, unless ``force`` is true
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This update is backwards. The force parameter will always recreate the table.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants