GCP subnet and separate buckets #2048
This is related to work in the griffith-lab/cloud-workflows repo. The intention behind splitting buckets is to apply separate lifecycle rules, so that input files stick around but execution files are deleted after 30 days.
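For illustration, a 30-day expiry like the one described above could be expressed as a Google Cloud Storage lifecycle configuration along these lines. This is a minimal sketch, not the configuration actually used in cloud-workflows; it assumes the rule is applied to whichever bucket holds execution files.

```json
{
  "lifecycle": {
    "rule": [
      {
        "action": { "type": "Delete" },
        "condition": { "age": 30 }
      }
    ]
  }
}
```

Saved to a file (the name `lifecycle.json` and the bucket placeholder here are hypothetical), a configuration of this shape can be applied with `gsutil lifecycle set lifecycle.json gs://<executions-bucket>`. The `age` condition is measured in days since object creation.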
…parate-buckets
I did a test run with somatic_exome that went all the way through as expected.

Remove separate buckets. @chrisamiller and I talked a bit and decided on a single bucket with an expiry rule. Example data and large static files (e.g. vep_cache) are handled separately.

    my $contents = <<"EOCONFIG"
    {
    "final_workflow_log_dir": "gs://$cromwell_gcp_bucket/build.$build_id/logs",
    "final_workflow_log_dir": "gs://$cromwell_gcp_bucket/build.$build_id"/logs,

is this quotation mark supposed to be inside the slash or at the end?
This quote is haunting me. I have no idea why it keeps changing back but this will be the fourth time I've fixed it?
Ugh, it was just that my last push didn't go through
+1 🚢
Two changes to reflect different infrastructure for GCP runs
Separate buckets for input and execution files. Also handled by initial setup. Different buckets have different lifecycle rules, so execution files that are no longer needed can be expired without losing inputs, which may be reused regularly. Output files are expected to be pulled to local storage, not persisted between buckets.