Batch records cannot be generated from resumed checkpoints when `removes_subdirs=True`

`_record_survival` needs to iterate through all job records, but job records from the generations before the checkpoint would have been removed if `removes_subdirs=True`, by `_record_survival` itself:

https://github.com/airallergy/sober/blob/ab5201dcdb0c27fb07044f1dc44410df7188245e/sober/_evolver.py#L104-L105

This is an issue when the checkpoint is from a completed run, where `_record_survival` is executed and batch records are generated. A typical occurrence is on HPC with a request limit on resources.

Two possible remedies:

- Comment out this two lines to leave job records untouched
- Somehow detect from which generation the optimisation resumes, only iterate the newly generated job records, and reuse the previous batch records.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch records cannot be generated from resumed checkpoints when `removes_subdirs=True` #5

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

	if self._output_manager._removes_subdirs:
	shutil.rmtree(batch_dir)

Batch records cannot be generated from resumed checkpoints when removes_subdirs=True #5

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

Batch records cannot be generated from resumed checkpoints when `removes_subdirs=True` #5