Deadpool is a process pool that is really hard to kill.
Deadpool is an implementation of the Executor interface
in the concurrent.futures standard library. Deadpool is
a process pool executor, quite similar to the stdlib's
ProcessPoolExecutor.
This document assumes that you are familiar with the stdlib
ProcessPoolExecutor. If you are not, it is important
to understand that Deadpool makes very specific tradeoffs that
can result in quite different behaviour to the stdlib
implementation.
This project can be licensed either under the terms of the Apache 2.0 licence, or the Affero GPL 3.0 licence. The choice is yours.
The Python package name is deadpool-executor, so to install
you must type $ pip install deadpool-executor. The import
name is deadpool, so in your Python code you must type
import deadpool to use it.
I try quite hard to keep dependencies to a minimum. Currently
Deadpool has no dependencies other than psutil, which
is simply too useful to avoid for this library.
I created Deadpool because I became frustrated with the
stdlib ProcessPoolExecutor, and various other community
implementations of process pools. In particular, I had a use-case
that required a high server uptime, but also had variable and
unpredictable memory requirements such that certain tasks could
trigger the OOM killer, often resulting in a "broken" process
pool. I also needed task-specific timeouts that could kill a "hung"
task, which the stdlib executor doesn't provide.
You might wonder, isn't it bad to just kill a task like that?
In my use-case, we had extensive logging and monitoring to alert
us if any tasks failed; but it was paramount that our services
continue to operate even when tasks got killed in OOM scenarios,
or specific tasks took too long. This is the primary trade-off
that Deadpool offers: the pool will not break, but tasks
can receive SIGKILL under certain conditions. This trade-off
is likely fine if you've seen many OOMs break your pools.
I also tried using the Pebble community process pool. This is a cool project, featuring several of the properties I was looking for, such as timeouts and more resilient operation. However, during testing I found several occurrences of a mysterious RuntimeError that caused the Pebble pool to become broken and no longer accept new tasks.
My goal with Deadpool is that the pool must never enter
a broken state. Any means by which that can happen will be
considered a bug.
What differs from ProcessPoolExecutor?
Deadpool is generally similar to ProcessPoolExecutor since it executes
tasks in subprocesses, and implements the standard Executor abstract
interface. We can draw a few comparisons to the stdlib pool to guide
your decision process about whether this makes sense for your use-case:
First, the similarities:

- Deadpool also supports the max_tasks_per_child parameter (a new feature in Python 3.11, although it has been available in multiprocessing.Pool since Python 3.2).
- The "initializer" callback in Deadpool works the same way.
- Deadpool defaults to the forkserver multiprocessing context, unlike the stdlib pool, which defaults to fork on Linux. It's just a setting though; you can change it in the same way as with the stdlib pool, as sketched below. Like the stdlib docs, I strongly advise you to avoid using fork, because propagating threads and locks via fork is going to ruin your day eventually. While this is a difference in the default behaviour of the stdlib pool, it's not a difference in behaviour from the stdlib pool when you use the forkserver context, which is the recommended context for multiprocessing.
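Here is a minimal sketch of changing the start method. Note that the mp_context parameter name is an assumption borrowed from the stdlib ProcessPoolExecutor constructor, since the text above only says the context can be changed "in the same way as with the stdlib pool":

import multiprocessing

import deadpool

def f():
    return 123

# Pick the "spawn" start method instead of the default forkserver.
# NOTE: the `mp_context` parameter name mirrors the stdlib
# ProcessPoolExecutor and is an assumption here.
ctx = multiprocessing.get_context("spawn")

with deadpool.Deadpool(mp_context=ctx) as exe:
    assert exe.submit(f).result() == 123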
Deadpool differs from the stdlib pool in the following ways:
- If a Deadpool subprocess in the pool is killed by some external actor, for example the OS runs out of memory and the OOM killer kills a pool subprocess that is using too much memory, Deadpool does not care and further operation is unaffected. Deadpool will not, and indeed cannot, raise BrokenProcessPool or BrokenExecutor.
- Deadpool precreates all subprocesses up to the pool size on creation.
- Deadpool tasks can have priorities. When the executor chooses the next pending task to schedule to a subprocess, it chooses the pending task with the highest priority. This gives you a way of prioritizing certain kinds of tasks. For example, you might give UI-sensitive tasks a higher priority to deliver a more snappy user experience to your users. The priority can be specified in the submit call.
- The shutdown parameters wait and cancel_futures can behave differently to how they work in the ProcessPoolExecutor. This is discussed in more detail later in this document.
- Deadpool currently only works on Linux. There isn't any specific reason it can't work on other platforms. The malloc trim feature also requires a glibc system, so it probably won't work on Alpine.
Deadpool has the following features that are not present in the
stdlib pool:
- With Deadpool you can provide a "finalizer" callback that will fire before a subprocess is shut down or killed (see the sketch after this list). The finalizer callback might be executed in a different thread than the main thread of the subprocess, so don't rely on the callback running in the main subprocess thread. There are certain circumstances where the finalizer will not run at all, such as when the subprocess is killed by the OS due to an out-of-memory (OOM) condition. So don't design your application such that the finalizer is required to run for correct operation.
- Even though Deadpool typically uses a hard kill to remove subprocesses, it does still run any handlers registered with atexit.
- Deadpool tasks can have timeouts. When a task hits the timeout, the underlying subprocess in the pool is killed with SIGKILL. The entire process tree of that subprocess is killed. Your application logic needs to handle this. The finalizer will not run.
- Deadpool also allows a finalizer, with corresponding finalargs, that will be called after a task is executed on a subprocess, but before the subprocess terminates. It is analogous to the initializer and initargs parameters. Just like the initializer callable, the finalizer callable is executed inside the subprocess. It is not guaranteed that the finalizer will always run: if a process is killed, e.g. due to a timeout or any other reason, the finalizer will not run. The finalizer could be used for things like flushing pending monitoring messages, such as traces and so on.
- Deadpool can ask the system allocator (Linux only) to return unused memory back to the OS when a maximum RSS threshold is exceeded. For long-running pools and modern kernels, the system memory allocator can hold onto unused memory for a surprisingly long time, and coupled with bloat due to memory fragmentation, this can result in carrying very large RSS values in your pool. The max_tasks_per_child setting helps with this because a subprocess is entirely erased when the max is reached, but it does mean that periodically there will be a small latency penalty from constructing the replacement subprocess. In my opinion, max_tasks_per_child is appropriate for when you know or suspect there's a real memory leak somewhere in your code (or a 3rd-party package!), and the easiest way to deal with that right now is just to periodically remove a process.
- Deadpool can propagate os.environ to the subprocesses. Normally, env vars present when the "main" process starts will propagate to subprocesses, but env vars modified dynamically via os.environ will not. Actually, it depends on the start method: fork does the propagation, while forkserver and spawn do not. The parameter propagate_environ, e.g. propagate_environ=os.environ, re-enables this for forkserver and spawn. The supplied mapping will be applied to the subprocesses as they are created. This also means that if you want to modify some settings, you can modify the mapping object at any time, and new subprocesses created after that modification will get the new vars. One example use-case is dynamically changing the logging level within subprocesses.
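Here is a minimal sketch of the finalizer and propagate_environ parameters described in the list above. The parameter names come from those descriptions; the task and callback bodies are invented for illustration:

import os

import deadpool

def work():
    return 123

def finalize():
    # Runs inside the subprocess before it terminates; a good place to
    # flush pending monitoring messages. Not guaranteed to run if the
    # subprocess is killed, e.g. by a timeout or the OOM killer.
    print("worker shutting down", flush=True)

with deadpool.Deadpool(
    finalizer=finalize,
    # Re-enable dynamic env var propagation for forkserver/spawn; new
    # workers pick up later changes made to this mapping.
    propagate_environ=os.environ,
) as exe:
    assert exe.submit(work).result() == 123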
Deadpool has min_workers and max_workers parameters.
While max_workers is the same as in the stdlib pool, min_workers
is a new feature.
The min_workers parameter allows Deadpool to "scale down" the
pool when it is idle. This is another strategy, alongside features
like max_tasks_per_child and max_worker_memory_bytes,
to help deal with memory bloat in long-running pools.
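Here is a minimal construction sketch combining these parameters; all of them are named above, and the values are arbitrary:

import deadpool

def work():
    return 1

with deadpool.Deadpool(
    min_workers=1,            # an idle pool may shrink to a single worker
    max_workers=8,            # hard upper bound on pool size
    max_tasks_per_child=100,  # retire each worker after 100 tasks
) as exe:
    assert exe.submit(work).result() == 1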
Here is a very simple example of how to get statistics from the executor:

with deadpool.Deadpool() as exe:
    fut = exe.submit(...)
    stats = exe.get_statistics()

The call should be made while the executor is still alive. It will still succeed after the executor has been shut down or closed, but some of the statistics will be zeroed out.
The call to get_statistics will return a dictionary with the
following keys:
- tasks_received: The total number of tasks submitted to the executor. Does not mean that they started running, only that they were successfully submitted.
- tasks_launched: The total number of tasks that were launched on a subprocess. This records the count of all tasks that were successfully scheduled to run. These tasks were picked up from the submit backlog and given to a worker process to execute.
- tasks_failed: The total number of tasks that failed. This includes tasks that raised an exception, tasks that were killed due to a timeout, and really any other reason that a task failed.
- worker_processes_created: The total number of subprocesses that were ever created by the executor. This can be, and often will be, greater than the max_workers setting because there are many options that can cause workers to be discarded and replaced. Examples include the max_tasks_per_child setting, the min_workers setting, the memory thresholds, and so on.
- max_workers_busy_concurrently: The maximum number of workers that were ever busy at the same time. This is a useful statistic for deciding whether you might increase or decrease the size of the pool. For example, if your max_workers is set to 100, but after running for, say, a week, you see that max_workers_busy_concurrently is only 50, then you might consider reducing the pool size to 50. The system memory manager on Linux likes to hold onto heap memory. If you have more workers than you need, system memory usage over time is going to be higher than it needs to be, because even when the pool is fully idle you will still observe the persistent worker processes holding a large memory allocation even though no jobs are running. This is a symptom of malloc retention behaviour.
- worker_processes_still_alive: The number of worker processes that are still alive. This includes both idle and busy worker processes. This is mainly a debugging statistic that I can use to check whether worker processes are "leaking" somehow and not being cleaned up correctly. This number should not be greater than max_workers. (It could be, temporarily, depending on the exact timing and strategy in the inner workings of the executor, but on average it should not.)
- worker_processes_idle: The number of worker processes that are idle.
- worker_processes_busy: The number of worker processes that are busy.
Here is an example from the tests to explain what each of the statistics mean:
with deadpool.Deadpool(min_workers=5, max_workers=10) as exe:
    futs = []
    for _ in range(50):
        # `t` and `f_err` are test helpers: t(x) returns x after a
        # short sleep, and f_err(exc) raises exc.
        futs.append(exe.submit(t, 0.05))
        futs.append(exe.submit(f_err, Exception))

    results = []
    for fut in deadpool.as_completed(futs):
        try:
            results.append(fut.result())
        except Exception:
            pass

    time.sleep(0.5)
    stats = exe.get_statistics()

assert results == [0.05] * 50
print(f"{stats=}")
assert stats == {
    "tasks_received": 100,
    "tasks_launched": 100,
    "tasks_failed": 50,
    "worker_processes_created": 10,
    "max_workers_busy_concurrently": 10,
    "worker_processes_still_alive": 5,
    "worker_processes_idle": 5,
    "worker_processes_busy": 0,
}

In this example, we submit 100 tasks, 50 of which will raise an exception. The executor will create 10 worker processes, and will have a maximum of 10 workers busy at the same time. After all the tasks are completed, we wait for a short time to allow the executor to clean up any worker processes that are no longer needed. The statistics should show that 5 worker processes are still alive, and all of them are idle.
The simple case works exactly the same as with ProcessPoolExecutor:

import deadpool

def f():
    return 123

with deadpool.Deadpool() as exe:
    fut = exe.submit(f)
    result = fut.result()

assert result == 123

It is intended that all the basic behaviour should "just work" in the
same way, and Deadpool should be a drop-in replacement for
ProcessPoolExecutor; but there are some subtle differences, so you
should read all of this document to see if any of those will affect you.
If a timeout is reached on a task, the subprocess running that task will be
killed, as in SIGKILL. Deadpool doesn't mind, but your own
application should: if you use timeouts, it is likely important that your tasks
be idempotent, especially if your application will restart tasks after
failures, after application deployment, and in other similar scenarios.
import time

import pytest

import deadpool

def f():
    time.sleep(10.0)

with deadpool.Deadpool() as exe:
    fut = exe.submit(f, deadpool_timeout=1.0)
    with pytest.raises(deadpool.TimeoutError):
        fut.result()

The parameter deadpool_timeout is special and is consumed by Deadpool
in the submit call. You can't use a parameter with this name in your own
function kwargs.
import deadpool

def f():
    # Try to build an enormous list to provoke the OOM killer. (The
    # original 10**100 would raise OverflowError immediately rather
    # than exhaust memory; 10**10 elements actually allocates.)
    x = list(range(10**10))

with deadpool.Deadpool() as exe:
    fut = exe.submit(f, deadpool_timeout=1.0)
    try:
        result = fut.result()
    except deadpool.ProcessError:
        print("Oh no someone killed my task!")

As long as the OOM killer terminates merely a subprocess (and not the main
process), which is likely because it'll be your subprocess that is using too
much memory, this will not hurt the pool, and it will be able to receive and
process more tasks. Note that this event will show up as a ProcessError
exception when accessing the future, so you have a way of at least tracking
these events.
Here's a typical example of how code using Deadpool might look. The output of the code further below should be similar to the following:

$ python examples/entrypoint.py
...................xxxxxxxxxxx.xxxxxxx.x.xxxxxxx.x
$

Each . is a successfully completed task, and each x is a task
that timed out. Below is the code for this example.
import random, time

import deadpool

def work():
    time.sleep(random.random() * 4.0)
    print(".", end="", flush=True)
    return 1

def main():
    with deadpool.Deadpool() as exe:
        futs = (exe.submit(work, deadpool_timeout=2.0) for _ in range(50))
        for fut in deadpool.as_completed(futs):
            try:
                assert fut.result() == 1
            except deadpool.TimeoutError:
                print("x", end="", flush=True)

if __name__ == "__main__":
    main()
    print()

- The work function will be busy for a random time period between 0 and 4 seconds.
- There is a deadpool_timeout kwarg given to the submit method. This kwarg is special and will be consumed by Deadpool. You cannot use this kwarg name for your own task functions.
- When a task completes, it prints out "." internally. But when a task raises a deadpool.TimeoutError, an "x" will be printed out instead.
- When a task times out, keep in mind that the underlying process that is executing that task is killed, literally with the SIGKILL signal.
The example below is similar to the previous one for timeouts. In fact this example retains the timeouts to show how the different features compose together. In this example we create tasks with different priorities, and we change the printed character of each task to show that higher priority items are executed first.
The code example will print something similar to the following:
$ python examples/priorities.py
!!!!!xxxxxxxxxxx!x..!...x.xxxxxxxx.xxxx.x...xxxxxx

You can see how the ! characters, used for indicating higher priority
tasks, appear towards the front, indicating that they were executed sooner.
Below is the code.
import random, time

import deadpool

def work(symbol):
    time.sleep(random.random() * 4.0)
    print(symbol, end="", flush=True)
    return 1

def main():
    with deadpool.Deadpool(max_backlog=100) as exe:
        futs = []
        for _ in range(25):
            fut = exe.submit(work, ".", deadpool_timeout=2.0, deadpool_priority=10)
            futs.append(fut)
            fut = exe.submit(work, "!", deadpool_timeout=2.0, deadpool_priority=0)
            futs.append(fut)
        for fut in deadpool.as_completed(futs):
            try:
                assert fut.result() == 1
            except deadpool.TimeoutError:
                print("x", end="", flush=True)

if __name__ == "__main__":
    main()
    print()

- When the tasks are submitted, they are given a priority. The default value for the deadpool_priority parameter is 0, but here we write them out explicitly. Half of the tasks will have priority 10 and half will have priority 0.
- A lower value for the deadpool_priority parameter means a higher priority. The highest priority allowed is indicated by 0. Negative priority values are not allowed.
- I also specified the max_backlog parameter when creating the Deadpool instance. This is discussed in more detail next, but quickly: task priority can only be enforced on what is in the submitted backlog of tasks, and the max_backlog parameter controls the depth of that queue. If max_backlog is too low, then the window of prioritization will not include tasks submitted later which might have higher priorities than earlier-submitted tasks. The submit call will in fact block once the max_backlog depth has been reached.
By default, the max_backlog parameter is set to 5. This parameter
sets the size of the "submit queue", which is where submitted tasks
are held before they are executed in background processes.
If the submit queue is large (a high max_backlog), a large number of
tasks can be added to the system with the submit method, even before
any tasks have finished executing. Conversely, a low max_backlog
means that the submit queue will fill up faster. If the submit queue
is full, the next call to submit will block.
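Here is a minimal sketch of that blocking behaviour; the pool sizes and sleep duration are arbitrary:

import time

import deadpool

def slow():
    time.sleep(5.0)

with deadpool.Deadpool(max_workers=1, max_backlog=1) as exe:
    exe.submit(slow)  # starts running on the only worker
    exe.submit(slow)  # waits in the submit queue
    exe.submit(slow)  # blocks here until a queue slot frees up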
This kind of blocking is fine, and typically desired: backpressure
from blocking controls the amount of work in flight. Using a smaller
max_backlog also limits the amount of memory in use during the
execution of all the tasks.
However, if you nevertheless still accumulate the received futures, as my
example code above does, that accumulation, i.e. the list of futures,
will contribute to memory growth. If you have a large amount of work, it
is better to set a callback function on each of the futures rather
than processing them by iterating over as_completed.
The example below illustrates this technique for keeping memory consumption down:
import random, time

import deadpool

def work():
    time.sleep(random.random() * 4.0)
    print(".", end="", flush=True)
    return 1

def cb(fut):
    try:
        assert fut.result() == 1
    except deadpool.TimeoutError:
        print("x", end="", flush=True)

def main():
    with deadpool.Deadpool() as exe:
        for _ in range(50):
            exe.submit(work, deadpool_timeout=2.0).add_done_callback(cb)

if __name__ == "__main__":
    main()
    print()

With this callback-based design, we no longer have an accumulation of futures in a list. We get the same kind of output as in the "typical example" from earlier:
$ python examples/callbacks.py
.....xxx.xxxxxxxxx.........x..xxxxx.x....x.xxxxxxx

Speaking of callbacks, the customized Future class used by Deadpool
lets you set a callback for when the task begins executing on a real
system process. That can be configured like so:
with deadpool.Deadpool() as exe:
    f = exe.submit(work)

    def cb(fut: deadpool.Future):
        print(f"My task is running on process {fut.pid}")

    f.add_pid_callback(cb)

Obviously, both kinds of callbacks can be added:

with deadpool.Deadpool() as exe:
    f = exe.submit(work)
    f.add_pid_callback(lambda fut: print(f"Started on {fut.pid=}"))
    f.add_done_callback(lambda fut: print(f"Completed {fut.pid=}"))

In the documentation for ProcessPoolExecutor, the following function signature is given for the shutdown method of the executor interface:
shutdown(wait=True, *, cancel_futures=False)

I want to honor this, but it presents some difficulties because the
semantics of the wait and cancel_futures parameters need to be
somewhat different for Deadpool.
In Deadpool, this is what the combinations of those flags mean:
| wait | cancel_futures | effect |
|---|---|---|
| True | True | Wait for already-running tasks to complete; the shutdown() call will unblock (return) when they're done. Cancel all pending tasks that are in the submit queue but have not yet started running. The fut.cancelled() method will return True for such cancelled tasks. |
| True | False | Wait for already-running tasks to complete. Pending tasks in the submit queue that have not yet started running will not be cancelled, and will all continue to execute. The shutdown() call will return only after all submitted tasks have completed. |
| False | True | Already-running tasks will be cancelled, which means the underlying subprocesses executing those tasks will receive SIGKILL. Pending tasks in the submit queue that have not yet started running will also be cancelled. |
| False | False | This is a strange one. What to do if the caller doesn't want to wait, but also doesn't want to cancel things? In this case, already-running tasks will be allowed to complete, but pending tasks in the submit queue will be cancelled. This is the same outcome as wait==True and cancel_futures==True. An alternative design might have been to allow all tasks, both running and pending, to just keep going in the background even after the shutdown() call returns. Does anyone have a use-case for this? |
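For example, here is a minimal sketch of calling shutdown directly with these flags; the task function is invented for illustration:

import time

import deadpool

def work():
    time.sleep(60.0)

exe = deadpool.Deadpool()
fut = exe.submit(work)

# Don't wait for the running task (its subprocess receives SIGKILL),
# and cancel anything still waiting in the submit queue.
exe.shutdown(wait=False, cancel_futures=True)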
If you're using Deadpool as a context manager, you might be wondering
how exactly to set these parameters in the shutdown call, since that
call is made for you automatically when the context manager exits.
For this, Deadpool accepts additional parameters when creating the instance:

# This is pseudocode
import deadpool

with deadpool.Deadpool(
    shutdown_wait=True,
    shutdown_cancel_futures=True,
) as exe:
    fut = exe.submit(...)

This project uses nox. Follow the instructions for installing
nox at their page, and then come back here.
While nox can be configured so that all the tools for each of
the tasks are installed automatically when run, this takes
too much time, so I've decided that you should just have
the following tools in your environment, ready to go. They
do not need to be installed in the same venv or anything like
that. A convenient way to do this is with pipx.
For example, to install black using pipx you can do
the following:
$ pipx install black

You must do the same for isort and ruff. See the following
sections for actually using nox to perform dev actions.
To run the tests:

$ nox -s test

To run tests for a particular version, say with coverage:

$ nox -s testcov-3.11

To pass additional arguments to pytest, use the -- separator:

$ nox -s testcov-3.11 -- -k test_deadpool -s <etc>

The -- separator is nonstandard, but I customized the noxfile.py to
allow it.
To apply style fixes, and check for any remaining lints:

$ nox -t style

The only docs currently are this README, which uses RST. GitHub uses docutils to render RST.
This project uses flit to release the package to PyPI. The whole process isn't as automated as I would like, but this is what I currently do:

1. Ensure that the main branch is fully up to date with everything to be released, and that all the tests succeed.
2. Change the __version__ field in deadpool.py. Flit uses this to stamp the version.
3. Verify that flit build succeeds. This will produce a wheel in the dist/ directory. You can inspect this wheel to ensure it contains only what is necessary; this wheel is what gets uploaded to PyPI.
4. Commit the changed __version__. It is easy to forget this step, resulting in multiple awkward releases to try to get the state correct again.
Now create the git tag and push to GitHub:

$ git tag YYYY.MM.patch
$ git push --tags origin main
Now deploy to PyPI:
$ flit publish