Thanks for the pointers. Following that guide I was able to determine that I can get the task transitions if I do the following:
logging.getLogger("distributed.scheduler").setLevel(logging.DEBUG)
This gives me logs such as:
distributed.scheduler - DEBUG - Stimulus task finished ('from_pandas-2b3f78b4bd17e79620af6e53e647318e', 2), tcp://127.0.0.1:42181
distributed.scheduler - DEBUG - Stimulus task finished ('from_pandas-2b3f78b4bd17e79620af6e53e647318e', 0), tcp://127.0.0.1:42201
distributed.scheduler - DEBUG - Stimulus task finished ('from_pandas-2b3f78b4bd17e79620af6e53e647318e', 1), tcp://127.0.0.1:45229
distributed.scheduler - DEBUG - Stimulus task finished ('from_pandas-2b3f78b4bd17e79620af6e53e647318e', 3), tcp://127.0.0.1:41599
distributed.scheduler - DEBUG - Stimulus task finished ('from_pandas-2b3f78b4bd17e79620af6e53e647318e', 5), tcp://127.0.0.1:42201
distributed.scheduler - DEBUG - Stimulus task finished ('from_pandas-2b3f78b4bd17e79620af6e53e647318e', 6), tcp://127.0.0.1:45229
distributed.scheduler - DEBUG - Stimulus task finished ('from_pandas-2b3f78b4bd17e79620af6e53e647318e', 4), tcp://127.0.0.1:42181
distributed.scheduler - DEBUG - Stimulus task finished ('from_pandas-2b3f78b4bd17e79620af6e53e647318e', 7), tcp://127.0.0.1:41599
distributed.scheduler - DEBUG - Stimulus task finished ('explode-18c5ea4301bfdc82a4139c4302919b1a', 2), tcp://127.0.0.1:42181
distributed.scheduler - DEBUG - Stimulus task finished ('explode-18c5ea4301bfdc82a4139c4302919b1a', 1), tcp://127.0.0.1:45229
distributed.scheduler - DEBUG - Stimulus task finished ('explode-18c5ea4301bfdc82a4139c4302919b1a', 3), tcp://127.0.0.1:41599
distributed.scheduler - DEBUG - Stimulus task finished ('explode-18c5ea4301bfdc82a4139c4302919b1a', 0), tcp://127.0.0.1:42201
distributed.scheduler - DEBUG - Stimulus task finished ('explode-18c5ea4301bfdc82a4139c4302919b1a', 7), tcp://127.0.0.1:45229
distributed.scheduler - DEBUG - Stimulus task finished ('explode-18c5ea4301bfdc82a4139c4302919b1a', 4), tcp://127.0.0.1:42181
distributed.scheduler - DEBUG - Stimulus task finished ('explode-18c5ea4301bfdc82a4139c4302919b1a', 6), tcp://127.0.0.1:41599
distributed.scheduler - DEBUG - Stimulus task finished ('explode-18c5ea4301bfdc82a4139c4302919b1a', 5), tcp://127.0.0.1:42201
It’s great to be able to see in this level of detail. This is generally what I want - the individual subtasks (I guess they’re called stimulus tasks?). But it would be great to get these without all the other logs I get inundated with due to using such a low debug level. Ideally it would even print out all of the state transitions, such as the task submission not just when a task finishes.
I also wonder, if the logs have this level of detail, why we can’t create a progress bar context manager just like there is for the local scheduler.