DataNode initialization / construction

hi all, i’m just getting started with Dask and trying to familiarize myself with how stuff works. i’ve started by looking at the unit tests, specifically test_merge.py::test_merge where it calls from_pandas() to construct a dask df. in Expr::_new_, the part i’m confused on is right after you unpack the operands but right before you return the inst object . the inst object has a “dask” attribute that is a map to a bunch of DataNodes. However, i’m not sure how these DataNodes are being constructed. despite my efforts to put breakpoints in the initializer for DataNode, it never goes in there. is there something I’m missing? thanks

Hi @Jda,

Sorry for the long time to answer.

I’m not sure looking at unit tests is the right entrypoint for Dask. I think you’d better read the documentation, and then go through examples or tutorials.

If you xant to understand about DataNodes, you should go through the graph section of the doc.