Auto-parallelization of machine-learning dataflow graphs for CPU multicores