Variable-Specific Synchronization in MXNet
Instead of halting all operations globally, MXNet allows for targeted synchronization by blocking execution only until a specific variable is computed. This is achieved by calling the wait_to_read() method on a specific tensor, such as z.wait_to_read(). In this scenario, the framework blocks the return of control to the Python frontend only until that particular variable's result is available, while permitting other unrelated background computations in the backend queue to continue processing simultaneously.
0
1
Tags
D2L
Dive into Deep Learning @ D2L
Related
Global Synchronization in MXNet
Variable-Specific Synchronization in MXNet
Implicit Blockers in Deep Learning Frameworks
Global Synchronization in PyTorch
Example of Asynchronous Benchmarking
Scheduling Overhead in Multithreaded Deep Learning Systems
Example of Synchronous vs. Asynchronous Increment Benchmark
Minibatch Synchronization to Prevent Task Queue Overflow
Chip Vendor Performance Analysis Tools for Deep Learning
Automatic Multi-GPU Parallelism via Asynchronous Execution