US-Byte: An Efficient Communication Framework for Scheduling Unequal-sized Tensor Blocks in Distributed Deep Learning
Journal article   Open access   Peer reviewed

Yunqi Gao, Bing Hu, Mahdi Boloursaz Mashhadi, A-Long Jin, Pei Xiao and Chunming Wu
IEEE Transactions on Parallel and Distributed Systems: A Publication of the IEEE Computer Society, Vol. 35(1), pp. 123-139
01/2024

Abstract

Keywords: Distributed Deep Learning, Data Parallelism, Communication Scheduling, Tensor Partitioning, Tensor Fusion
The final manuscript (PDF, 9.75 MB)
Author's Accepted Manuscript Open Access

Metrics

208 File views/downloads
65 Record Views
