Xu, Bao-yu, Wu Zhang, Xian-he Sun, and Yang Wang. "A memory-driven scheduling scheme and optimization for concurrent execution in GPU." Cluster Computing 19, no. 4 (2016): 2241-2250.