Dr. Kirk W. Cameron

Assistant Professor
Department of Computer Science and Engineering 
University of South Carolina

Time : Friday, April 5, 11:00am

Location: SB 239

 

Toward a unified, practical model of parallel computation: Incorporating the memory hierarchy

 

Abstract

Modeling communication memory performance is difficult since it requires estimation of the interaction of access patterns (software) with the memory hierarchy implementation (hardware). Micro-architecture enhancements such as blocking, prefetching, write buffering, non-blocking accesses and associativity are present (and varied) at each level of a multi-level cache hierarchy, complicating any attempt to estimate performance. Combining characterization of the access patterns of an application with a resulting architectural model is equally challenging. Moreover, coupling any such model with the implicit network-related parameters of the LogP model of parallel computation and finding an acceptable balance between simplicity and practicality exacerbates the problem. 

Applications that require out-of-core accesses suffer severely from local memory communication performance delays. Message packing and transmission to the network buffer for such codes require a significant portion of time that is not estimated in models of point-to-point communication like LogP. Architectural advances that reduce the effects of network latency through pipelined communication and DMA accesses result in local memory-related delays encompassing a greater proportion of overall communication delay for many more applications. For asynchronous transfers the effects of local communication may encompass nearly all communication cost. There is increasing need to include local memory performance in models of parallel and distributed computation. We discuss these issues and extend the LogP model of parallel computation to incorporate local memory communication performance.

 

Short Bio of the Speaker

Kirk W. Cameron received his Ph.D. degree in computer science from Louisiana State University in August 2000. During his graduate career, he worked as an engineer for Intel Corporation in Hillsboro, Oregon as part of 82450NX Chipset design group in 1996. He accomplished portions of his thesis work as a graduate researcher on the Parallel Architecture and Performance Team in the Modeling, Algorithms and Informatics Group (CCS-3) of the Computer and Computational Science Division at Los Alamos National Laboratory. Currently, Dr. Cameron is an assistant professor in the Department of Computer Science and Engineering at the University of South Carolina. His research interests include: parallel and distributed processing, software systems, performance evaluation, and computer architecture. 

 

Back