ICAP2012 - List of Authors (Min, M.)

Paper

Title

Page

Comparison of Different Electromagnetic Solvers for Accelerator Simulations

155

J. Xu, R. Zhao, X. Zhufu
IS, Beijing, People's Republic of China
C. Li, X. Qi, L. Yang
IMP, Lanzhou, People's Republic of China
M. Min
ANL, Argonne, USA

Funding: Chinese Academy of Science
Electromagnetic simulations are fundamental for accelerator modeling. In this paper two high-order numerical methods will be studied. These include continuous Galerkin (CG) method with vector bases, and discontinuous Galerkin (DG) method with nodal bases. Both methods apply domain decomposition method for the parallelization. Due to the difference in the numerical methods, these methods have different performance in speed and accuracy. DG method on unstructured grid has the advantages of easy parallelization, good scalability, and strong capability to handle complex geometries. Benchmarks of these methods will be shown on simple geometries in detail first. Then they will be applied for simulation in accelerator devices, and the results will be compared and discussed.

FRSAC1

Hybrid Programming and Performance for Beam Propagation Modeling

284

M. Min, A. Mametjanov
ANL, Argonne, USA
J. Fu
RPI, Troy, New York, USA

Funding: DOE ASCR (Advanced Scientific Computing Research) Program
We examined hybrid parallel infrastructures in order to ensure performance and scalability for beam propagation modeling as we move toward extreme-scale systems. Using an MPI programming interface for parallel algorithms, we expanded the capability of our existing electromagnetic solver to a hybrid (MPI/shared-memory) model that can potentially use the computer resources on future-generation computing architecture more efficiently. As a preliminary step, we discuss a hybrid MPI/OpenMP model and demonstrate performance and analysis on the leadership-class computing systems such as the IBM BG/P, BG/Q, and Cray XK6. Our hybrid MPI/OpenMP model achieves speedup when the computation amounts are large enough to compensate the OMP threading overhead.

Slides FRSAC1 [4.252 MB]

Paper	Title	Page
WEP08	Comparison of Different Electromagnetic Solvers for Accelerator Simulations	155
	J. Xu, R. Zhao, X. Zhufu IS, Beijing, People's Republic of China C. Li, X. Qi, L. Yang IMP, Lanzhou, People's Republic of China M. Min ANL, Argonne, USA
	Funding: Chinese Academy of Science Electromagnetic simulations are fundamental for accelerator modeling. In this paper two high-order numerical methods will be studied. These include continuous Galerkin (CG) method with vector bases, and discontinuous Galerkin (DG) method with nodal bases. Both methods apply domain decomposition method for the parallelization. Due to the difference in the numerical methods, these methods have different performance in speed and accuracy. DG method on unstructured grid has the advantages of easy parallelization, good scalability, and strong capability to handle complex geometries. Benchmarks of these methods will be shown on simple geometries in detail first. Then they will be applied for simulation in accelerator devices, and the results will be compared and discussed.

FRSAC1	Hybrid Programming and Performance for Beam Propagation Modeling	284
	M. Min, A. Mametjanov ANL, Argonne, USA J. Fu RPI, Troy, New York, USA
	Funding: DOE ASCR (Advanced Scientific Computing Research) Program We examined hybrid parallel infrastructures in order to ensure performance and scalability for beam propagation modeling as we move toward extreme-scale systems. Using an MPI programming interface for parallel algorithms, we expanded the capability of our existing electromagnetic solver to a hybrid (MPI/shared-memory) model that can potentially use the computer resources on future-generation computing architecture more efficiently. As a preliminary step, we discuss a hybrid MPI/OpenMP model and demonstrate performance and analysis on the leadership-class computing systems such as the IBM BG/P, BG/Q, and Cray XK6. Our hybrid MPI/OpenMP model achieves speedup when the computation amounts are large enough to compensate the OMP threading overhead.
	Slides FRSAC1 [4.252 MB]