Parallelization of an Implicit Algorithm for Multi-Dimensional Particle-in-Cell Simulations
George M. Petrov 1*, Jack Davis 11 Naval Research Laboratory, Plasma Physics Division, 4555 Overlook Ave. SW, Washington, DC 20375, USA
Received 7 August 2013; Accepted (in revised version) 28 February 2014
Available online 24 June 2014
The implicit 2D3V particle-in-cell (PIC) code developed to study the interaction of ultrashort pulse lasers with matter [G. M. Petrov and J. Davis, Computer Phys. Comm. 179, 868 (2008); Phys. Plasmas 18, 073102 (2011)] has been parallelized using MPI (Message Passing Interface). The parallelization strategy is optimized for a small number of computer cores, up to about 64. Details on the algorithm implementation are given with emphasis on code optimization by overlapping computations with communications. Performance evaluation for 1D domain decomposition has been made on a small Linux cluster with 64 computer cores for two typical regimes of PIC operation: "particle dominated", for which the bulk of the computation time is spent on pushing particles, and "field dominated", for which computing the fields is prevalent. For a small number of computer cores, less than 32, the MPI implementation offers a significant numerical speed-up. In the "particle dominated" regime it is close to the maximum theoretical one, while in the "field dominated" regime it is about 75-80 % of the maximum speed-up. For a number of cores exceeding 32, performance degradation takes place as a result of the adopted 1D domain decomposition. The code parallelization will allow future implementation of atomic physics and extension to three dimensions.
Notice: Undefined variable: ams in /var/www/html/readabs.php on line 164
PACS: 52.38.-r, 52.38.Ph, 52.50.Jm
Key words: Particle-in-cell, Maxwell equations, MPI, laser–target interaction.
Email: firstname.lastname@example.org (G. M. Petrov), email@example.com (J. Davis)