LTMP2 - Tutorial: Difference between revisions

Revision as of 15:50, 27 September 2018

(UNDER CONSTRUCTION)

On this page, we explain how to perform a calculation using the LTMP2^[1] algorithm. Make sure you successfully completed the preparation steps Hartree-Fock ground state and Hartree-Fock virtuals.

The INCAR file

The LTMP2 calculation can simply be performed using the following INCAR file

ALGO = ACFDTRK 
LMP2LT = .TRUE.
NOMEGA = # number of tau points (see below)
NBANDS = # same valule as in the Hartree-Fock unoccupieds step ( = number of plane-waves)
ENCUT = # same value as in the Hartree-Fock step
LORBITALREAL = .TRUE.
PRECFOCK = Fast

Make sure that VASP reads the WAVECAR file from the Hartree-Fock virtuals step.

NOMEGA flag

The number of $\tau$ -points is set with the NOMEGA flag. This is necessary to calculate the Laplace transformed energy denominator (see Ref ^[1] for details),

${\frac {1}{\varepsilon _{i}+\varepsilon _{j}-\varepsilon _{a}-\varepsilon _{b}}}=-\int _{0}^{\infty }{\textrm {e}}^{-(\varepsilon _{i}+\varepsilon _{j}-\varepsilon _{a}-\varepsilon _{b})\tau }\;{\textrm {d}}\tau \;.$ .

Parallelization

The LTMP2 algorithm is a high-performance code and can easily be used on many CPUs. Both OpenMP and MPI is supported. We recommend to use MPI for parallelization since the code possesses an almost ideal parallelization efficiency. OpenMP should only be used to increase the shared memory, if necessary.

In order to activate the efficient MPI parallelization use the KPAR flag in the following way (note that the usual meaning of the KPAR flag becomes obsolete in the LTMP2 algorithm). Set KPAR to half of the used MPI ranks. If this results in memory issues, further decrease KPAR (such that KPAR is alway a divisor of the used MPI ranks) or increase the number of OpenMP threads.

Example for 512 CPUs

MPI ranks: 512
OpenMP threads per rank: 1

KPAR = 256

To decrease the memory requirement you can alternatively set
MPI ranks: 512
OpenMP threads per rank: 1

KPAR = 128

or
MPI ranks: 512
OpenMP threads per rank: 1

KPAR = 64

and so on. Or also try
MPI ranks: 256
OpenMP threads per rank: 2

KPAR = 128

References

↑ ^a ^b T. Schäfer, B. Ramberger, and G. Kresse, J. Chem. Phys. 146, 104101 (2017).

[schaefer2017-1] T. Schäfer, B. Ramberger, and G. Kresse, J. Chem. Phys. 146, 104101 (2017).

[1]

@@ Line 24: / Line 24: @@
 == Parallelization ==
-The LTMP2 algorithm is a high-performance code and can easily be used on many CPUs. Both OpenMP and MPI is supported.
+The LTMP2 algorithm is a high-performance code and can easily be used on many CPUs. Both OpenMP and MPI is supported. We recommend to use MPI for parallelization since the code possesses an almost ideal parallelization efficiency. OpenMP should only be used to increase the shared memory, if necessary.
+In order to activate the efficient MPI parallelization use the KPAR flag in the following way (note that the usual meaning of the KPAR flag becomes obsolete in the LTMP2 algorithm). Set KPAR to half of the used MPI ranks. If this results in memory issues, further decrease KPAR (such that KPAR is alway a divisor of the used MPI ranks) or increase the number of OpenMP threads.
+==== Example for 512 CPUs ====
+MPI ranks: 512 <br />
+OpenMP threads per rank: 1 <br />
+ KPAR = 256
+To decrease the memory requirement you can alternatively set <br />
+MPI ranks: 512 <br />
+OpenMP threads per rank: 1 <br />
+ KPAR = 128
+or <br />
+MPI ranks: 512 <br />
+OpenMP threads per rank: 1 <br />
+ KPAR = 64
+and so on. Or also try <br />
+MPI ranks: 256 <br />
+OpenMP threads per rank: 2 <br />
+ KPAR = 128
 == References ==