RPA/ACFDT: Correlation energy in the Random Phase Approximation

ACFDT stands for the adiabatic connection fluctuation dissipation theorem and is an alternative way to derive the energy expression for the correlation energy in the random phase approximation (RPA). In the following, the diagrammatic description is presented. For the ACFDT formulation, the reader is referred to the literature.^[1] There is also a lecture introducing RPA on our YouTube channel.

Diagrammatic approach to the correlation energy

The correlation energy [math]\displaystyle{ E_c }[/math] is defined as the missing piece of the Hartree-Fock energy [math]\displaystyle{ E_{x} }[/math] to the total energy, that is [math]\displaystyle{ E_{tot} = E_{x} + E_c }[/math]. The exact form of [math]\displaystyle{ E_c }[/math] is unknown and can be calculated only approximately for a realistic system. The Random Phase Approximation (RPA) is such an approximation that provides access to [math]\displaystyle{ E_c }[/math]. The RPA was first studied by Bohm and Pines for the homogeneous electron gas and was later recognized by Gell-Mann and Brueckner as an approximation of [math]\displaystyle{ E_c }[/math] that can be expressed in the same language as Feynman used a few years earlier to describe the positron. ^[2]^[3]^[4]

Feynman's diagrammatic approach is based on quantum field theory (QFT), which in turn is based on the Gell-Mann and Low theorem. This theorem states that the eigenstate of an interacting Hamiltonian can be expressed in terms of the eigenstates of the non-interacting one.^[5] For this reason, each diagrammatic calculation, like the RPA or GW, requires the solution of the non-interacting Hamiltonian [math]\displaystyle{ H_0 }[/math] of the system, like for instance the Hartree-Fock energies and orbitals or the solutions of the Kohn-Sham Hamiltonian [math]\displaystyle{ \epsilon_{n\bf k}, \phi_{n\bf k} }[/math].

QFT is commonly formulated in the Dirac (also known as interaction) picture, where dynamics described by the interaction part [math]\displaystyle{ \hat V }[/math] of the fully interacting Hamiltonian [math]\displaystyle{ \hat H=\hat H_0+\hat V }[/math] are singled out via time-dependent operators like [math]\displaystyle{ \hat V(t)=e^{i\hat H_0t}\hat Ve^{-i\hat H_0t} }[/math]. These operators act on states like the non-interacting groundstate of the system [math]\displaystyle{ |\Psi_0\rangle }[/math], causing fluctuations at a specific point in time. The main idea of QFT is to understand observations, which can be measured by an observer, as a collective phenomenon of all possible fluctuations.^[6]

Thereby, fluctuations are understood as the creation of virtual electrons (and holes) that interact with each other and are annihilated after some time. Formally this is achieved by introducing creation [math]\displaystyle{ \hat\psi^\dagger({\bf r},t) }[/math] and annihilation operators [math]\displaystyle{ \hat\psi({\bf r},t) }[/math] that satisfy following relations

[math]\displaystyle{ \hat\psi({\bf r},t)|\Psi_0\rangle = 0 =\langle \Psi_0 | \hat\psi({\bf r},t) }[/math]

[math]\displaystyle{ \lbrace \psi({\bf r},t),\psi^\dagger({\bf r},t)\rbrace = \psi^\dagger({\bf r},t)\psi^\dagger({\bf r},t) + \psi^\dagger({\bf r},t),\psi({\bf r},t) = i\delta({\bf r}-{\bf r}') }[/math]

[math]\displaystyle{ \lbrace\psi^\dagger({\bf r},t),\psi^\dagger({\bf r},t) \rbrace = 0 = \lbrace\psi({\bf r},t),\psi({\bf r},t) \rbrace. }[/math]

The first relation defines the non-interacting groundstate [math]\displaystyle{ |\Psi_0\rangle }[/math] as the Fermi vacuum (the groundstate in the absence of any fluctuations), while the second and third anti-commutator relations are a consequence of the Pauli principle. In fact, all operators that describe measurable quantities of a system of interacting electrons can be represented in terms of [math]\displaystyle{ \psi^\dagger({\bf r},t) }[/math] and [math]\displaystyle{ \psi({\bf r},t) }[/math] alone; additional objects are not necessary.

However, the time-ordering operator

[math]\displaystyle{ \hat T \hat A(t)\hat B(t') = \Theta(t-t') \hat A(t)\hat B(t') - \Theta(t'-t)\hat B(t')\hat A(t), }[/math]

where [math]\displaystyle{ \Theta(t) }[/math] is the unit step function, and the time-evolution operator

[math]\displaystyle{ \hat S(t,t_0)=\hat T e^{-i\int_{t_0}^t \hat V(t'){\rm d}t'} }[/math]

are helpful quantities, since they allow to formulate the Gell-Mann and Low theorem as follows.

Gell-Mann and Low theorem

Using adiabatic coupling of the interaction [math]\displaystyle{ \hat V(t) \to \hat V_\eta(t) = e^{-\eta|t|} \hat V }[/math], Gell-Mann and Low proved that the vectors

[math]\displaystyle{ \frac{|\Omega_\nu\rangle}{\langle \Omega_\nu|\Psi_\nu\rangle} =\lim_{\eta\to0}\frac{\hat S_\eta(0,-\infty)|\Psi_\nu\rangle}{\langle \Omega_\nu|\Psi_\nu\rangle} }[/math]

are the eigenstates of the interacting Hamiltonian.^[5]

We follow the common literature and suppress the infinitesimal [math]\displaystyle{ \eta }[/math] in the following bearing in mind that the limit [math]\displaystyle{ \eta \to 0 }[/math] is performed at the very end of the calculation.^[7]^[8]

Diagrammatic perturbation theory

A consequence of the Gell-Mann and Low theorem, is the following form of the interacting groundstate energy^[8]

[math]\displaystyle{ E_{tot}=E_0 = \langle \Omega_0|\hat H|\Omega_0\rangle = \frac{\langle\Psi_0| \hat S(\infty,-\infty)\hat H|\Psi_0\rangle}{\langle \Psi_0|\hat S(\infty,-\infty)|\Psi_0\rangle}, }[/math]

which can be seen as starting point of diagrammatic perturbation theory. The expression above is used to derive all possible approximations by expanding the time-evolution operator [math]\displaystyle{ \hat S }[/math] into a series. The resulting matrix-elements of creation and annihilation operators are evaluated term by term using the canonical anti-commutator relations defined above (Wick's theorem^[9]). It follows that all terms in perturbation theory are expressed by only two quantities, the non-interacting Feynman propagator

[math]\displaystyle{ G_0(1,2) = -i \sum_{n{\bf k}} \phi({\bf r}_2)\phi^*({\bf r}_1) e^{-i(\epsilon_{n \bf k}-\epsilon_F)(t_2-t_1)}\left[ f_{n\bf k}\Theta(t_2-t_1) - (1-f_{n\bf k})\Theta(t_1-t_2)\right], \quad 1 = ({\bf r}_1,t_1), 2 = ({\bf r}_2,t_2) }[/math]

and the Coulomb interaction

[math]\displaystyle{ V(1,2) = \frac{\delta( t_1-t_2)}{|{\bf r}_1-{\bf r}_2|}. }[/math]

Then, each term in the series corresponds to an integral over space-time coordinates [math]\displaystyle{ ({\bf r},t) }[/math].

Feynman diagrams are used to illustrate which terms are considered in the perturbation series. The illustration is usually achieved with so-called Feynman rules that map a specific diagram to an integral (and vice versa). For instance the second order diagram

is also known as the direct Møller-Plessett term and stands for following integral

[math]\displaystyle{ E^{(2)}_{\rm dMP}=\int{\rm d}(1,2,3,4) G_0(1,2)G_0(2,1) V(1,3)V(2,4) G_0(3,4)G_0(4,3), \quad {\rm d}(1,\cdots,4) = {\rm d}{\bf r}_1{\rm d}t_1\cdots {\rm d}{\bf r}_4{\rm d}t_4 }[/math]

All Feynman rules can be found in the book of Negele and Orland or elsewhere.^[7]^[8]

The random-phase approximation

The RPA is obtained from neglecting all second and higher order terms in the perturbation series of the groundstate energy, except of those which can be expressed soley in terms of the independent particle polarizability

[math]\displaystyle{ \chi_0(1,2) = -i G_0(1,2) G_0(2,1) }[/math]

corresponding to the "bubble" diagram

Because of the symmetric time property [math]\displaystyle{ \chi_0(t_2-t_1)=\chi_0(t_1-t_2) }[/math], the independent particle polarizability is of bosonic character. Because the RPA neglects all non-bosonic terms in the perturbation series, it corresponds essentially to a "bosonization" of the many-body problem for which the n-th order term can be written analytically as^[7]

[math]\displaystyle{ E^{(n)}_{\rm dMP} = \frac1{2n}\int_{-\infty}^\infty\frac{{\rm d}\omega}{2\pi} {\rm Tr}\left[ \tilde \chi_0(\omega) \cdot V \right]^n. }[/math]

Here, the trace of the matrix product is most effectively done in reciprocal space [math]\displaystyle{ \left[\tilde \chi_0(\omega) \cdot V\right]({\bf q+G}_1,{\bf q+G}_2) = \sum_{\bf G} \tilde \chi_0({\bf q+G}_1,{\bf G},\omega)V({\bf q+G},{\bf q+G}_2) }[/math] using the Fourier transformed polarizability [math]\displaystyle{ \tilde \chi_0({\bf q+G}_1,{\bf q+G}_2,\omega) }[/math], the diagonal Coulomb potential [math]\displaystyle{ V({\bf q+G}_1,{\bf q+G}_2)=\frac{ \delta_{ {\bf G}_1 {\bf G}_2 } }{|{\bf q+G}_1|} }[/math] and the conserved crystal momentum [math]\displaystyle{ {\bf q} }[/math] in the first Brillouin zone.

All bubble terms of order [math]\displaystyle{ n \ge 2 }[/math] can be written in a closed form using the series for the logarithm [math]\displaystyle{ \ln(1-x)+x=-\sum_{n=2}^\infty \frac{x^n}{n} }[/math] and define the correlation part of the RPA energy

[math]\displaystyle{ E_c^{\rm RPA} = \int\frac{ {\rm d}\omega}{2\pi} {\rm Tr}\left\lbrace \ln\left[ 1-\tilde \chi_0(\omega)\cdot V \right] + \tilde \chi_0(\omega)\cdot V \right\rbrace. }[/math]

There are two first order contributions to the total energy that yield the exact exchange energy [math]\displaystyle{ E_x=T+V_{ext}+V_h+V_x }[/math], which is usually determined separately.

Computational Complexity

The calculation of the RPA integral requires the determination of the independent particle polarizability matrix [math]\displaystyle{ \tilde \chi^0_{\bf GG'}({\bf q},\omega_n)=\tilde \chi_0({\bf q+G},{\bf q+G}',\omega_n) }[/math] on each of the [math]\displaystyle{ N_{\bf q} }[/math] sampling points of the first Brillouin zone for [math]\displaystyle{ N_{\omega} }[/math] frequency points. The number of frequency points is reduced drastically, by performing the integration over the imaginary frequency axis [math]\displaystyle{ \omega\to i\omega }[/math].^[10]

The independent particle polarizability on the imaginary axis can be determined with two alternative methods.

Quartic scaling RPA: Direct calculation

Direct calculation of [math]\displaystyle{ \tilde\chi^0 }[/math] using the formula of Adler and Wiser^[11] ^[12]

[math]\displaystyle{ \tilde\chi^0_{{\bf GG}'}({\bf q},i\omega) = \sum\limits_{{\bf k}\in BZ}\sum\limits_{n,n'} \frac{ f_{n{\bf k}}(1 - f_{n{\bf k-q}}) }{ \epsilon_{n{\bf k-q}}-\epsilon_{n{\bf k}} -i \omega } \langle \phi_{n {\bf k-q}} | e^{i{\bf Gr}} | \phi_{n'{\bf k}} \rangle \langle \phi_{n' {\bf k}} | e^{-i{\bf G'r'}} | \phi_{n'{\bf k-q}} \rangle }[/math]

yields an RPA algorithm that has a computational cost of [math]\displaystyle{ N_\omega N_{\bf k}^2 N_{\bf G}^4 }[/math]. Because the number of plane waves [math]\displaystyle{ N_{\bf G} }[/math] scales linearly with the system size (number of electrons in the unit cell), the direct calculation of the polarizability is unfavourable for large system sizes, e.g. for more than ~20 atoms in the unit cell.

Cubic scaling RPA: Contraction of imaginary time Green's functions

An alternative way to determine [math]\displaystyle{ \tilde\chi^0 }[/math] is to frist determine imaginary time Green's functions of the form^[13]

[math]\displaystyle{ G_0({\bf r,r'},i\tau) = \sum\limits_{{\bf k}\in BZ}\sum\limits_{n} \phi_{n{\bf k}}({\bf r})\phi_{n \bf k}^*({\bf r'}) e^{-(\epsilon_{n\bf k}-\epsilon_{F})\tau}\left[ \Theta(-\tau)f_{n\bf k}-\Theta(\tau)(1-f_{n\bf k}) \right] }[/math]

and to perform afterwards a Fourier transformation into reciprocal and imaginary frequency space of

[math]\displaystyle{ \chi_0({\bf r,r'},i\tau) = -G_0({\bf r,r'},i\tau) G_0({\bf r',r},-i\tau). }[/math]

Although more evolved, this approach has the advantage that the computational cost for the determination of [math]\displaystyle{ \tilde \chi_0 }[/math] scales with [math]\displaystyle{ N_\omega N_{\bf k} N_{\bf G}^3 }[/math] and is essentially only cubic in system size. The space-time method allows to study relatively large systems with the RPA.^[14]

Basis set convergence of RPA-ACFDT calculations

The expression for the ACFDT-RPA correlation energy written in terms of reciprocal lattice vectors reads:

[math]\displaystyle{ E_{\rm c}^{\rm RPA}=\int_{0}^{\infty} \frac{\mathrm{d}\omega}{2\pi} \sum_{{\mathbf{q}}\in \mathbf{BZ} }\sum_{{\mathbf{G}}} \left\{(\mathrm{ln}[1-\tilde\chi^0({\mathbf{q}},\mathrm{i}\omega)V({\mathbf{q}})])_{{\mathbf{G,G}}} +V_{{\mathbf{G,G}}}({\mathbf{q}})\tilde\chi^0({\mathbf{q}},{\mathrm{i}}\omega) \right\} }[/math].

The sum over reciprocal lattice vectors has to be truncated at some [math]\displaystyle{ \mathbf{G}_{\mathrm{max}} }[/math], determined by [math]\displaystyle{ \frac{\hbar^2|{\mathbf{G}}+{\mathbf{q}}|^2}{2\mathrm{m}_e} }[/math] < ENCUTGW, which can be set in the INCAR file. The default value is [math]\displaystyle{ \frac{2}{3}\times }[/math] ENCUT, which experience has taught us not to change. For systematic convergence tests, instead increase ENCUT and repeat steps 1 to 4, but be aware that the "maximum number of plane-waves" changes when ENCUT is increased. Note that it is virtually impossible, to converge absolute correlation energies. Rather concentrate on relative energies (e.g. energy differences between two solids, or between a solid and the constituent atoms).

Since correlation energies converge very slowly with respect to [math]\displaystyle{ \mathbf{G}_{\rm max } }[/math], VASP automatically extrapolates to the infinite basis set limit using a linear regression to the equation: ^[1]^[15]^[16]

[math]\displaystyle{ E_{\mathrm{c}}({\mathbf{G}})=E_{\mathrm{c}}(\infty)+\frac{A}{{\mathbf{G}}^3} }[/math].

Furthermore, the Coulomb kernel is smoothly truncated between ENCUTGWSOFT and ENCUTGW using a simple cosine like window function (Hann window function). Alternatively, the basis set extrapolation can be performed by setting LSCK=.TRUE., using the squeezed Coulomb kernel method.^[17]

The default for ENCUTGWSOFT is 0.8[math]\displaystyle{ \times }[/math]ENCUTGW (again we do not recommend to change this default).

The integral over [math]\displaystyle{ \omega }[/math] is evaluated by means of a highly accurate minimax integration.^[10] The number of [math]\displaystyle{ \omega }[/math] points is determined by the flag NOMEGA, whereas the energy range of transitions is determined by the band gap and the energy difference between the lowest occupied and highest unoccupied one-electron orbital. VASP determines these values automatically (from vasp.5.4.1 on), and the user should only carefully converge with respect to the number of frequency points NOMEGA. A good choice is usually NOMEGA=12, however, for large gap systems one might obtain [math]\displaystyle{ \mu }[/math]eV convergence per atom already using 8 points, whereas for metals up to NOMEGA=24 frequency points are sometimes necessary, in particular, for large unit cells.

Strictly adhere to the steps outlines above. Specifically, be aware that steps two and three require the WAVECAR file generated in step one, whereas step four requires the WAVECAR and WAVEDER file generated in step three (generated by setting LOPTICS=.TRUE.).

Matsubara Formalism: Metallic systems at finite Temperature

The zero-temperature formalism of many-body perturbation theory breaks down for metals (systems with zero energy band-gap) as pointed out by Kohn and Luttinger.^[18] This conundrum is lifted by considering diagrammatic perturbation theory at finite temperature [math]\displaystyle{ T>0 }[/math], which may be understood by an analytical continuation of the real-time [math]\displaystyle{ t }[/math] to the imaginary time axis [math]\displaystyle{ -i\tau }[/math]. Matsubara has shown that this Wick rotation in time [math]\displaystyle{ t\to-i\tau }[/math] reveals an intriguing connection to the inverse temperature [math]\displaystyle{ \beta=1/T }[/math] of the system.^[19] More precisely, Matsubara has shown that all terms in perturbation theory at finite temperature can be expressed as integrals of imaginary time quantities (such as the polarizability [math]\displaystyle{ \chi(-i\tau) }[/math]) over the fundamental interval [math]\displaystyle{ -\beta\le\tau\le\beta }[/math].

As a consequence, one decomposes imaginary time quantities into a Fourier series with period [math]\displaystyle{ \beta }[/math] that determines the spacing of the Fourier modes. For instance the imaginary polarizability can be written as

[math]\displaystyle{ \chi(-i\tau)=\frac1\beta\sum_{m=-\infty}^\infty \tilde \chi(i\nu_m)e^{-i\nu_m\tau},\quad \nu_m=\frac{2m}\beta\pi }[/math]

and the corresponding random-phase approximation of the correlation energy at finite temperature becomes a series over (in this case, bosonic) Matsubara frequencies

[math]\displaystyle{ \Omega_c^{\rm RPA}=\frac12\frac1\beta \sum_{m=-\infty}^\infty {\rm Tr}\left\lbrace \ln\left[ 1 -\tilde \chi(i\nu_m) V \right] -\tilde \chi(i\nu_m) V \right\rbrace,\quad \nu_m=\frac{2m}\beta\pi }[/math]

The Matsubara formalism has the advantage that all contributions to the Green's function and the polarizability are mathematically well-defined, including contributions from states close to the chemical potential [math]\displaystyle{ \epsilon_{n{\bf k}}\approx \mu }[/math], such that Matsubara series also converge for metallic systems.

Although formally convenient, the Matsubara series converges poorly with the number of considered terms in practice. VASP, therefore, uses a compressed representation of the Fourier modes by employing the Minimax-Isometry method.^[20] This approach converges exponentially with the number of considered frequency points.

References

[harl:2008-1] J. Harl and G. Kresse, Phys. Rev. B 77, 045136 (2008).

[bohm:pr:82-2] D. Bohm and D. Pines, J. Phys. 82, 625 (1951).

[gell-mann:pr:106-3] M. Gell-Mann and K. Brueckner, J. Phys. 106, 364 (1957).

[feynman:pr:76-4] R. P. Feynman, J. Phys. 76, 749 (1948).

[gell-mann:pr:84-5] M. Gell-Mann and F. Low, J. Phys. 84, 350 (1951).

[mattuck:2012-6] R. D. Mattuck, Dover Books on Physics (2012).

[negele:1988-7] J. Negele and H. Orland, Frontiers in Physics (1988).

[fetter:2003-8] A. L. Fetter and J. D. Walecka, Dover Books on Physics (2003).

[wick:1950-9] G. C. Wick, Phys. Rev. 80, 268 (1950).

[kaltak:2014-10] M. Kaltak, J. Klimeš, and G. Kresse, J. Chem. Theory Comput. 10, 2498-2507 (2014).

[adler:1962-11] S. L. Adler, Phys. Rev. 126, 413 (1962)

[wiser:1963-12] N. Wiser, Phys. Rev. 129, 62 (1963)

[rojas:prl:1995-13] H. N. Rojas, R. W. Godby, and R. J. Needs, Phys. Rev. Lett. 74, 1827 (1995).

[kaltak:prb:2014-14] M. Kaltak, J. Klimeš, and G. Kresse, Phys. Rev. B 90, 054115 (2014).

[harl:2010-15] J. Harl, L. Schimka, and G. Kresse, Phys. Rev. B 81, 115126 (2010).

[klimes:2014-16] J. Klimeš, M. Kaltak, and G. Kresse, Phys. Rev. B 90, 075125 (2014).

[riemelmoser:jcp:2020-17] S. Riemelmoser, M. Kaltak, and G. Kresse, J. Chem. Phys. 152(13), 134103 (2020).

[KohnLuttinger:PR:1960-18] W. Kohn and J. M. Luttinger, Phys. Rev. 118, 41 (1960).

[Matsubara:PTP:1955-19] T. Matsubara, Prog. Theor. Phys. 14, 351 (1955).

[Kaltak:PRB:2020-20] M. Kaltak and G. Kresse, Phys. Rev. B. 101, 205145 (2020).

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]