Jump to content

Requests for technical support from the VASP team should be posted in the VASP Forum.

RPA/ACFDT: Correlation energy in the Random Phase Approximation

From VASP Wiki

ACFDT stands for the adiabatic connection fluctuation dissipation theorem and is an alternative way to derive the energy expression for the correlation energy in the random phase approximation (RPA). In the following, the diagrammatic description is presented. For the ACFDT formulation, the reader is referred to the literature.[1] There is also a lecture introducing RPA on our YouTube channel.

Diagrammatic approach to the correlation energy

The correlation energy Ec is defined as the missing piece of the Hartree-Fock energy Ex to the total energy, that is Etot=Ex+Ec. The exact form of Ec is unknown and can be calculated only approximately for a realistic system. The Random Phase Approximation (RPA) is such an approximation that provides access to Ec. The RPA was first studied by Bohm and Pines for the homogeneous electron gas and was later recognized by Gell-Mann and Brueckner as an approximation of Ec that can be expressed in the same language as Feynman used a few years earlier to describe the positron. [2][3][4]

Feynman's diagrammatic approach is based on quantum field theory (QFT), which in turn is based on the Gell-Mann and Low theorem. This theorem states that the eigenstate of an interacting Hamiltonian can be expressed in terms of the eigenstates of the non-interacting one.[5] For this reason, each diagrammatic calculation, like the RPA or GW, requires the solution of the non-interacting Hamiltonian H0 of the system, like for instance the Hartree-Fock energies and orbitals or the solutions of the Kohn-Sham Hamiltonian ϵn𝐤,ϕn𝐤.

QFT is commonly formulated in the Dirac (also known as interaction) picture, where dynamics described by the interaction part V^ of the fully interacting Hamiltonian H^=H^0+V^ are singled out via time-dependent operators like V^(t)=eiH^0tV^eiH^0t. These operators act on states like the non-interacting groundstate of the system |Ψ0, causing fluctuations at a specific point in time. The main idea of QFT is to understand observations, which can be measured by an observer, as a collective phenomenon of all possible fluctuations.[6]

Thereby, fluctuations are understood as the creation of virtual electrons (and holes) that interact with each other and are annihilated after some time. Formally this is achieved by introducing creation ψ^(𝐫,t) and annihilation operators ψ^(𝐫,t) that satisfy following relations

ψ^(𝐫,t)|Ψ0=0=Ψ0|ψ^(𝐫,t)

{ψ(𝐫,t),ψ(𝐫,t)}=ψ(𝐫,t)ψ(𝐫,t)+ψ(𝐫,t),ψ(𝐫,t)=iδ(𝐫𝐫)

{ψ(𝐫,t),ψ(𝐫,t)}=0={ψ(𝐫,t),ψ(𝐫,t)}.

The first relation defines the non-interacting groundstate |Ψ0 as the Fermi vacuum (the groundstate in the absence of any fluctuations), while the second and third anti-commutator relations are a consequence of the Pauli principle. In fact, all operators that describe measurable quantities of a system of interacting electrons can be represented in terms of ψ(𝐫,t) and ψ(𝐫,t) alone; additional objects are not necessary.

However, the time-ordering operator

T^A^(t)B^(t)=Θ(tt)A^(t)B^(t)Θ(tt)B^(t)A^(t),

where Θ(t) is the unit step function, and the time-evolution operator

S^(t,t0)=T^eit0tV^(t)dt

are helpful quantities, since they allow to formulate the Gell-Mann and Low theorem as follows.

Gell-Mann and Low theorem

Using adiabatic coupling of the interaction V^(t)V^η(t)=eη|t|V^, Gell-Mann and Low proved that the vectors

|ΩνΩν|Ψν=limη0S^η(0,)|ΨνΩν|Ψν

are the eigenstates of the interacting Hamiltonian.[5]

We follow the common literature and suppress the infinitesimal η in the following bearing in mind that the limit η0 is performed at the very end of the calculation.[7][8]

Diagrammatic perturbation theory

A consequence of the Gell-Mann and Low theorem, is the following form of the interacting groundstate energy[8]

Etot=E0=Ω0|H^|Ω0=Ψ0|S^(,)H^|Ψ0Ψ0|S^(,)|Ψ0,

which can be seen as starting point of diagrammatic perturbation theory. The expression above is used to derive all possible approximations by expanding the time-evolution operator S^ into a series. The resulting matrix-elements of creation and annihilation operators are evaluated term by term using the canonical anti-commutator relations defined above (Wick's theorem[9]). It follows that all terms in perturbation theory are expressed by only two quantities, the non-interacting Feynman propagator

G0(1,2)=in𝐤ϕ(𝐫2)ϕ*(𝐫1)ei(ϵn𝐤ϵF)(t2t1)[fn𝐤Θ(t2t1)(1fn𝐤)Θ(t1t2)],1=(𝐫1,t1),2=(𝐫2,t2)

and the Coulomb interaction

V(1,2)=δ(t1t2)|𝐫1𝐫2|.

Then, each term in the series corresponds to an integral over space-time coordinates (𝐫,t).

Feynman diagrams are used to illustrate which terms are considered in the perturbation series. The illustration is usually achieved with so-called Feynman rules that map a specific diagram to an integral (and vice versa). For instance the second order diagram

is also known as the direct Møller-Plessett term and stands for following integral

EdMP(2)=d(1,2,3,4)G0(1,2)G0(2,1)V(1,3)V(2,4)G0(3,4)G0(4,3),d(1,,4)=d𝐫1dt1d𝐫4dt4

All Feynman rules can be found in the book of Negele and Orland or elsewhere.[7][8]

The random-phase approximation

The RPA is obtained from neglecting all second and higher order terms in the perturbation series of the groundstate energy, except of those which can be expressed soley in terms of the independent particle polarizability

χ0(1,2)=iG0(1,2)G0(2,1)

corresponding to the "bubble" diagram

Because of the symmetric time property χ0(t2t1)=χ0(t1t2), the independent particle polarizability is of bosonic character. Because the RPA neglects all non-bosonic terms in the perturbation series, it corresponds essentially to a "bosonization" of the many-body problem for which the n-th order term can be written analytically as[7]

EdMP(n)=12ndω2πTr[χ~0(ω)V]n.

Here, the trace of the matrix product is most effectively done in reciprocal space [χ~0(ω)V](𝐪+𝐆1,𝐪+𝐆2)=𝐆χ~0(𝐪+𝐆1,𝐆,ω)V(𝐪+𝐆,𝐪+𝐆2) using the Fourier transformed polarizability χ~0(𝐪+𝐆1,𝐪+𝐆2,ω), the diagonal Coulomb potential V(𝐪+𝐆1,𝐪+𝐆2)=δ𝐆1𝐆2|𝐪+𝐆1| and the conserved crystal momentum 𝐪 in the first Brillouin zone.

All bubble terms of order n2 can be written in a closed form using the series for the logarithm ln(1x)+x=n=2xnn and define the correlation part of the RPA energy

EcRPA=dω2πTr{ln[1χ~0(ω)V]+χ~0(ω)V}.

There are two first order contributions to the total energy that yield the exact exchange energy Ex=T+Vext+Vh+Vx, which is usually determined separately.

Computational Complexity

The calculation of the RPA integral requires the determination of the independent particle polarizability matrix χ~𝐆𝐆0(𝐪,ωn)=χ~0(𝐪+𝐆,𝐪+𝐆,ωn) on each of the N𝐪 sampling points of the first Brillouin zone for Nω frequency points. The number of frequency points is reduced drastically, by performing the integration over the imaginary frequency axis ωiω.[10]

The independent particle polarizability on the imaginary axis can be determined with two alternative methods.

Quartic scaling RPA: Direct calculation

Direct calculation of χ~0 using the formula of Adler and Wiser[11] [12]

χ~𝐆𝐆0(𝐪,iω)=𝐤BZn,nfn𝐤(1fn𝐤𝐪)ϵn𝐤𝐪ϵn𝐤iωϕn𝐤𝐪|ei𝐆𝐫|ϕn𝐤ϕn𝐤|ei𝐆𝐫|ϕn𝐤𝐪

yields an RPA algorithm that has a computational cost of NωN𝐤2N𝐆4. Because the number of plane waves N𝐆 scales linearly with the system size (number of electrons in the unit cell), the direct calculation of the polarizability is unfavourable for large system sizes, e.g. for more than ~20 atoms in the unit cell.

Cubic scaling RPA: Contraction of imaginary time Green's functions

An alternative way to determine χ~0 is to frist determine imaginary time Green's functions of the form[13]

G0(𝐫,𝐫,iτ)=𝐤BZnϕn𝐤(𝐫)ϕn𝐤*(𝐫)e(ϵn𝐤ϵF)τ[Θ(τ)fn𝐤Θ(τ)(1fn𝐤)]

and to perform afterwards a Fourier transformation into reciprocal and imaginary frequency space of

χ0(𝐫,𝐫,iτ)=G0(𝐫,𝐫,iτ)G0(𝐫,𝐫,iτ).

Although more evolved, this approach has the advantage that the computational cost for the determination of χ~0 scales with NωN𝐤N𝐆3 and is essentially only cubic in system size. The space-time method allows to study relatively large systems with the RPA.[14]

Basis set convergence of RPA-ACFDT calculations

The expression for the ACFDT-RPA correlation energy written in terms of reciprocal lattice vectors reads:

EcRPA=0dω2π𝐪𝐁𝐙𝐆{(ln[1χ~0(𝐪,iω)V(𝐪)])𝐆,𝐆+V𝐆,𝐆(𝐪)χ~0(𝐪,iω)}.

The sum over reciprocal lattice vectors has to be truncated at some 𝐆max, determined by 2|𝐆+𝐪|22me < ENCUTGW, which can be set in the INCAR file. The default value is 23× ENCUT, which experience has taught us not to change. For systematic convergence tests, instead increase ENCUT and repeat steps 1 to 4, but be aware that the "maximum number of plane-waves" changes when ENCUT is increased. Note that it is virtually impossible, to converge absolute correlation energies. Rather concentrate on relative energies (e.g. energy differences between two solids, or between a solid and the constituent atoms).

Since correlation energies converge very slowly with respect to 𝐆max, VASP automatically extrapolates to the infinite basis set limit using a linear regression to the equation: [1][15][16]

Ec(𝐆)=Ec()+A𝐆3.

Furthermore, the Coulomb kernel is smoothly truncated between ENCUTGWSOFT and ENCUTGW using a simple cosine like window function (Hann window function). Alternatively, the basis set extrapolation can be performed by setting LSCK=.TRUE., using the squeezed Coulomb kernel method.[17]

The default for ENCUTGWSOFT is 0.8×ENCUTGW (again we do not recommend to change this default).

The integral over ω is evaluated by means of a highly accurate minimax integration.[10] The number of ω points is determined by the flag NOMEGA, whereas the energy range of transitions is determined by the band gap and the energy difference between the lowest occupied and highest unoccupied one-electron orbital. VASP determines these values automatically (from vasp.5.4.1 on), and the user should only carefully converge with respect to the number of frequency points NOMEGA. A good choice is usually NOMEGA=12, however, for large gap systems one might obtain μeV convergence per atom already using 8 points, whereas for metals up to NOMEGA=24 frequency points are sometimes necessary, in particular, for large unit cells.

Strictly adhere to the steps outlines above. Specifically, be aware that steps two and three require the WAVECAR file generated in step one, whereas step four requires the WAVECAR and WAVEDER file generated in step three (generated by setting LOPTICS=.TRUE.).


Matsubara Formalism: Metallic systems at finite Temperature

The zero-temperature formalism of many-body perturbation theory breaks down for metals (systems with zero energy band-gap) as pointed out by Kohn and Luttinger.[18] This conundrum is lifted by considering diagrammatic perturbation theory at finite temperature T>0, which may be understood by an analytical continuation of the real-time t to the imaginary time axis iτ. Matsubara has shown that this Wick rotation in time tiτ reveals an intriguing connection to the inverse temperature β=1/T of the system.[19] More precisely, Matsubara has shown that all terms in perturbation theory at finite temperature can be expressed as integrals of imaginary time quantities (such as the polarizability χ(iτ)) over the fundamental interval βτβ.

As a consequence, one decomposes imaginary time quantities into a Fourier series with period β that determines the spacing of the Fourier modes. For instance the imaginary polarizability can be written as

χ(iτ)=1βm=χ~(iνm)eiνmτ,νm=2mβπ

and the corresponding random-phase approximation of the correlation energy at finite temperature becomes a series over (in this case, bosonic) Matsubara frequencies

ΩcRPA=121βm=Tr{ln[1χ~(iνm)V]χ~(iνm)V},νm=2mβπ

The Matsubara formalism has the advantage that all contributions to the Green's function and the polarizability are mathematically well-defined, including contributions from states close to the chemical potential ϵn𝐤μ, such that Matsubara series also converge for metallic systems.

Although formally convenient, the Matsubara series converges poorly with the number of considered terms in practice. VASP, therefore, uses a compressed representation of the Fourier modes by employing the Minimax-Isometry method.[20] This approach converges exponentially with the number of considered frequency points.

Related tags and articles

References