ML RCUT2: Difference between revisions

From VASP Wiki
(Hints on how to vary ML_RCUT1)
mNo edit summary
Line 12: Line 12:


and <math>g\left(\mathbf{r}\right)</math> is an approximation of the delta function. A basis set expansion of <math>\rho^{(3)}_i(r)</math> yields the expansion coefficients <math>p_{n\nu l}^{i}</math> which are used in practice to describe the atomic environment (see [[Machine learning force field: Theory#Descriptors|this section]] for details). The tag {{TAG|ML_RCUT2}} sets the cutoff radius <math>R_\text{cut}</math> at which the cutoff function <math>f_{\mathrm{cut}}\left(r_{ij}\right)</math> decays to zero.
and <math>g\left(\mathbf{r}\right)</math> is an approximation of the delta function. A basis set expansion of <math>\rho^{(3)}_i(r)</math> yields the expansion coefficients <math>p_{n\nu l}^{i}</math> which are used in practice to describe the atomic environment (see [[Machine learning force field: Theory#Descriptors|this section]] for details). The tag {{TAG|ML_RCUT2}} sets the cutoff radius <math>R_\text{cut}</math> at which the cutoff function <math>f_{\mathrm{cut}}\left(r_{ij}\right)</math> decays to zero.
{{NB|mind|The cutoff radius determines how many neighbor atoms <math>N_\mathrm{a}</math> are taken into account to describe each central atom's environment. Hence, important features may be missed if the cutoff radius is set to a too small value. On the other hand, a large cutoff radius increases the computational cost of the descriptor as the cutoff sphere contains more neighbor atoms. A good compromise is always system-dependent, therefore different values should be tested to achieve satisfying accuracy '''and''' speed.}}
{{NB|mind|The cutoff radius determines how many neighbor atoms <math>N_\mathrm{a}</math> are taken into account to describe each central atom's environment. Hence, important features may be missed if the cutoff radius is set to a too small value. On the other hand, a large cutoff radius increases the computational cost as the cutoff sphere contains more neighbor atoms. A larger cutoff can also significantly degrade the learning efficiency. A good compromise is system-dependent, therefore different values should be tested to achieve satisfying accuracy '''and''' speed.}}


For materials (i.e. organic molecules, polymers) containing only H, B, C, O, N, and F, the learning efficiency might increase when ML_RCUT2 is set to values around 4  <math>\AA</math> (that is, less training data are required to achieve a specific accuracy), wheres for materials with very long bonds, a larger value around 6  <math>\AA</math> might improve the accuracy,
For materials (i.e. organic molecules, polymers) containing only H, B, C, O, N, and F, the learning efficiency might increase when ML_RCUT2 is set to values around 4  <math>\AA</math> (that is, less training data are required to achieve a specific accuracy), wheres for materials with very long bonds, a larger value around 6  <math>\AA</math> might improve the accuracy,

Revision as of 12:06, 9 May 2023

ML_RCUT2 = [real]
Default: ML_RCUT2 = 5.0 

Description: This flag sets the cutoff radius for the angular descriptor in the machine learning force field method. The unit of the cut-off radius is .


The angular descriptor is constructed from

and is an approximation of the delta function. A basis set expansion of yields the expansion coefficients which are used in practice to describe the atomic environment (see this section for details). The tag ML_RCUT2 sets the cutoff radius at which the cutoff function decays to zero.

Mind: The cutoff radius determines how many neighbor atoms are taken into account to describe each central atom's environment. Hence, important features may be missed if the cutoff radius is set to a too small value. On the other hand, a large cutoff radius increases the computational cost as the cutoff sphere contains more neighbor atoms. A larger cutoff can also significantly degrade the learning efficiency. A good compromise is system-dependent, therefore different values should be tested to achieve satisfying accuracy and speed.

For materials (i.e. organic molecules, polymers) containing only H, B, C, O, N, and F, the learning efficiency might increase when ML_RCUT2 is set to values around 4 (that is, less training data are required to achieve a specific accuracy), wheres for materials with very long bonds, a larger value around 6 might improve the accuracy,

Related tags and articles

ML_LMLFF, ML_RCUT1, ML_W1, ML_SION1, ML_SION2, ML_MRB1, ML_MRB2


Examples that use this tag