Lecture 5 Modeling & Basis Sets

Lecture 5 Modeling & Basis Sets
Atomic Basis Set 武晓君江俊

Before calculation Isolated system .vs. Bulk Material

Surface and interface

Surface adsorption

Hetero-interface

Solution

Nanotube、Graphene

Basis Sets Gaussian Basis Set Numeric Basis Set Plane-Wave Basis Set
a set of mathematical functions (basis functions), linear combination of which yields molecular orbitals Gaussian Basis Set Numeric Basis Set Plane-Wave Basis Set Mixed Basis Set

What and Why? Xμis basis set function, aμ is coefficient
why expand molecular wave function with basis sets? Computer needs it. Numerical Calculation on: Grid  Huge computational cost Single electron wavefunction is rewritten as the linear combination of limited analysis functions 基组是量子化学专用语。量子化学中的基组是在量子化学中用于描述体系波函数的若干具有一定性质的函数，基组是量子化学从头计算的基础，在量子化学中有着非常重要的意义。基组的概念最早脱胎于原子轨道，随着量子化学的发展，现在量子化学中基组的概念已经大大扩展，不局限于原子轨道的原始概念了。在量子化学计算中，根据体系的不同，需要选择不同的基组，构成基组的函数越多，基组便越大，对计算的限制就越小，计算的精度也越高，同时计算量也会随基组的增大而剧增 Xμis basis set function, aμ is coefficient

LCAO Approximation (Linear Combination of Atomic Oribtal)
Numerical solution for Hartree-Fock orbital is applicable only for Single atom or Two-atom molecule Orbitals of Two-atom molecule are actually the linear combination of atomic orbitals Example: The sigma orbital of H2   1sA + 1sB For multi-atom system, orbital has to be described with LCAO 数值解只能计算原子和简单双原子的Hartree-Fock轨道, 而且分子轨道, 比如sigma, pi轨道等等可以表示成原子轨道的线性组合

Basis Sets with: {μ} – a set of known functions
For UHF wave-functions two sets of coefficients are needed: if μ  AO  LCAO-MO if μ  AO  LCBF-MO

 is called basis function
Basis Sets  is called basis function  is Normally defined according to atomic position (centralized on atom) More generalized, flexible mathematical form than atomic orbital is Better , More , often resulted in more accurate molecular orbital 这些原子轨道被称为基函数, 它们的中心通常选在分子中的各个原子上, 当基函数足够多足够好时, 我们计算得到的Hartree-Fock分子轨道就相当准确了, 也就是说LCAO近似基本上就被弥补上了.

Fitting a function with polynomials
0.2674 c2 c0 0.408 c2 c4 0.0001 c0 0.755 c2 c4 0.0021 c6

The generalized form of Basis function
Radial part Angular part 我们首先来看基函数的一般形式, 因为我们选择的基函数是原子轨道, 从原子轨道的性质我们知道, 它是球对称的, 相对于其所属原子是球对称的, 比如A原子的原子轨道相对于A原子是球对称的, 而相对于B原子则不是球对称的. 但是只差一个与坐标差有关的平移而已. 现在我们写出原子轨道的基本形式, 这里, psi, r, theta, phi等于径向部分R乘以角度theta部分和角度phi部分, 也可以分成径向部分R和角度部分Ylm, 也就是说, 所有的原子轨道都可以分解成两个部分的乘积, 即径向部分和角度部分.

Angular part: xl ym zn l+m+n=0 1 direction l+m+n=1 3 direction l+m+n=2
因为原子轨道是球对称的, 所以角度部分是恒定的, 就是我们常常所说的s轨道, p轨道, d轨道, f轨道等等, 其形状也是我们每个人所熟悉的, 在其数学表达式中, 我们可以看到, s轨道的角度部分是一个常数, 也就是是各向同性的, 也可以说s轨道只与r的函数有关,它是xlymzn的函数, 其中l+m+n=0, 只有一个角向分量 p轨道是沿着x, y, 或z中的某个方向延展的, 在数学表达式中, 它只与x/r, y/r或z/r有关, 也就是说, 它是xlymzn的函数, 其中l+m+n=1, 有三个角向分量 d轨道是沿着x的二次方, y的二次方, z的二次方, 或xy, yz, zx等方向延展的, 也就是说, 它是xlymzn这个函数中, l+m+n=2, 有六个可能的角向分量, 这就是我们在Gaussian03程序中经常见到的所谓6D, 而我们的 l+m+n=2 6 direction

5D/6D & 7F/10F xl ym zn f orbital : l+m+n=3, 100 direction 5D/6D:
x2, y2, z2 →(x2+y2+z2)→ s orbital xy, yz, zx f orbital : l+m+n=3, 100 direction p orbital 7F/10F: x3, xy2, xz2 →x(x2+y2+z2) y3, y2z, y2x →y(x2+y2+z2) z3, z2x, z2y →z(x2+y2+z2) xyz 原子轨道只有5D, 也就是只有5个角向分量, 那是为什么呢, 这是因为, 从6D的6个角向分量中可以线性组合出一个分量: 其角度部分是(x2+y2+z2)=r2, 这个角向也是自己就是球对称的, 与我们已经存在的s轨道的角度部分是完全相同的, 所以, 独立的d轨道分量只有5个, 这就是5D, 选取5D还是6D一般影响不是太大, 只是积分的时候多一个线性关系而已. f轨道是xlymzn这个函数中l+m+n=3, 有10个可能的角向分量, 也就是Gaussian03中的10F, 而我们知道, 原子轨道只有7个f分量, 这也是因为三个线性组合就可以得到一个p分量, 而这些p分量在我们前面已经存在了, 所以剩下的独立分量就只剩下7个了, 这就是所谓的7f. g轨道是l+m+n=4, 等等从整体来看, 从s轨道到p轨道, 到d轨道, 到f轨道, 到g轨道得到, 按照这个顺序, 轨道的各向异性越来越大, 而我们的原子成键形成分子的过程, 也是一个从各向同性到各向异性的过程, 这种各向异性的变化规律是我们后面添加极化函数的基础. 只要是原子基组, 角度部分都是一样的,不同的仅仅是径向部分R, 比如Slater型函数, Gaussian型函数, 数值基组, 下面我们讲述径向部分.

1. Slater Type Orbitals (STO)
- similar to atomic orbitals of the hydrogen atom more convenient (from the numerical calculation point of view) than AO, especially when n-l≥2 (radial part is simply r2, r3, ... and not a polinom) STO – are labeled like hydrogen atomic orbitals and their normalized form is: LCAO近似中另外一个重要因素是径向部分, 我们知道, 原子在形成分子的过程中, 虽然由于得到或失去部分电子, 或者形成共价键, 其轨道要膨胀或收缩, 以及变形, 但是基本形状还是与原子轨道类似的, 其径向部分也与原子轨道的径向部分类似, 那么直接的做法就是使用类氢原子轨道去作为基函数拟合分子轨道, 对于类氢原子轨道而言, 其1s, 2p, 3d, 4f轨道是单项式, 而其它轨道, 2s, 3s, 等等, 3p, 4p 等等, 4d, 5d等等因为必须与1s, 2p, 3d, 4f等轨道正交而它们的指前表达式都是多项式, 在计算Fock矩阵时, 多项式乘以多项式会产生出很多的线性组合项, 使用起来很不方便, 因此, Slater引入了Slater型轨道, 它只保留了1s, 2p, 3d, 4f等的径向函数, 把它们线性组合就可以得到类氢原子轨道, 也就是Slater型函数可以和类氢原子轨道通过一组公式互相变换. 所以, Slater型轨道是直观上最佳的基函数. Slater型函数的具体形式是这样的, 它们有几个重要的特性: 对于氢原子或类氢离子, 它们是精确的轨道, 在对原子计算中, 它们是非常好的基函数, 它们还有正确的渐近性质, 在r趋于完全大时, 函数趋于零的速度与真实情况是符合的它们在原子核处满足尖点性质, 即在r趋于零时形成尖点它唯一的缺点就是三中心和四中心双电子积分没有可以使用的公式, 必须用数值方法才能进行计算, 比如ADF程序使用的就是Slater型基组, 其多中心积分都是采用数值积分方法, 不过它采用的数值积分方法经过优化, 速度比较快, 另外, 因为使用的是Slater型轨道, 只需要确定Slater指数就可以得到一个基函数, 所以对重金属元素很容易得到可用的基组, 因此在重金属计算中常常被使用, ADF程序自己声称可以应用于所有118个元素. 既然多中心积分需要用数值积分, 而中心多中心积分有是最耗时间的, 不如从一开始就使用数值积分, 使用数值基组, 这就是Dmol程序所采用的, Dmol程序的计算效率也很高. STO provide reasonable representations of atomic orbitals however, they are not well suited to numerical (fast) calculations of especially two-electron integrals their use in practical molecular orbital calculations has been limited

STO Advantages: Physically, the exponential dependence on distance from the nucleus is very close to the exact hydrogenic orbitals. Ensures fairly rapid convergence with increasing number of functions. Use only one zeta parameter to fit the orbital Disadvantages: Three and four center integrals cannot be performed analytically. No radial nodes. These can be introduced by making linear combinations of STOs. Practical Use: Calculations of very high accuracy, atomic and diatomic systems. Semi-empirical methods where 3- and 4-center integrals are neglected.

STO与类氢离子的径向函数 1s, 2p, 3d, 4f的指前因子为单项式, 它们之间相互正交
为了保证其它的s轨道(2s, 3s,...)与1s正交, 其它的s 轨道的指前因子都是多项式, 3p, 4p, ..., 4d, 5d, ..., 5f, 6f, ... 也是如此多项式在积分时*d会分解成N2项, 计算比较麻烦, 于是Slater引入了Slater型原子轨道, 它相当于把类氢离子径向波函数正交化限制去掉, 而要求其指前因子为单项式!!! (为分子计算提供条件)

Radial basis function of Hydrogen-like ion

2. Gaussian Type Basis Sets (GTO)
Add different factors to Gaussian function, to describe orbital of s, p, d, …. 斯莱特型基组就是原子轨道基组，基组由体系中各个原子中的原子轨道波函数组成。斯莱特型基组是最原始的基组，函数形式有明确的物理意义，但是这一类型的函数，数学性质并不好，在计算多中心双电子积分时，计算量很大，因而随着量子化学理论的发展，斯莱特型基组很快就被淘汰了. 高斯型函数在计算中有较好的性质，可以将三中心和四中心的双电子积分轻易转化为二中心的双电子积分，因而可以在相当程度上简化计算

Production of two G function is a G’function
GTO advantages： Production of two G function is a G’function Index is r2 function，can be decomposed into x2+y2+z2，3-D integration is simplified into three 1-D integration. Analytic form for all two-e integral GTO disadvantages Compared with slater function, decay of G function is faster. At nuclear point, there is no a singular point of curve (CUSP) GTO is not a real orbital, No radial nodes 但是高斯型函数与斯莱特型函数在r=0 处的行为差异较大，直接使用高斯型函数构成基组会使得量子化学计算的精度下降。

GTO multiple-center integration
GTO multiplication rule: Product of two center functions  one center function 我们看到, Slater型轨道的缺陷是Hartree-Fock方法中最经常出现的多中心积分无法用公式表达出来, 那么什么函数才能胜任呢? 1950年, Boys提出用Gaussian型函数作为基函数来展开分子轨道, 对于Gaussian函数, 存在所谓的乘积定理, 它可以把双中心函数的乘积变成单中心的乘积, 从而把多中心乘积积分通过多次使用双中心化单中心的方法就可以积分出来. 比如, 对于两个中心在A和B的Gaussian函数, 可以把它们变成中心在P处的Gaussian函数, P到A和B的距离有Gaussian函数的指数的系数确定, 其指前因子是有指数系数和AB间的距离确定. 这样就把双中心积分变成单中心积分了. 通过在Gaussian函数前面加上不同的因子, 可以得到对应于s, px, py, pz, dxy, dyz, dzx等等的Gaussian函数, 相对于Slater型函数, Gaussian函数还有一个优点, Slater型轨道的指数上是r的函数, 而r等于(x2+y2+z2)1/2, 这样就把x, y和z三个方向耦合起来了; 而Gaussian函数的指数是r2的函数, 而可以分解成x2+y2+z2, 这样就可以把三维积分变成独立的三个一维积分, 可以大大简化计算; 这些都是Gaussian函数的优点!

GTO Basis Sets Define a set of functions with specific exponent indexes and coefficients for certain atoms Coefficient Exponent index Type of Basis set Minimum type basis Double-zeta, Triple-zeta basis split-valence multi-ζ basis Polarization function diffuse function 有了基函数这些基本组件, 我们就可以去构造基组了, 对于使用Slater型轨道或数值基函数的程序, 基组是对所有原子给出的一组指数, 而对于Gaussian型函数, 基组是对一些原子给出的一组指数和收缩系数, 对于使用Gaussian函数的程序, 现在已经发展出很多系列的基组. 基组的类型主要有以下几种, 最小基组, 双zeta和三zeta基组等等, 价层分裂基组, 极化函数, 弥散函数.

Count only real STO in neutral atom
2. 1 Minimum type basis: contract Gaussian basis set（STO-nG）： Linear combination of GTO to mimic one STO primitive basis function Count only real STO in neutral atom Example: Carbon atom: 1s, 2s, 2px, 2py, 2pz STO-3G Combine 3 GTO for one STO Acquire exponent indexes from simulations of atoms with adjustment base on some typical molecules Also called as Single-zeta basis sets (zeta= , exponent index of STO) 为了弥补Gaussian函数的缺陷, 我们可以用几个指数不同的Gaussian型函数拟合一个Slater型轨道, 这样其在r大的情况下的衰减特征基本上能完全弥补上, 同时在原子核处尖点的性质可以大部分弥补上, 一般而言, 在用3个Gaussian函数拟合Slater型轨道时基本上就很好了, 这样的函数用STO-3G, 大家可以看到, 这时拟合效果的确是不错的, 首先, 对于原子和分子计算, 基函数最小的基组就是最小基组, 在这个基组中, 只有中性原子的轨道壳层(nl)才给定基函数, 比如对于C原子, 有1s, 2s, 2p 轨道, 其中2p包括了(2px, 2py, 2pz), 也就是每个原子的基组中的基函数线性组合后得到的是球对称的函数, 对于Fe, 有1s, 2s, 2p, 3s, 3p, 4s, 3d轨道, 3d有5个基函数. 这样的最小基组有一个系列, STO-nG, n=1, 2, 3, 4, 5, 6. 也就是由1, 2, 3, 4, 5, 或6个Gaussian函数去拟合所需要的Slater型轨道, 其中最常用的是STO-3G, 其中的指数是从原子得到, 然后用一些典型的分子去调整, 得到一组对分子计算作优化的指数. 这样的最小基组也称为单zeta基组, zeta, 也就是希腊字母的zeta, 是Slater型轨道的指数, 单zeta就是对每个轨道壳层只给出一个Slater型轨道. 最小基组, 从字面上我们也知道, 不存在比它还小的基组, 如果比它还小, 基函数的数目就小于体系电子的数目了, 就根本无法描写所有的电子了.

n: the number of GTO for one STO
n GTO， STO-nG (n = 2 ~ 6) n: the number of GTO for one STO STO-3G: not flexible enough to describe valence orbital Not accurate for core electrons ( with strong singular point effect) STO-6G: is necessary for core electrons. STO-8G: is necessary for Ga,Ge, As, Se, Br, Kr 我们从化学上知道, 价层轨道能量比较高, 其电子分布相对弥散, 因此尖点特征不很明显, 因此用STO-#G型的函数描写就可以了, 但是内层轨道的能量很低, 电子主要集中在距离原子核附近, 因此尖点特征非常重要, 仅仅用STO-3G就不行了, 精度比较低, 一般要用6个Gaussian函数去拟合一个Slater型轨道, 也就是STO-6G, 其实, 对于Ga,Ge, As, Se, Br, Kr, 这些第四周期的p族元素, 内层轨道要用STO-8G. 现在, 我们就可以用Gaussian型函数构造了进行分子计算的基函数了. 其中的每个Gaussian函数称为素基函数.

Double each minimal basis to be two
2. 2 Double-zeta (DZ), Triple-zeta (TZ) basis sets Electron distribution in molecules can differ significantly from the one in the atoms and the chemical bond may introduce directionalities which can not be described by a minimal basis. Double each minimal basis to be two One set with big index (close to atomic core) & One set with small index (away from atomic core) Radial expansion and contraction Atomic charge density will contract when a atom donate electrons during molecule formation Atomic charge density will expand when a atom attain electrons during molecule formation employs two s-functions for hydrogen (1s and 1s'), four s- functions (1s, 1s', 2s and 2s') and two p-functions (2p and 2p') for first row elements, and six s-functions and four p- functions for second row elements 最小基组当然是LCAO近似中最粗略的近似了, 在很多情况下是无法满足我们的要求的, 甚至其定性的结果都是错误的. 这是因为在原子形成分子的过程中, 其电子分布要重新分布, 因此对应的轨道就不仅仅是原子情形下的轨道了, 并且在轨道得到电子的时候要膨胀, 失去电子的时候要收缩, 我们的基组怎样才能适应这个要求呢? 第一步的改进就是使用双Zeta基组, 就是每个轨道壳层给出两个Slater型轨道, 其中一个比较紧, 也就是距离原子核比较近, 指数zeta比较大, 另外一组比较松, 也就是距离原子核比较远, 指数zeta比较小, 这样, 在描写电子云变化方面, 在径向上有一定的伸缩性, 这样, 用两组轨道就可以去拟合我们的分子中轨道的情况. 因为原子形成分子时电子的轨道变化是无法预知的, 所以双Zeta基组是得到分子的可信波函数的必要条件! 当然, 三zeta和更多zeta的基组可以进一步提高分子波函数的精度, 它们在精确计算中是必须的.

DZ enable charge density expansion and contraction along radial direction
C1=1, C2=0 + C1<1, C2<1 Zeta Zeta2 C C2 C1=0, C2=1 Away from core Close to core

2.3 split-valence multi-ζ basis sets
(Less basis functions, lower computation cost and time consuming) Use multi STO to simulate one Atomic Orbital Inner orbital: one STO (varies slightly during the formation from atoms to molecule) Valence orbital: multi STO 3-21G Inner (1s ) : 3 GTO -> 1 STO -> AO Valence (2s, 2p): 2GTO -> 1STO(close) 1GTO -> 1STO(far) 1 STO(close) + 1 STO(far) -> AO 6-311G Inner (1s ) : 6GTO -> 1STO -> AO Valence (2s, 2p): 3GTO -> 1STO(close) 1GTO -> 1STO(far1) 1GTO -> 1STO(far2) 1 STO(close) + 1 STO(far1) + 1 STO(far2) -> AO 3-21G basis contains the same number of primitive GTOs as the STO-3G, however, it is much more exible as there are twice as many valence functions which can combine freely to make MOs. 在双zeta基组的基础上, 我们希望能尽可能减少基组中基函数的数目, 因为基函数越少, 我们的计算成本越低, 计算就越快. 从化学上来看, 在原子形成分子的过程中, 只有价层轨道参与成键, 其轨道变化比较大, 而内层轨道在原子形成分子中几乎不变, 那么我们可以在最小基组的基础上, 只对价层轨道进行加倍, 而内层使用最小基组, 这样就融合和最小基组和双zeta基组的优点, 而有没有损失什么化学精度. 这种基组是现在计算中使用的主流基组, 主要有3-21G基组, 它的内层用一个Slater型函数, 这个STO是用3个Gaussian型函数拟合的, 其价层的近核的STO是用两个Gaussian函数拟合, 远离核的只使用了一个Gaussian函数. 这是最小的可信基组了. 还有一个是6-31G基组, 内层是一个Slater型轨道, 用6个Gaussian函数拟合, 价层的近核Slater型轨道用3个Gaussian函数拟合, 远离核的用1个gaussian函数. 6-31G基组是最常用的基组.

2.4 Polarization function： Formation from atom to molecule:
changes in both radial and angular part Anisotropic structure, NH3 H2O，dxy orbital increase the overlap effect p function to Hydrogen atom, d function to Carbon atom Without polarization function, NH3 got planar structure 前面我们讲了最小基组, 双zeta基组, 价层分裂基组, 它们的改进都是相对于径向的, 因为原子在形成分子过程中轨道在径向要膨胀或收缩, 但是, 原子在形成分子中还有另外一个重要的变化, 那就是从球形对称型变成非球形对称的, 也就是在角度方面变成了各向异性的了, 为了适应这种变化, 把这种因素考虑进去, 我们就必须使用极化函数, 相对于价层轨道的极化函数! 那么, 什么样的函数是价层轨道的极化函数呢? 从前面对原子轨道的角度部分的分析我们可以看到, s有一个分量, 是球对称的, p有三个分量, 是沿着三个方向延伸的, d轨道有5个分量是沿着9个方向延伸的, f轨道是沿着27个方向延伸的, 可以看到, 是s到p到d到f轨道, 它们的各向异性越来越大, 被极化的程度也越来越大, 因此, 可以把后一个轨道作为前一个轨道的极化函数. 因为极化函数是针对价层而言的, 所以我们一般只需要加一个极化轨道. 比如对氢原子, 要加上p函数, 对碳原子要加上d函数, 对第一过渡金属元素要加上f函数等等. 如果不加极化函数, 只改进径向函数, 使用多zeta基组, Hartree-Fock计算得到的氨分子结构是平面的, 而没有极化函数时, 环丙烷的张力太大. 所以, 在分裂价层基组或双zeta基组的基础上添加极化函数是非常重要的, 相当必要的. 比如, 最常用的有6-31G(d)基组, 它是在6-31G基组的基础上对非氢原子添加d函数, 还有6-31G(d,p)基组, 它是在6-31G基组的基础上, 对非氢原子添加d函数,对氢原子添加p函数. DZP, 带极化是双zeta基组. 现在, 随着计算机运算速度的提高, 分子计算中最小的基组已经变成6-31G(d,p)基组了, 只是在大分子体系的外围我们才采用3-21G或sto-3G等基组.

Effect of Polarization Functions
对H2O，dxy轨道可以显著增加重叠。所谓极化基组就是在劈裂价键基组的基础上添加更高能级原子轨道所对应的基函数，如在第一周期的氢原子上添加p轨道波函数，在第二周期的碳原子上添加d轨道波函数，在过渡金属原子上添加f轨道波函数等等。这些新引入的基函数虽然经过计算没有电子分布，但是实际上会对内层电子构成影响，因而考虑了极化基函数的极化基组能够比劈裂价键基组更好地描述体系。

Polarization makes valence atomic orbitals more flexible in angular part, and thereby help to form bondings with other atom C-O bonding: Polarization orbital : d orbital p of C is polarized toward O p of O is polarized toward C Increase  interactions between O & C

6-31G(d,p)（6-31G**) with d orbital on non hydrogen atom，p on hydrogen
example 6-31G(d)（6-31G*） add d orbital function to 6-31G basis sets, d orbital on non hydrogen atoms 6-31G(d,p)（6-31G**) with d orbital on non hydrogen atom，p on hydrogen DZP Add polarization to DZ basis sets. 6-311G(3df,3pd) with 3 d and 1f on non H, 3 p and 1 d on H Due to the fast development of super-cluster, 6-31G(d,p) has become the smallest basis sets in molecule simulations 3-21G or sto-3G are applied only for large molecules 前面我们讲了最小基组, 双zeta基组, 价层分裂基组, 它们的改进都是相对于径向的, 因为原子在形成分子过程中轨道在径向要膨胀或收缩, 但是, 原子在形成分子中还有另外一个重要的变化, 那就是从球形对称型变成非球形对称的, 也就是在角度方面变成了各向异性的了, 为了适应这种变化, 把这种因素考虑进去, 我们就必须使用极化函数, 相对于价层轨道的极化函数! 那么, 什么样的函数是价层轨道的极化函数呢? 从前面对原子轨道的角度部分的分析我们可以看到, s有一个分量, 是球对称的, p有三个分量, 是沿着三个方向延伸的, d轨道有5个分量是沿着9个方向延伸的, f轨道是沿着27个方向延伸的, 可以看到, 是s到p到d到f轨道, 它们的各向异性越来越大, 被极化的程度也越来越大, 因此, 可以把后一个轨道作为前一个轨道的极化函数. 因为极化函数是针对价层而言的, 所以我们一般只需要加一个极化轨道. 比如对氢原子, 要加上p函数, 对碳原子要加上d函数, 对第一过渡金属元素要加上f函数等等. 如果不加极化函数, 只改进径向函数, 使用多zeta基组, Hartree-Fock计算得到的氨分子结构是平面的, 而没有极化函数时, 环丙烷的张力太大. 所以, 在分裂价层基组或双zeta基组的基础上添加极化函数是非常重要的, 相当必要的. 比如, 最常用的有6-31G(d)基组, 它是在6-31G基组的基础上对非氢原子添加d函数, 还有6-31G(d,p)基组, 它是在6-31G基组的基础上, 对非氢原子添加d函数,对氢原子添加p函数. DZP, 带极化是双zeta基组. 现在, 随着计算机运算速度的提高, 分子计算中最小的基组已经变成6-31G(d,p)基组了, 只是在大分子体系的外围我们才采用3-21G或sto-3G等基组.

2.5 Diffuse function Diffusive s，p orbital
Suitable for systems with diffusion electrons ions，with long paired electrons Negative charged systems Excited system System with low ionization energy Acid system Atom with high electronegativity (F, Cl), In dipole calculations …. 对于我们常见的稳定化学体系, 6-31G(d,p)就可以得到不错的结果了, 但是化学体系是多种多样, 我们在理论计算研究中, 不仅对稳定的化学体系感兴趣, 还对其它体系感兴趣, 比如阴离子, 弱作用, 包括氢键和范德华作用, 电子亲和能, 电子的激发能, 在这些体系中原子轨道有可能很大的膨胀, 或者远距离相互作用是十分重要的, 这时候, 仅仅按照常规思路去增加原子轨道的个数, 也就是采用多zeta基组, 改善效果并不明显, 这时候我们就需要对基组添加弥散函数, 它们的指数一般在 , 对p轨道在, 而通常的zeta在0.5以上, 这样的函数弥散的范围非常大, 比如zeta等于0.1时, 氢原子的轨道的最大几率分布半径将达到5埃, 而我们常见的X-H化学键在1.3埃以下, 因此弥散轨道可以考虑远距离相互作用, 可以考虑电子的激发, 可以考虑松散是电子分布. 常用的基组就是在6-31G(d,p)的基础上添加弥散函数, 比如6-31+G(d,p)是对非氢原子添加弥散的s和p函数; 6-31++G(d,p)是对氢添加弥散的s函数, 对非氢原子添加弥散的s和p函数. 弥散基组是对劈裂价键基组的另一种扩大。在高斯函数中，变量zeta对函数形态有极大的作用，当zeta的取值很大时，函数图像会向原点附近聚集，而当zeta取值很小的时候，函数的图像会向着远离原点的方向弥散，这种zeta很小的高斯函数被称为弥散函数。所谓弥散基组就是在劈裂价键基组的基础上添加了弥散函数的基组，这样的基组可以用于非键相互作用体系的计算 Polarization function improves the angular distribution of valence orbital Diffuse function improves the radial distribution of valence orbital

Add GTO with very small exponent index factor
2.5 Diffuse function Add GTO with very small exponent index factor Necessary for computing Electron affinity, Gas acidity, weak interactions (hydrogen bond, Van der Waals) 6-31+G – Add s and p orbital functions on non hydrogen atom 6-31++G – Add s functions on hydrogen atom, while s and p orbital functions on non hydrogen atom 对于我们常见的稳定化学体系, 6-31G(d,p)就可以得到不错的结果了, 但是化学体系是多种多样, 我们在理论计算研究中, 不仅对稳定的化学体系感兴趣, 还对其它体系感兴趣, 比如阴离子, 弱作用, 包括氢键和范德华作用, 电子亲和能, 电子的激发能, 在这些体系中原子轨道有可能很大的膨胀, 或者远距离相互作用是十分重要的, 这时候, 仅仅按照常规思路去增加原子轨道的个数, 也就是采用多zeta基组, 改善效果并不明显, 这时候我们就需要对基组添加弥散函数, 它们的指数一般在 , 对p轨道在, 而通常的zeta在0.5以上, 这样的函数弥散的范围非常大, 比如zeta等于0.1时, 氢原子的轨道的最大几率分布半径将达到5埃, 而我们常见的X-H化学键在1.3埃以下, 因此弥散轨道可以考虑远距离相互作用, 可以考虑电子的激发, 可以考虑松散是电子分布. 常用的基组就是在6-31G(d,p)的基础上添加弥散函数, 比如6-31+G(d,p)是对非氢原子添加弥散的s和p函数; 6-31++G(d,p)是对氢添加弥散的s函数, 对非氢原子添加弥散的s和p函数. 弥散基组是对劈裂价键基组的另一种扩大。在高斯函数中，变量zeta对函数形态有极大的作用，当zeta的取值很大时，函数图像会向原点附近聚集，而当zeta取值很小的时候，函数的图像会向着远离原点的方向弥散，这种zeta很小的高斯函数被称为弥散函数。所谓弥散基组就是在劈裂价键基组的基础上添加了弥散函数的基组，这样的基组可以用于非键相互作用体系的计算 Homework: G（3df，2pd）for H2O, How many orbitals? How many primitive GTO?

2.6. More basis sets : other named rules
D95, D95v : Designed by Dunning, with nine s primitive and five p primitive New notation: write to (6,311) Some old, but still used. TZV: triple-zeta valence, there are three valence contractions, such as 6-311G SZ: single zeta DZ: Double Zeta aug-cc-pVDZ: aug denotes an augmented basis (diffuse included) cc denotes correlated-consistent basis, optimized for cc p denotes polarization function included. VDZ denotes valence double zeta CBS: Complete Basis Set 前面我们对径向和角度两个方面的基组改进分别进行了讨论, 那么能不能把二者结合起来, 得到一组逐渐改善LCAO近似的基组呢? cc-pVXZ就是这样的一系列基组, 其中X可以是D, T, Q, 5, 6, 它在径向和角度两方面是改善是相互一致的, 并且可以从它外推到完备基组. 我们来看看这是什么样的基组, 首先看cc-pVDZ, 它是双zeta, 对非氢原子添加d函数, 对氢原子添加p函数, cc-pVTZ, 使用价层三重分裂的基组, 且对非氢原子添加2个d函数和1个f函数, 对氢原子添加两个p函数和1个d函数还有cc-pVQZ, cc-pV5Z, cc-pV6Z, 它们有一个共同的规律, 就是价层轨道的zeta数目从2增加到6, 第一极化轨道从1增加到5, 第二极化轨道从0增加到4, 等等, 所以, 它在径向和角度两个方面的改进是一致的, 当然, 在cc-pVXZ的基础上还可以添加弥散函数, 就变成了Aug-cc-pVXZ, 随着计算机近似能量的提高, 这种基组越来越多地用在了分子计算中, 以部分消除由于基组带来的问题, 的确, 在实际的计算过程中经常出现这样的情况, 在6-31G(d,p)下得到的势能面极小点, 在大基组下, 比如Aug-cc-pVTZ下变成鞍点, 甚至是二级鞍点. 所以做计算不得不系统地去做, 否则文章发表出来后白纸黑字, 后来被别人证明计算结果是错误的, 自己都很不好意思. 现在很多方面都采用了这个系列的基组, 用得比较多的是Aug-cc-pVTZ基组, 比如对氢键和范德华作用, 可信的结果必须是MP2/Aug-cc-pVTZ得到的. A basis set designed to extrapolate energies to the infinite basis set limit

3. Numerical Atomic Basis Sets
Atomic orbital as basis function Atomic basis is replaced with numerical data on grid advantages Use atomic orbital The overlap effect can be reduced MIN: minimal, one basis function for one atomic orbital DN: double basis: minimal basis plus a set of valence electron orbitals DND: DN plus polarization function of d orbital DNP: DND plus Hydrogen polarization TNP: DNP plus a set of valence electron orbitals and polarization orbital

Pseudopotential (effective potential) basis
Core orbitals are stable during chemical reactions Valence orbitals are affected by both nucleus and core electrons Build Pseudopotentia to account for the effective electrostatic potential induced by nucleus and core electrons Advantages: Use less basis functions Advantages: Cover relativistic effect for heavy atoms 最后要讲一下赝势基组, 也叫有效势基组, 这在计算重元素中是非常重要和必要的, 这是因为6-31G系列基组只给出到36号元素, 即只有前四周期的元素有6-31G基组. 赝势基组是基于化学考虑, 在原子形成分子的过程中, 内层轨道没有大的变化, 只是为价层轨道提供了一个有效势能, 价层轨道同时受到原子核以及内层电子的共同作用. 那么我们可以构造一个有效势, 来代替原子核和内层电子对价层电子的静电势. 这样, 我们只需要对价层电子给出基函数, 这样基函数数目减少, 计算就可以很快了. 但是其唯一的缺陷就是引入了近似, 就是用一个近似的静电势去代替原子核和内层电子对价层的作用. 因此, 对于我们有完全基组6-31G的元素, 它的结果的准确和可靠性是比较差的, 不过, 对重元素, 它却是必须的, 因为我们讲过, 内层轨道的尖点特征明显, 并且随着原子序数的增加, 其尖点特征更加突出, 比如我们对1-30号元素, 可以对内层轨道采用6个gaussian函数拟合, 对31-36号却需要8个gaussian函数, 对第5, 6, 周期, 需要的更多, 甚至需要用十几个gaussian函数, 而这些内层轨道在原子形成分子时几乎完全不变. 另外, 随着原子序数的增加, 电子运动的动能增加, 越来越接近光速, 因此相对论效应已经变得非常大, 我们的自旋轨道方法对内层电子几乎失效了, 必须用旋轨偶合方法, 极大地增加了我们计算的难度, 但是却改善不了我们的结果. 这时候, 赝势基组就发挥了重要作用, 它可以把相对论效应包含在赝势参数中, 这样, 对价层电子, 我们感兴趣的轨道, 将可以采用在非重元素中采用的方法来处理了. 因此, 从降低计算难度, 包含相对论效应方面, 赝势有不可替代的作用. 另外提一句, 对第4周期的元素, 相对论效应不明显, 可以不作考虑, 对于第5周期, 能包含相对论效应就包含, 不能也对结果没有太大影响, 但是对第6周期以上, 相对论效应是相当重要的, 必须要包含相对论效应.

Procedures for ECP： 1）Compute highly accurate electron wavefunction with relativistic effect (Dirac Equation) 2）Use pseudo-orbitals to replace valence electron orbitals 3）Generate the discrete numerical values of potential induced by core electron 4）Use 2-7 Gaussian function to fit with the discrete potential values

Treating Core Electrons (ECP, RECP)
CREN Available for SC(4s) through Hs(0s6p6d )., A shape consistent basis set developed by Ermler and coworkers that has a large core region and small valence. SBKJC VDZ Available for Li(4s4p) through Hg(7s7p5d ), a relativistic basis set created by Stevens and coworkers to replace all but the outermost electrons. The double-zeta valence contraction is designed to have an accuracy comparable to that of the 3-21G all-electron basis set. Hay-Wadt MB Available for K(5s5p) through Au(5s6p5d ), this basis set contains the valence region with the outermost electrons and the previous shell of electrons. Elements beyond Kr are relativistic core potentials. This basis set uses a minimal valence contraction scheme. Hay-Wadt VDZ Available for K(5s5p) through Au(5s6p5d ), this basis USING EXISTING BASIS SETS set is similar to Hay±Wadt MB, but it has a double-zeta valence contraction. This set is popular for transition metal modeling. LANL2DZ Available for H(4s) through Pu(7s6p2d2f ), this is a collection of double-zeta basis sets, which are all-electron sets prior to Na. CRENBL Available for H(4s) through Hs(0s3p6d5f ), this is a collection of shape-consistent sets, which use a large valence region and small core region. Dolg Also called Stuttgart sets, this is a collection of ECP sets currently under development by Dolg and coworkers. These sets are popular for heavy main group elements. Lanl1 basis sets: consider only valence electrons. Example: V with 5 basis functions Lanl2 basis sets: consider valence and sub-outer electrons. Example: V with 5+8=13 basis functions

Mixed Basis Sets Due to basis sets limitation:
Example: Mo(CO)2+ , use 6-31+G for C and O, while 6-31G is not available for Mo Due to system size (computation cost) limitation: Large and accurate basis sets for the activation centers while small basis for surroundings.

Choice of Pseudopotential
Mixed basis sets ECP basis sets %mem=32mb %chk=hf #p b3lyp/ gen pseudo=read pop=full scfcyc=500 Mo(CO)2+(Line) 1,6 Mo X 1 1.0 C 1 r O 1 r C 1 r O 1 r r1=2.228 r2=3.329 Mo 0 Lanl2mb **** C O 0 6-31G* Choice of Basis sets Choice of Pseudopotential

Choice of basis sets： Larger basis sets leads to higher accuracy Choose suitable basis sets based on investigation purpose Polarization is needed for ring molecules or -interactions diffuse is needed for charged system or weak interactions Example 1: O2 (O-O bond 1.208A) 6-31G a.u. 6-31G* a.u. dE= a.u. 6-31+G a.u. dE= a.u. Due to  interactions in O2, polarization function is more effective than diffuse function Example 2: O22- 6-31G a.u. 6-31G* a.u. dE= a.u. 6-31+G a.u. dE= a.u. 6-31+G* a.u. dE= a.u. Because of extra charge effect, diffuse function is more effective than polarization

Review: Basis Sets Principle in choosing basis functional：
1. If the form functional fits with the actual wavefunction 2. Necessary basis sets with the least computation cost Two forms of basis functionals: Slater Type Orbitals (STO): Gaussian Type Orbitals (GTO): STO is better in describing positions near or very far from nucleus. GTO normally requires three times of the basis number as that of STO GTO is very efficient in computing integrals, resulting in better performance than STO in general. GTO normally centers at nucleus positions, occasionally at chemical bonds.

Type of Basis set Minimum type basis Double-zeta, Triple-zeta basis
The size of basis sets Minimum basis set: The number of freedom of electrons Double Zeta (DZ): Double of the Minimum basis set Triple Zeta (TZ), Quadruple Zeta (QZ), Quintuple Zeta (5Z), 6Z, … Type of Basis set Minimum type basis Double-zeta, Triple-zeta basis split-valence multi-ζ basis Polarization function diffuse function Polarization function improves the angular distribution of valence orbital Diffuse function improves the radial distribution of valence orbital

Various basis sets for different elements：
STO-3G(H-Xe)；3-21G(H-Xe)；6-21G(H-Cl) 4-31G(H-Ne)；6-31G(H-Kr)；6-311G(H-Kr) D95(H-Cl 除了Na, Mg)；D95V(H-Ne)

Gaussian Program

https://bse.pnl.gov/bse/portal

https://bse.pnl.gov/bse/portal
Coefficient Exponent index Exponent index Coefficient Coefficient

Gaussian Program Atom coordinates Basis number MO初始猜测

Orbital symmetry O: Occupied, V: virtual Orbital energy HOMO
Keywords: Pop=full Orbital energy HOMO AO orbital Orbital coefficient LUMO Homework: Compute O2, analyze molecular orbital 6-31G*, O-O bond A , Pop=full

!1s -- exp(-a r^2) !2px -- exp(-a r^2)x !2py -- exp(-a r^2)y !2pz -- exp(-a r^2)z !D exp(-a r^2)1/2*(3z^2-r^2) !D exp(-a r^2)*dsqrt(3)xz !D exp(-a r^2)*dsqrt(3)yz !D exp(-a r^2)1/2*dsqrt(3)(x^2-y^2) !D exp(-a r^2)*dsqrt(3)xy !F exp(-a r^2)1/2(5z^2-3r^2)z !F exp(-a r^2)1/2*dsqrt(3/2)(5z^2-r^2)x !F exp(-a r^2)1/2*dsqrt(3/2)(5z^2-r^2)y !F exp(-a r^2)1/2*dsqrt(15)(x^2-y^2)z !F exp(-a r^2)dsqrt(15)xyz !F exp(-a r^2)1/2*dsqrt(5/2)(x^2-3y^2)x !F exp(-a r^2)1/2*dsqrt(5/2)(3x^2-y^2)y Some times we used 6D orbital: !D XX-- exp(-a r^2)*xx !D YY-- exp(-a r^2)*yy !D ZZ-- exp(-a r^2)*zz !D XY-- exp(-a r^2)*xy !D XZ-- exp(-a r^2)*xz !D YZ-- exp(-a r^2)*yz

Molecular orbital:

Thanks!

Electronic interactions: Dipole Moments
江俊 (Jun Jiang) University of Science and Technology of China (USTC) 各位专家老师，上午好！我是中国科学技术大学的江俊，很荣幸能有机会向您汇报工作。我们的项目是光催化体系表界面电子态的耦合与演化规律研究，依托单位是中国科学技术大学，推荐单位是中国科学院

Electronic Interactions : Convenient descriptor wanted
Ag8 wavefunction Distribution Point Charge Dipole Moment Easy to compute, measure, and control Low Controllability Insufficient Information 一个最清洁的太阳能利用方式，就是光-化学能转化 2-e Integrals Atomic charge Dipole Approx.

1. To compute dipole couplings
Dipole approximation: Available only when μ << R Example: a bad approximation for coupling of two chlorophylls (distance < 3 Å) in PSI and PSII TrEsp An accurate and efficient way : use electrostatic potential to fit charge density with a limit number of cubes Expensive Strong dependence on grid/cube definition

q Grids based on atomic frames and internal coordinates Grid point charge (dipole) fitted by sampling electrostatic potentials induced by a large number of sub-grids

EHEF (Exciton Hamiltonian with Electrostatic Fluctuations) algorithm
1. Define atomic frame for each pair of atoms, (R, θ,γ) 2. Use a limited number of grids (103 for an amino acid) to describe the charge distribution 3. Each Grid is divided into a large number of sub-grids (106 for an amino acid), to maintain the same precision as 2-e integrals. 4. Electrostatic potentials at random sample positions are calculated with charges in the sub-grids, and are used to fit one point charge for one grid. 5. Charge distributions are saved in the atomic frame. 6. Assume fixed charge distribution in the atomic frame 7. Update the new atomic frame for another MD snapshot with the new set of (R’,θ’,γ’) 8. Update the charge density in the new atomic frame to reuse QM-charge density for all MD snapshtos 9. Describe exciton coupling 2-e Integrals Grid Charges Mini- Dipoles J. Jiang*, et.al. S. Mukamel*, Angew. Chem. Int. Ed. 2010, 49, 9666 Phys. Chem. Chem. Phys. 2011, 13, 2394 J. Phys. Chem. B 2013, 117, 14294

Thanks!

相关一致基组(第二周期) (sp)组极化函数 cc-pVDZ [3s2p] 1d cc-pVTZ [4s3p] 2d1f cc-pVQZ
3d2f1g

关键词：lanl1mb,lanl2mb,lanl1dz,lanl2dz,
cep-4g,cep-31g,cep-121g,sdd等说明：通常对于重元素采使用赝势基组，其中，对于过渡金属原子一般使用lanl系列的基组，对于主族元素采用cep系列的基组， sdd基组相对较少使用；赝势基组的适用范围要较全电子基组来得广，一般除了少数稀土和放射性元素外，均可使用赝势基组；赝势基组是将内层原子的影响用势函数来代替，因此，在使用时必须注意根据所采用的赝势基组，检查体系的总电子数是否正确！对lanl1系列的基组，只考虑价层电子；如V原子为5. 对lanl2系列的基组，除了价层电子外，还考虑了次外层的电子(对部分主族元素例外)；如V原子为5+8=13. 对其它赝势基组，请在使用时仔细核查。

对于使用较多的lanl系列基组，需注意：
对于H~Ne范围的原子，lanl1mb和lan2mb基组等价于 sto-3g最小基组；lanl1dz和lanl2dz基组等价于d95基组; 对于含金属—金属键体系一般使用lanl2系列，而不用 lanl1系列； mb结尾含义为‘minimal basis’，dz结尾含义为‘double -zeta’，故后者基函数数目要较前者多。

在有关键能的计算中，除了计算方法以及基组对最终的结果
有较大影响外，若要得到精确的键能，还需考虑基组叠加误差即BSSE。 BSSE是指源于能量计算中基组带来的误差，它可以粗略理解为化学实验中的背景误差，要消除BSSE，G09中可采用counterpoise关键词来实现(G98是采用message关键词)。

Lecture 5 Modeling & Basis Sets

Similar presentations

Presentation on theme: "Lecture 5 Modeling & Basis Sets"— Presentation transcript:

Similar presentations

About project

反馈

请登录

Auth with social network:

Lecture 5 Modeling & Basis Sets

Similar presentations

Presentation on theme: "Lecture 5 Modeling & Basis Sets"— Presentation transcript:

Similar presentations

About project

反馈