Content-type: text/HTML

### A.M. Tokmachev, A.L. Tchougréeff and I.A. Misurkin L.Ya. Karpov Institute of Physical Chemistry, Vorontsovo pole 10, Moscow 103064 RUSSIA

Semiempirical implementation of APSLG approach

# Semiempirical implementation of APSLG approach

## Abstract

The method using the trial wave function in the form of the antisymmetrized product of strictly localized geminals (APSLG) is developed on the semiempirical level. The Hamiltonian is taken in the MINDO/3 form but with resonance parameters slightly reparameterized. The equilibrium geometries and heats of formation of a series of organic compounds are calculated in its framework and compared with the experimental data and results of the SCF-MINDO/3 approach. Two different schemes of calculation of the ionization potentials are developed and thoroughly tested. The O(N)-scalability and acceptable accuracy are proven for the proposed APSLG-MINDO/3 method.

## Introduction

Nowadays, the problem of relative preference of either localized or delocalized description of the molecules remains unsolved []. The localized picture is closer to the usual chemists' concepts and operates if not the quantities of obvious chemical sense but at least with those experimentalists are used to. At the same time the delocalized picture based on the Hartree-Fock approximation became our days more elaborated and can be applied to a wide range of molecules. In this work we describe our efforts in developing an approach employing a localized wave function but possessing some obvious advantages as compared to the delocalized approach of a similar level of complexity and accuracy.

The localized description can be obtained for molecular electronic structure either a posteriori or a priori. The first approach usually uses the results of a previously performed delocalized Hartree-Fock calculation and achieves the localization by unitary transformations restricted to the space of the occupied molecular orbitals [,,,,]. These methods are normally used either for interpretation of the results of the delocalized approaches in chemical terms or as a starting point for subsequent account of electron correlations. A priori approach provides more possibilities. The localization in these methods is usually achieved by the proper choice of the underlying basis functions and of the form of the N-electron trial wave function. For example, the method of strictly localized molecular orbitals (SLMO) [] takes the trial wave function in the form of Slater determinant with the SLMO's assigned either to chemical bonds or lone pairs and expressed through the several basis hybrid orbitals (HO's) assigned to the same bond (lone pair). This form of the wave function significantly reduces the computational costs but the method of SLMO is proved to be poorer than the standard Hartree-Fock method because the so-called tails (non-local part) of the one-electron wave functions are totally neglected.

Another disadvantage of the SLMO approach is that the large intrabond correlations are not taken into account. The PCILO approach [] takes them into account perturbationally, but the large value of the corrections does not allow for the perturbation treatment of intrabond correlations. However, the intrabond correlations can be covered variationally without significant complications by using the geminal form of the bond building blocks for the molecular electronic wave function. The concept of geminals (two-electron wave functions) was originally introduced by V.A. Fock [,]. The wave function of the molecule with 2n electrons can be represented by the antisymmetrized product of spin geminals (APSG) []. The representation of the electronic wave function of organic molecules as constructed of bond two-electron building blocks was elaborated in the work [] and implemented on the ab initio level by different authors [,]. At the same time this method has been applied only to a very small set of molecules and the results obtained do not allow to estimate the quality of this approach.

Semiempirical implementation allows for some useful expansion of the local geminal-based method. It is well known how important large molecules (especially biological) are. Ab initio approaches (even on the Hartree-Fock level) can not be applied to these systems due to incredible computational costs, which grow proportionally to N4¸N7 with the increasing of the system size (where N is the number of basis one-electron functions). The costs of semiempirical methods grow as the N3 due to diagonalization of the Fock matrix. Even this growth is very rapid when the calculations on biomolecules are concerned. Therefore, an important development would be a construction of methods with linear dependence of the computational costs on the system size (the so-called O(N) methods). Such dependence is achieved, for example, by ignoring some matrix elements of the Fock matrix that results in avoiding the diagonalization of N×N matrix [,] or by using pseudodiagonalization instead of diagonalization. In the case of the neglect of matrix elements it is not counterbalanced. Incidentally, avoiding the diagonalization of N×N matrix and, therefore, the approximately linear dependence of the computational costs on the system size for the semiempirical approaches can be reached by balanced local description of the molecular electronic structure. The typical examples of this statement are the SLMO and PCILO methods in their semiempirical implementation. The geminal method of the local description of the electronic structure provides the possibility to obtain the computationally fast variational method with proper account of intrabond correlations.

The general term ''geminal approach'' can be further specified as the antisymmetrized product of strictly localized geminals (APSLG) []. It is close to other pair theories, for example, to the GVB theories GVB. The comparison of the APSLG and GVB ab initio implementations exemplified by methane dissociation is given in Refs. [,]. It is stressed in Ref. [] that the geminal approach is particularly well adapted to the study of ion-molecule reactions and can be applied to an additive separation of the total electronic energy into components associated with the various bonds and lone pairs. At the same time it is essential that only a limited class of molecules can be fairly described by the electron pair picture. The structure of inorganic and many organic (for example, aromatic) compounds does not fit into the classical picture of definite chemical bonds and electron pairs dating back to Boutleroff [] and Lewis [].

The important aspect in the derivation of a priori local approaches is the method they employ to obtain the HO's. The latter are linear combinations of several AO's centered on the same atom. There are different approaches which are based either on the geometric consideration when one attempts to figure out the HO's directions towards the other end of the bond [] or on the maximization of the overlap between the HO's, belonging to different ends of the bond []. These criteria give quite different results for strained cyclic hydrocarbons (for example, for cyclopropane). It results in an ambiguity of the electronic structure evaluation. At the same time it is obvious that the total electronic energy depends on the coefficients of the HO's expansion through the initial AO basis set. The expansion coefficients are the parameters of the wave function and, hence, must be obtained variationally.

Another aspect of chemical behavior of molecular systems is related to the relative ease of electron extraction, i.e. to their ionization potentials (IP's). Their numerical estimates are important not only for interpreting photoelectron spectra but also for predicting chemical reactivity. It is a common practice to estimate the IP's in the Hartree-Fock approximation with use of the Koopmans' theorem. The approaches based on the configuration interaction schemes are not widely used. The success of the molecular orbitals methods in the explanation of the form of photoelectron spectra (especially, of the degeneracy of the ionic states which coincide with the orbital degeneracies) have led to a widely spread belief that the form of the photoelectron spectra confirms the orbital (and thus ultimately the delocalized) concept of molecular electronic structure and also the one-electron character of the molecular wave functions. The ionic states are usually delocalized (it is clearly seen in the case of highly symmetric systems). Therefore, it would be interesting to see, whether the method based on the local description of the molecular electronic structure can reproduce the form of the photoelectron spectra.

This communication is devoted to the elaboration of the semiempirical variant of the APSLG approach and the systematical investigation of its properties. As a starting point of our considerations we have chosen the well known MINDO/3 form of the Hamiltonian and its set of parameters MINDO. As it is noted in Ref. [], the one-center parameters of the MINDO/3 method are close to those estimates derived from the spectra of free atoms and ions [] and/or those made in the framework of the theory of the effective valence shell Hamiltonian Freed. This allows to conclude that the incorrect asymptotic behavior of the trial SCF wave function is compensated within the MINDO/3 approach by the two-center parameters of the MINDO/3 method whereas the one-center parameters are quite universal and are not strictly bound to the form of the trial electronic wave function.

## Theory

### Wave function and the Hamiltonian

We use the second quantization technique which is convenient for deriving the form of the electronic energy. The AO basis set is transformed into the HO one by unitary transformations of four (one of s-type and three of p-type) orbitals on each ''heavy'' (non hydrogen) atom. Each of the HO's thus obtained is uniquely assigned to a chemical bond or a lone pair. Therefore, the HO's can be labelled as | rm ñ and |lm ñ (right and left HO's of the m-th bond). We use the notation | tm ñ for the both | rm ñ and | lm ñ when the type of the orbital is not specified. The operators annihilating electron with the spin projection s on the HO's can be presented as
 tms = å i Î A hmiAais,
(0)
where 4×4 matrices hA Î SO(4) mix the AO's of the heavy atom A. The geminal representing a single chemical bond is constructed from two HO's (or four spin-HO's) assigned to this bond. We consider only singlet two-electron configurations. Therefore, the geminal is a superposition of two ionic and one covalent (Heitler-London type) configurations with variable amplitudes:
 gm+ = umrma+rmb++vmlma+lmb++wm( rma+lmb++lma+rmb+) .
(0)
In the case of an electron lone pair only one HO is assigned to it. Therefore, only the ionic (in the above sense) contribution to the geminal remains. Without the loss of generality we assume that this contribution is
 gm+ = rma+rmb+.
(0)
The geminals are normalized and orthogonal due to the previously assumed orthogonality of the HO's:
 á 0| gmgm¢+|0 ñ = umum¢+vmvm¢+2wmwm¢ = dmm¢.
(0)
The electronic wave function is the antisymmetrized product of these geminals:

 | F ñ = Õ m gm+|0 ñ .
(0)
The wave function | F ñ is normalized and hence the electronic energy in the APSLG approximation is
 E = á F| H| F ñ .
(0)
The MINDO/3 Hamiltonian in the atomic (or hybrid) basis is a sum of one- and two-center contributions:
 H = å A HA+ 12 å A ¹ B HAB.
(0)
The construction of the APSLG method requires the transformation of the molecular integrals from AO basis to the HO one. The attraction of electrons to its own core transforms as:
 Umt = å i Î A ( hmiA) 2Ui(A),
(0)
where i Î { s,px,py,pz} . The electron attraction to other cores and repulsion of electrons on the orbitals of different atoms are invariant under the basis transformation used. Only the Coulomb and exchange integrals of the electron repulsion on the same atom are known in the AO basis. The transformation to the HO basis generates other types of integrals. Nevertheless it must be stressed that the expression for the electronic energy requires only the Coulomb and exchange repulsion integrals also in the hybrid basis. The transformation reads
 ( \QATOPt1m1\QATOPt2m2 | \QATOPt3m3\QATOPt4m4)A = å i Î A hm1iAhm2iAhm3iAhm4iA(ii | ii)A+
 + å i < j Î A (ii | jj)A(hm1iAhm2iAhm3jAhm4jA+
 +hm1jAhm2jAhm3iAhm4iA)+ å i < j Î A (ij | ij)A×
 ×(hm1iAhm2jAhm3iAhm4jA+hm1iAhm2jAhm3jAhm4iA+
 +hm1jAhm2iAhm3iAhm4jA+hm1jAhm2iAhm3jAhm4iA).
(0)
The resonance integral (corresponding to a one-electron transfer) between the HO tm1 (centered on the atom A) and the HO tm2¢ (centered on the atom B) is expressed through the resonance integrals in the AO basis:
 btm1tm2¢AB = å i Î A å j Î B hm1iAhm2jBbijAB.
(0)
The MINDO/3 Hamiltonian in the hybrid basis has the form of Eq. ( 7) with the following contributions:
 HA = å \Sb tm Î A
 s\endSb æè Umt- å B ¹ A gABZB öø tms+tms-
 - å \Sb tm1,tm2¢ Î A
 m1 < m2\endSb å s btm1tm2¢AA( tm1s+tm2s¢+h.c.) +
 + 12 å \Sb tm1,tm2¢ Î A
 tm3¢¢,tm4¢¢¢ Î A\endSb å st ( \QATOPtm1\QATOPt¢m2 | \QATOPt¢¢m3\QATOPt¢¢¢m4) Atm1s+tm3t¢¢+tm4t¢¢¢tm2s¢,
 HAB = - å \Sb tm1 Î A
 tm2¢ Î B\endSb å s btm1tm2¢AB( tm1s+tm2s¢+h.c.) +
 +gAB å \Sb tm1 Î A
 tm2¢ Î B\endSb å st tm1s+tm2t¢+tm2t¢tm1s,
(0)
where h.c. denotes the hermitean. conjugation.

### Electronic energy

To obtain an expression for the electronic energy in the APSLG approximation we evaluate the elements of the density matrix for the APSLG N-electron trial wave function. The required one-geminal matrix elements of one- and two-electron density matrices are spin independent and can be denoted as
 Pmtt¢ = á 0| gmtms+tms¢gm+| 0 ñ ,
 Gmtt¢ = á 0| gmtms+tm-s¢+tm-s¢tmsgm+| 0 ñ .
(0)
These matrix elements can be directly evaluated
 Pmrr = um2+wm2, Pmll = vm2+wm2,
 Pmrl = Pmlr = (um+vm)wm,
 Gmrr = um2, Gmll = vm2, Gmrl = Gmlr = wm2.
(0)
The contribution to the energy from the interaction of electrons with the cores in its turn is expressed in terms of diagonal elements of the one-electron density matrix:
 E1 = 2 å A å tm Î A æè Umt- å B ¹ A gABZB öø Pmtt.
(0)
The one-center Coulomb repulsion of electrons contributes to the energy:
 Ecoul(1) = å A å tm Î A ( \QATOPtm\QATOPtm | \QATOPtm\QATOPtm) AGmtt+
 +2 å A å \Sb tm1,tm2¢ Î A
 m1 < m2\endSb [ 2( \QATOPtm1\QATOPtm1 | \QATOPt¢m2\QATOPt¢m2) A-( \QATOPtm1 \QATOPt¢m2 | \QATOPt¢m2\QATOPtm1)A] Pm1ttPm2t¢t¢
(0)
through both the one- and two-electron density matrices. The contribution to the energy from the resonance interaction (one-electron interatomic transfers) is:
 Eres = -4 å m brmlmRmLmPmrl,
(0)
where we use notation Rm for the right- and Lm for the left-end atom of the m-th bond. The contribution from the Coulomb interaction of electrons located on different atoms (two-center) to the total energy can be written as:
 Ecoul(2) = 4 å A < B gAB å \Sbtm1 Î A tm2¢ Î B m1 ¹ m2\endSbPm1ttPm2t¢t¢+ å m gRmLmGmrl
(0)
and also involves the matrix elements of both one- and two-electron density matrices.

Finally, the electronic energy is a sum of the above four terms:
 E = E1+Ecoul(1)+Eres+Ecoul(2).
(0)
The molecular integrals entering the above expressions Eqs. (14)-(17) depend on the elements of the SO(4) matrices hA. The SO(4) group transformations depend in their turn on six parameters each [], which define six sequential rotations in two-dimensional subspaces of each four-dimensional space spanned by the AO's of each heavy atom A. Thus the total number of parameters of the wave function to be varied is 6L+2M, where L is the number of heavy atoms and M is the number of geminals, representing chemical bonds (not the lone electron pairs). Each iteration step includes optimization of the HO parameters (six angles per heavy atom) and of the geminal amplitudes. For the set of the amplitudes um , vm, and wm the angles determining the matrices hA are obtained. After that for the fixed coefficients of the HO's the amplitudes of the geminals are determined. The alternating optimizations are consistent and the convergence is rapidly achieved in all variables. To obtain the optimal parameters of the matrices hA we use the direct gradient minimization of the energy. At the same time the optimal geminal amplitudes are obtained by diagonalizing the effective bond Hamiltonians. The effective Hamiltonian for the k-th bond can be written as
 Hkeff = W1kr å s rks+rks+W1kl å s lks+lks+W1krl å s ( rks+lks+lks+rks) +
 +W2krrka+rkb+rkbrka+W2kllka+lkb+lkblka+W2krl å s rks+lk-s+lk-srks,
(0)
with
 W1kr = UkRk- å B ¹ Rk gRkBZB+2 å B ¹ Rk gRkB å \Sb tn Î B
 n ¹ k\endSb Pntt+
 + å \Sb tn Î Rk
 n ¹ k\endSb [ 2( \QATOPrk\QATOPrk | \QATOPtn\QATOPtn) Rk-( \QATOPrk\QATOPtn | \QATOPtn\QATOPrk) Rk]Pntt,
 W1kl = UkLk- å B ¹ Lk gLkBZB+2 å B ¹ Lk gLkB å \Sb tn Î B
 n ¹ k\endSb Pntt+
 + å \Sb tn Î Lk
 n ¹ k\endSb [ 2( \QATOPlk\QATOPlk | \QATOPtn\QATOPtn) Lk-( \QATOPlk\QATOPtn | \QATOPtn\QATOPlk) Lk]Pntt,
 W1krl = -brklkRkLk, W2krl = gRkLk,
 W2kr = ( \QATOPrk\QATOPrk | \QATOPrk\QATOPrk)Rk, W2kl = ( \QATOPlk\QATOPlk | \QATOPlk\QATOPlk) Lk.
(0)
In the basis of the two-electron states contributing to the k-th geminal it becomes a 3×3 matrix, whose lowest eigenvalue and the amplitudes characterizing this eigenstate can be easily found.

### Ionization potentials

Now we consider the configuration interaction scheme for obtaining the vertical ionization potentials. The construction of configurations is conveniently done in the basis of bond orbitals (BO's). These orbitals (bonding bm and antibonding am) are defined in terms of the HO's by:
ì
í
î
 bms = xmlms+ymrms
 ams = -ymlms+xmrms
.
(0)
The normalization condition requires xm2+ym2 = 1. The BO's obey the usual and L-orthogonality conditions (c,d Î { a,b} ):
 á 0| cmsdms+|0 ñ = á 0| gmcmsdms+gm+| 0 ñ = dcd.
(0)
In the basis of BO's the geminals have a simple form
 gm+ = Umbma+bmb++Vmama+amb+,
(0)
where Um2+Vm2 = 1.

Two approaches to constructing the ionized configurations are thinkable. The first way is that one electron is extracted from a geminal and the remaining electron occupy either bonding or antibonding orbital. All other bonds are represented by their ground state geminals. The many-electron ((N-1)-electron) configurations in this case are:
 æè Õ k ¹ m gk+ öø cms+| 0 ñ .
(0)
The second approach to construct the basis configurations is to extract an electron from a geminal, but then to readjust the remaining geminals to the extra potential induced by a hole. It takes into account the polarization of remaining geminals by a hole. The polarized geminals are obtained by solving the eigenvalue problem
 Hkcmteff ~g kcmt = ~e kcmt ~g kcmt
(0)
(where [(g)\tilde]kcmt is the k-th geminal polarized by a hole on the m-th geminal with remaining electron on the bond spin orbital | cmt ñ (| cm ñ =|am ñ or | bm ñ )) and by taking the lowest eigenstate. The effective Hamiltonian for a geminal interacting with an extra attracting potential induced by the hole in some other geminal differs from that of Eq. (19) and (20) and can be presented as:
 Hkcmteff = Hkcmtcore+Hkcmt1,intra+Hkcmt1,inter+Hkcmtres+Hkcmt2,intra+Hkcmt2,inter,
(0)
where the attraction of electrons to the core gives:
 Hkcmtcore = å t Î { r,l} æè UkTk- å B ¹ Tk gTkBZB öø å s tks+tks.
(0)
The intrabond repulsion of electrons on the same atom contributes:
 Hkcmt1,intra = å t Î {r,l} ( \QATOPtk\QATOPtk | \QATOPtk\QATOPtk) Tktka+tkb+tkbtka,
(0)
while the interbond one-center repulsion of electrons can be written as:
 Hkcmt1,inter = å t Î { r,l} å \Sb tq¢ Î Tk
 q ¹ k,m\endSb [ 2( \QATOPtk\QATOPtk | \QATOPt¢q\QATOPt¢q) Tk-( \QATOPtk\QATOPt¢q | \QATOPt¢q\QATOPtk) Tk] Pqt¢t¢ å s tks+tks+
 + å tt¢ Î { r,l} dTkTm¢(hmt¢c)2 éë ( \QATOPtk\QATOPtk | \QATOPt¢m\QATOPt¢m) Tk å s tks+tks-( \QATOPtk\QATOPt¢m | \QATOPt¢m\QATOPtk) Tktkt+tkt ùû ,
(0)
where
 hmtc(t Î { r,l} ;c Î { a,b}) = á 0| tmscms+| 0 ñ
(0)
and it is simply the coefficient of the HO | tm ñ in the expansion for the BO | cm ñ Eq. (21). The resonance contribution is:
 Hkcmtres = -brklkRkLk å s ( rks+lks+lks+rks) .
(0)
The Coulomb repulsion between electrons on the different atoms is divided into the intrabond
 Hkcmt2,intra = gRkLk å s rks+lk-s+lk-srks
(0)
and the interbond
 Hkcmt2,inter = å t Î { r,l} å B ¹ Tk gTkB×
 × å \Sb tq¢ Î B
 q ¹ k\endSb [ dqm(hmt¢c)2+2(1-dqm)Pqt¢t¢] å s tks+tks.
(0)
contributions. After the adjusted (polarized) geminals are calculated according to Eqs. (25)-(33) they can be used for constructing many-electron basis states:
 æè Õ k ¹ m ~g +kbms öø cms+| 0 ñ .
(0)

The vertical ionization potentials and the eigenstates of the ion are obtained by the diagonalization of the operator H-E0I (E0 is the APSLG energy of the ground state) in one of configuration basis sets ((24) or (34)). For the case of non-polarized geminals (24) the matrix elements of the operator H-E0I can be relatively easily written. The MINDO/3 Hamiltonian does not contain the matrix elements of interaction of many-electron states ( 24) with different spin projections. Therefore, the matrix of the Hamiltonian is block-diagonal with two equal blocks on the diagonal. The total dimension of the matrix to be diagonalized is 2M+N, where M is the number of chemical bonds and N is the number of electron lone pairs in the molecules. The diagonal matrix elements of the operator H-E0I are
 Hcmcm = 0 êê æè Õ k¢ ¹ m gk¢ öø cms( H-E0I) cms+ æè Õ k ¹ m gk+ öø êê 0 =
 = å t Î {r,l} { æè Umt- å B ¹ Tm gTmBZB öø [ (hmtc)2-2Pmtt] -( \QATOPtm\QATOPtm | \QATOPtm\QATOPtm) TmGmtt}-
 -2brmlmRmLm(hmrchmlc-2Pmrl)-2gRmLmGmrl+
 + å t Î {r,l} å \Sb tq¢ Î Tm
 q ¹ m\endSb [ 2( \QATOPtm\QATOPtm | \QATOPt¢q\QATOPt¢q) Tm-( \QATOPtm\QATOPt¢q | \QATOPt¢q\QATOPtm) Tm] Pqt¢t¢[ (hmtc)2-2Pmtt] +
 +2 å t Î {r,l} [ (hmtc)2-2Pmtt] å B ¹ Tm gBTm å \Sb tq¢ Î B
 q ¹ m\endSb Pqt¢t¢.
(0)
We introduce the notation:

c

= ì
í
î
 a, if c = b
 b, if c = a
,
Cmc = ì
í
î
 Um, if c = b
 Vm, if c = a
.
(0)
The off-diagonal matrix elements of the operator H-E0I can be divided in two classes: those between ionized states with electron extracted from the same geminal (these states differ by the BO, the remaining electron occupies):
 Hcm[(c)]m = 0 êê æè Õ k¢ ¹ m gk¢ öø cms( H-E0I) c +ms æè Õ k ¹ m gk+ öø êê 0 =
 = å t Î { r,l} æè Umt- å B ¹ Tm gTmBZB öø hmtchmt[(c)]-brmlmRmLm( hmrchml[(c)]+hmr[(c)]hmlc) +
 + å t Î {r,l} hmtchmt[(c)] å \Sb tq¢ Î Tm
 q ¹ m\endSb [2( \QATOPtm\QATOPtm | \QATOPt¢q\QATOPt¢q) Tm-( \QATOPtm\QATOPt¢q | \QATOPt¢q\QATOPtm) Tm] Pqt¢t¢+
 +2 å t Î {r,l} hmtchmt[(c)] å B ¹ Tm gBTm å \Sb tq¢ Î B
 q ¹ m\endSb Pqt¢t¢
(0)
and those between the states where an electron is extracted from different geminals
 Hcmcn¢ = 0 êê æè Õ k¢ ¹ m gk¢ öø cms( H-E0I) cns¢+ æè Õ k ¹ n gk+ öø êê 0 =
 = å tt¢ Î { r,l} CmcCnc¢hmtchnt¢c¢×
 ×{ btmtn¢TmTn¢-dTmTn¢ å \Sb tq¢¢ Î Tm
 q ¹ m,n\endSb Pqt¢¢t¢¢[ 2( \QATOPt¢¢q\QATOPt¢¢q | \QATOPtm \QATOPt¢n) Tm-( \QATOPt¢¢q \QATOPtm | \QATOPt¢¢q\QATOPt¢n)Tm] } -
 - å tt¢ Î {r,l} dTmTn¢hmtchnt¢c¢( \QATOPtm\QATOPtm | \QATOPtm\QATOPt¢n)TmCnc¢[ (hmta)2Cma+(hmtb)2Cmb] -
 - å tt¢ Î {r,l} dTmTn¢hmtchnt¢c¢( \QATOPt¢n\QATOPt¢n | \QATOPt¢n\QATOPtm) TmCmc[ (hnt¢a)2Cna+(hnt¢b)2Cnb] .
(0)
The expressions for the matrix elements of the operator H-E0I in the basis of ionic states with the polarized geminals Eq. ( 34) are not presented here due to their complexity. The main difference in the off-diagonal matrix elements corresponds to multiplying them by the respective overlap factors. Meanwhile, the diagonal matrix elements are shifted due to changes in the contributions to the Hamiltonian coming from the non-ionized geminals due to their polarization.

## Results

The formulae given above developed recently in Refs. ZhFizKhim,JCompChem were implemented as a computational tool called BF []. The search of the equilibrium molecular geometric structure was implemented using the gradient minimization. The important problem in the context of constructing a semiempirical approach is the choice of the Hamiltonian parameters. The parameters of the semiempirical SCF approaches implicitly include the correlation of electrons. In Ref. [] we adjusted the resonance parameters bAB of the MINDO/3 Hamiltonian to cure the discrepancies due to neglect of the electron correlations. These parameters were readjusted on the ground of the results on the heats of formation and the modified values can be found in the Ref. [], also containing the discussion of criteria for adjusting the parameters. Table 1 contains the results of the calculations of the heats of formation for the set of organic molecules obtained by the APSLG-MINDO/3 and SCF-MINDO/3 methods with the optimized geometric structures in comparison with experimental data taken from Ref. [] devoted to the parameterization of the MNDO method.The results obtained demonstrate that the APSLG-MINDO/3 and SCF-MINDO/3 methods have the similar precision when applied to the description of the heats of formation (the mean deviation from the experiment is 6.6 kcal/mol for the APSLG-MINDO/3 method and 7.6 kcal/mol for the SCF-MINDO/3 method). At the same time the results for the branched compounds are rather poor for both computational methods, i.e. the account of the intrabond correlation does not improve this disadvantage of the MINDO/3 approach. It was shown in our previous works ZhFizKhim,JCompChem that the total energies obtained by the APSLG-MINDO/3 method are approximately additive in the homologic series in agreement with experiment [,]. The differences between the homologues are correctly reproduced by the present method in contrast with the SCF-MINDO/3 approach. Therefore, when the molecular size increases the estimates of the heats of formation within the APSLG-MINDO/3 method significantly improve as compared to those of the SCF-MINDO/3 method.

Consistent reproducing the heats of formation does not suffice to demonstrate the consistence of a quantum chemical method. It is also important that the estimates of acceptable precision for the heats of formation would be obtained together with the geometry structure, i.e. positions of the minima on the potential energy surfaces. We have compared the equilibrium geometry structures obtained by the APSLG-MINDO/3 method with the experimental ones. The APSLG-MINDO/3 method predicts the length of the H-H bond in the dihydrogen molecule longer than the experimental value by 0.01 Å . At the same time the calculated length of the C-H bond in the methane (1.092 Å ) fairly coincides with the experimental one (1.094 Å ). The situation for the C-C bond in the hydrocarbons is more complex: the value of the length of the C-C bond in the ethane calculated by the SCF-MINDO/3 method is 1.474 Å , which is significantly smaller than the experimental value of 1.536 Å . The APSLG-MINDO/3 method gives the value 1.484 Å  for this length. This result is better than that of the SCF-MINDO/3 but the difference with the experiment is still too large. This situation is slightly better for the higher hydrocarbons. At the same time the difference between the calculated and experimental length of the C=C bond in the ethylene is not very great (1.335 Å  vs. 1.339 Å ). The SCF-MINDO/3 method gives the value 1.313 Å  for this quantity which is significantly lower than the experiment. Fascinating example is the optimization of the geometric structure of the norbornadiene and quadricyclane molecules. Starting from the same geometry but with different positions of the chemical bonds (connectivity scheme) we obtain different results. If all the bonds are admitted to be single the optimization procedure results in the geometric structure very close to that of the quadricyclane. If two bonds are set to be double the geometry obtained is close to that of the norbornadiene.

Both schemes of the calculation of vertical ionization potentials and wave functions of the ionized states described in Section 2.3 were implemented in the computational package BF []. The results of both methods can be discussed with use of the concept of Dyson orbitals []. These orbitals are defined by the expression:
 gn(x) = á Yn+| y(x)| Y0 ñ ,
(0)
where y(x) is the operator annihilating an electron in the point x = (r,s). In the case of the SCF-MINDO/3 method using the Koopmans' theorem for description of the ionic states, the Dyson orbitals coincide with the molecular orbitals and are, therefore, normalized. In the case of the geminal trial wave function for the ground state of the neutral molecule the Dyson orbitals are not normalized. We applied both procedures to the calculation of the ionization potentials in a series of normal hydrocarbons from CH4 to C20H42. The results obtained by the both APSLG-type computational procedures are presented in Fig. 1 in comparison with relevant experimental data and the results obtained by the SCF-MINDO/3 method. Also we computed the first vertical ionization potentials for some other hydrocarbons and organic molecules containing oxygen and nitrogen. These values are calculated within the scheme with fixed geminals. The results accompanied by the corresponding experimental data are present in Table 2.

## Discussion

The essential feature of the APSLG-MINDO/3 approach is the partial account of electron correlations in contrast with the SCF-MINDO/3 approach. This may be thought to be unimportant from the refined theoretical point of view since the method itself anyway belongs to the family of other semiempirical methods where the correlations are all the same at least to some extent taken into account by parameterization. However, the correlated and local form of the trial wave function of the APSLG-based approach allows for correct asymptotic behavior for the bond cleavage. To this extent one may expect that the APSLG method is capable to reproduce qualitative features of molecular electronic structure for reactive species. At the same time APSLG-MINDO/3 method neglects the interbond one-electron transfers which are covered by the parent SCF-MINDO/3 approach. It is interesting to find out which factor (the intrabond electron correlations or interbond electron transfers) is more important for description of the electronic structure. The data of the Table 1 on the heats of formation and equilibrium geometries show that both factors are of similar importance. The transition to the APSLG structure of the wave function has advantages for the equilibrium geometries. The wave function of the SCF method has wrong limit under bond cleavage.

The local character of the APSLG approach provides a possibility of linear dependence of the computational costs on the size of a molecule. To demonstrate it we carried out calculations of the electronic structure for a series of normal saturated hydrocarbons from CH4 to C20H42 without geometry optimization by the APSLG-MINDO/3 and SCF-MINDO/3 methods. The dependencies of the computation time (in seconds) on the number of carbon atoms in the chain are presented in the Fig. 2. It is clearly seen that the method APSLG-MINDO/3 possesses a linear dependence of the computational costs on the length of the chain in contrast with the SCF-MINDO/3 method. In the case of normal hydrocarbon C20H42 the computation time for two methods differs 30 times in favor of the APSLG approach.

The HO's for double bonds are normally ambiguously defined. Does this bond consist of two equivalent (bent or ''banana'') or non-equivalent (s- and p-) bonds? The variational determination of the HO's in the proposed approach allows to discriminate between the two models on the ground of the energy criterion. Our calculations have revealed that the s-/p-separated model is only less than 1 kcal mol-1 energetically more favorable than the model with the bent bonds. The question about the preferability of one or another representation of the double bond was also studied in Ref. [] with use of the full GVB approach. In the case of the ethylene molecule the conclusion about closeness of the energies for the two ways of bonding has been drawn as well. The calculations with the frozen core have shown that the s-/p-separation is slightly more preferable in agreement with our results.

Another question is about the form of the HO's in strained cyclocarbons (especially, cyclopropane). Our calculations predict that the optimal HO's are not directed along the C-C bonds. In the framework of the APSLG approach certain concepts specific for chemistry became natural. For example, the hybridization can be considered as a parameter of the wave function and is determined on the basis of the variational principle. The bond ionicity, covalency and polarity become clearly defined if the geminal is rewritten in the form
 gm+ = ImÖ2 éë Ö 1+lm rma+rmb++ Ö 1-lm lma+lmb+ ùû + CmÖ2 [ rma+lmb++lma+rmb+] ,
(0)
where Im and Cm are the total amplitudes of the ionic and covalent contributions respectively, and lm is the bond polarity (i.e. asymmetry of the charge distribution in the m-th bond). Table 3 represents the data on bond covalency, ionicity and polarity as well as the atomic charges obtained with use of the Mulliken's scheme. Table 3 shows that the covalent contribution exceeds the ionic contribution for all considered molecules and these contributions are only weakly sensitive to the chemical structure of the bond and its surrounding. At the same time the numerical values of the parameter lm vary significantly and corresponds to the chemists' intuitive ideas concerning the bond polarity. Taking into account the weak dependence of the covalency and ionicity on the type of bond, the bond polarities ultimately control the charges on the atoms. The interesting result is obtained for the methane molecule: the SCF-MINDO/3 method predicts positive charge for the carbon atom. At the same time the APSLG-MINDO/3 method gives the reverse charge distribution (bond polarity) in accordance with the results of ab initio calculations and with the intuitive belief of organic chemists.

It is also interesting to consider the hybridization as determined variationally. In the case of methane the sp3-hybridization is obtained. The transition to ethane essentially changes the hybridization: the HO's of the C-C bond have the sp2.26-hybridization and the HO's of the C-H bonds have the sp3.33-hybridization. It can be noted that in the molecule with lone electron pairs the latter have substantially the s-character and, therefore, the HO's assigned to the chemical bonds have mainly the p-character. For example, the HO of the F-H bond in the HF molecule has sp39.0-hybridization. It is necessary to mention that the mixing the HO's of different lone electron pairs which are centered on the same atom does not change the electronic energy and degree of this mixing is controlled by initial conditions of the optimization.

The correct description of the ionization potentials requires the same level of the calculation of the ion and neutral molecule. It can be only approximately achieved in the case when the molecule is described by the local method, because the ion must be described as essentially delocalized. The data of the Fig. 1 demonstrate that the APSLG-based method with fixed geminals gives the vertical ionization potentials which are slightly larger than the experimental adiabatic ones. At the same time, the scheme with the polarization of the geminals by a hole predicts the vertical ionization potentials which are significantly lower even than the experimental adiabatic ones. It is due to different degree of correlation accounted in the ion and neutral molecule. It is necessary to mention that different explorers obtain experimentally very divergent values for the ionization potentials (for example, authors of Refs. [,,] report very different results for the methane molecule).

It is interesting to consider the structure of a hole in the polyethylene. The one-electron approximation predicts the delocalized character of the hole (the corresponding Dyson orbital is a plane wave). At the same time the variation of the lengths of the chemical bonds in the ion can localize the hole []. In the case of the APSLG-based method the Dyson orbital of the hole is slightly more localized than that in the case of the SCF approach but it also has the sinus-like form with the maximum at the middle of the chain. In the case of the APSLG-MINDO/3 method with the fixed geminals the charges on the hydrogen atoms are very close to those in the neutral molecule while in the case of the APSLG-MINDO/3 method with the polarized geminals the charges on the hydrogen atoms became essentially more positive than in the neutral molecule, due to electron redistribution in the C-H bonds, though they are not affected by the ionization directly.

In the case of highly symmetric molecules the ionization experiments manifest the degeneracy of the ionic states. The molecular orbital theories correlate it with the degeneracy of the molecular orbitals the electron is extracted from. The localized method like APSLG can describe the degeneracy only if the interaction between the local configurations is turned on. The degeneracies of the states are due to the symmetry of the Hamiltonian matrix. Our calculations demonstrate that the APSLG-MINDO/3 approach reproduces the degeneracies and the order of the ionized states (in the case of methane we obtained the triply degenerated, in the case of ethane and ammonia the doubly degenerated first ionization states).

The analysis of the data of the Table 2 shows that the APSLG-MINDO/3 based method with fixed geminals gives the value of the first vertical ionization potential for all classes of molecules close to the experimental ones. The ionization potentials of the methylamine and ethylamine are very close because the lone pare mainly contributing to the ionized state is only slightly affected by the changing in the remote surrounding. In contrast the addition of alkyl-groups to the atom with lone electron pair significantly decreases the first vertical ionization potential due to electron-donating character of alkyl-groups in the closest surrounding of the lone electron pair.

## Conclusions

A semiempirical method using the trial wave function in the form of the antisymmetrized product of strictly localized geminals is constructed and implemented as a computational tool. The quality (accuracy) of the results related to the heats of formation and equilibrium geometries obtained by the APSLG-MINDO/3 method is comparable and somewhat better than that of the SCF-MINDO/3 method. At the same time the method APSLG-MINDO/3 possesses three important features: (1) it guarantees the correct behavior of the wave function under the homolytic cleavage of chemical bonds; (2) the computational costs of the method increase linearly with the growth of the system size and (3) some chemical concepts such as the hybridization, covalency, ionicity, polarity of chemical bonds are naturally defined in the framework of the APSLG approach. The APSLG-based configuration interaction schemes for the calculation of the vertical ionization potentials correctly reproduce the degeneracies of the ionic states. At the same time only the APSLG-based method with fixed geminals gives the values of the first vertical ionization potential, which are close to the experimental ones. Analogous approach with polarization of geminals systematically lowers the values of the first ionization potential.

This work is supported by the RFBR through the grant 99-03-33176. One of us (A.M.T.) acknowledges financial support from the Haldor Topsø e A/S.

## References

[]
Localization and Delocalization in Quantum Chemistry, Chalvet, O., Daudel, R., Diner, S., and Malrieu, J.-P., Eds., Dordrecht: D. Reidel Publishing Company, 1975.

[]
Edmiston, C. and Ruedenberg, K., Rev. Mod. Phys., 1963, vol. 35, p. 457.

[]
Edmiston, C. and Ruedenberg, K., J. Chem. Phys., 1965, vol. 43, p. 97.

[]
Foster, J.M. and Boys, S.F., Rev. Mod. Phys., 1960, vol. 32, p. 300.

[]
Boys, S.F., Rev. Mod. Phys., 1960, vol. 32, p. 296.

[]
von Niessen, W., Theor. Chim. Acta, 1972, vol. 27, p. 9.

[]
Smits, G.F. and Altona, C., Theor. Chim. Acta, 1985, vol. 67, p. 461.

[]
Diner, S., Malrieu, J.-P., and Claverie, P., Theor. Chim. Acta, 1969, vol. 13, p. 1.

[]
Fock, V.A., Veselov, M.G., and Petrashen', M.I., Zhurn. Eksp. and Theor. Fiz., 1940, vol. 10, p. 723 [in Russian].

[]
Fock, V.A., DAN SSSR, 1950, vol. 73, p. 735 [in Russian].

[]
Hurley, A.C., Lennard-Jones, J.E., and Pople, J.A., Proc. R. Soc. London, Ser. A, 1953, vol. 220, p. 446.

[]
Parks, J.M. and Parr, R.G., J. Chem. Phys., 1958, vol. 28, p. 335.

[]
Surjàn, P.R., The Concept of the Chemical Bond, in Theoretical Models of Chemical Bonding, Part 2, Maksi\'c, Z.B., Ed., Heidelberg: Springer, 1989, p. 205.

[]
Wu, W. and McWeeny, R., J. Chem. Phys., 1994, vol. 101, p. 4826.

[]
Dixon, S.L. and Merz, K.M., J. Chem. Phys., 1997, vol. 107, p. 879.

[]
Stewart, J.J.P., Int. J. Quantum Chem., 1996, vol. 58, p. 133.

[]
Bobrowicz, F.W. and Goddard III, W.A., The Self-Consistent Field Equations for Generalized Valence Bond and Open-Shell Hartree-Fock Wave Functions, in Methods of Electronic Structure Theory, Schaefer III, H.F., Ed., New York: Plenum Press, 1977.

[]
Boutleroff, A.M., Selected works, Moscow: Izd. AN SSSR, 1953 [in Russian].

[]
Lewis, G.N., J. Am. Chem. Soc., 1916, vol. 38, p. 762.

[]
Pauling, L., J. Am. Chem. Soc., 1931, vol. 57, p. 1367.

[]
Del Re, G., Theor. Chim. Acta, 1963, vol. 1, p. 188.

[]
Bingham, R.C., Dewar, M.J.S., and Lo, D.H., J. Am. Chem. Soc., 1975, vol. 97, p. 1302, p. 1307.

[]
Misurkin, I.A., Opt. and Spectr., 1975, vol. 39, p. 617 [in Russian].

[]
Misurkin, I.A. and Ovchinnikov, A.A., Opt. and Spectr., 1971, vol. 30, p. 616 [in Russian].

[]
Freed, K., Chem. Phys. Lett., 1972, vol. 13, p. 181; ibid. 1972, vol. 15, p. 331.

[]
Smirnov, V.I., Course of Higher Mathematics, vol. 3, Part. I, Moscow, 1958 [in Russian].

[]
Tokmachev, A.M. and Tchougréeff, A.L., Russ. J. Phys. Chem., 1999, vol. 73, p. 259.

[]
Tokmachev, A.M., Tchougréeff, A.L., and Misurkin, I.A., J. Comp. Chem., submitted.

[]

[]
Dewar, M.J.S. and Thiel, W., J. Am. Chem. Soc., 1977, vol. 99, p. 4899.

[]
Cohen, N. and Benson, S.W., Chem. Rev., 1993, vol. 93, p. 2419.

[]
Stepanov, N.F., Erlykina, M.E., and Filippov, G.G., Methods of linear algebra in physical chemistry, Moscow, 1976 [in Russian].

[]
McWeeny, R., Methods of Molecular Quantum Mechanics, 2nd Edition, AP, London, 1992.

[]
Herzberg, G., J. Mol. Spectr., 1970, vol. 33, p. 147.

[]
Turner, D.W., Baker, C., Baker, A.D., and Brundle, C.R., Molecular photoelectron spectroscopy, London: Wiley-Interscience, 1970.

[]
Brundle, C.R., Robin, M.B., Kuebler, N.A., and Basch, H., J. Am. Chem. Soc., 1972, vol. 94, p. 1451.

[]
Thermal Constants of Substances (Handbook), Glushko, V.P., Ed., Issues 1-7, VINITI: Moscow, 1965-1973 [in Russian].

[]
Franklin, J.L., Dillard, J.G., Rosenstock, H.M., Herron, J.T., and Draxl, K., Ionization Potentials, Apearance Potentials, and Heats of Formation of Gaseous Positive Ions, Washington: NBS, 1969.

[]
Watanabe, K. and Mottl, J.R., J. Chem. Phys., 1957, vol. 26, p. 1773.

[]
Potts, A.W. and Price, W.C., Proc. R. Soc. London, Ser. A, 1972, vol. 326, p. 165, p. 181.

[]
Refaye, K.M.A. and Chupka, W.A., J. Chem. Phys., 1968, vol. 48, p. 5205.

[]
Ogliaro, F., Cooper, D.L. and Karadakov, P.B., Int. J. Quantum Chem., 1999, vol. 74, p. 223.

[]
Watanabe, K., Nakayama, T., and Mottl, J.R., J. Quant. Spectr. Rad. Transfer, 1962, vol. 2, p. 369.

[]
Al-Joboury, M.L. and Turner, D.W., J. Chem. Soc., 1964, p. 4434.

[]
Dewar, M.J.S and Worley, S.D., J. Chem. Phys., 1969, vol. 50, p. 654.

[]
Ovchinnikov, A.A., Zhurn. Struct. Khim., 1965, vol. 6, p. 291 [in Russian].

Table 0: Heats of formation (kcal/mol) of test molecules obtained experimentally and calculated with use of the APSLG-MINDO/3 and SCF-MINDO/3 methods.
 DHf DHf DHf Molecule (expt.) APSLG SCF H2 0.0 -0.101 0.131 CH4 -17.8 -8.430 -6.275 C2H6 -20.04 -19.514 -19.849 C3H8 -25.00 -25.185 -26.527 n-C4H10 -30.00 -29.993 -32.655 iso-C4H10 -32.00 -22.379 -24.710 n-C5H12 -35.09 -35.225 -38.973 neo-C5H12 -40.15 -12.126 -14.632 cyclopropane 12.7 18.472 8.524 cyclobuthane 6.8 2.360 -8.088 cyclopenthane -18.3 -28.193 -27.795 cyclohexane -29.49 -29.166 -32.505 NH3 -11.0 -4.529 -9.125 CH3NH2 -5.5 -1.034 -4.615 C2H5NH2 -11.3 -11.777 -15.708 n-C3H7NH2 -16.8 -16.776 -21.874 iso-C3H7NH2 -20.0 -14.718 -18.341 tert-BuNH2 -28.9 -6.757 -13.215 (CH3)2NH -4.4 5.535 4.310 (CH3)3N -5.7 19.136 21.027 N2H4 22.8 21.307 3.166 CH3NHNH2 22.6 23.102 8.366 (CH3)2NNH2 20.1 33.039 22.242 CH3NHNHCH3 22.0 27.588 15.019 H2O -57.8 -55.677 -53.611 CH3OH -48.16 -48.549 -50.573 C2H5OH -56.21 -60.167 -64.292 1-C3H7OH -60.98 -65.113 -70.417 2-C3H7OH -65.19 -66.008 -69.117 tert-BuOH -74.7 -63.509 -65.614 (CH3)2O -60.3 -33.905 -51.191 H2O2 -32.5 -32.720 -29.265

Table 0: The first vertical ionization potentials (eV) for some simple molecules obtained by the APSLG method with fixed geminals and some experimental results.
 Molecule Calculated value, eV Experimental value, eV H2 15.614 15.43 [] cyclopropane 10.058 11.0 [] C2H4 10.418 10.51 [] NH3 10.117 10.15 [] N2H4 8.856 8.74 [] CH3NH2 8.741 8.97 [] C2H5NH2 8.789 8.66 [] H2O 12.794 12.62 [] CH3OH 10.781 10.84 [] C2H5OH 10.470 10.47 []

Table 0: Charges, ionic and covalent contributions and polarities of chemical bonds calculated by the APSLG-MINDO/3 method.
 Ionic Covalent Bond Molecule Bond contribution contribution polarity Charge Charge A-B I2 C2 l A B H2 H-H 0.434 0.566 0 0 0 CH4 C-H 0.415 0.585 -0.157 -0.260 0.065 C2H6 C-C 0.411 0.589 0 -0.126 -0.126 C-H 0.411 0.589 -0.102 -0.126 0.043 N2H4 N-N 0.363 0.637 0 -0.114 -0.114 N-H 0.415 0.585 -0.137 -0.114 0.057 NH3 N-H 0.408 0.592 -0.111 -0.135 0.045 CH3NH2 C-N 0.398 0.602 0.051 -0.145 -0.142 C-H 0.411 0.589 -0.133 -0.145 0.055 N-H 0.415 0.585 -0.148 -0.142 0.061 HF F-H 0.470 0.530 -0.699 -0.329 0.329 CH3F C-H 0.416 0.584 -0.147 0.032 0.061 C-F 0.386 0.614 0.556 0.032 -0.215 H2O O-H 0.463 0.537 -0.523 -0.484 0.242

File translated from TEX by TTH, version 2.67.
On 12 May 2000, 23:49.