A hypervalent molecule is a molecule that contains one or more main group elements formally bearing more than eight electrons in their valence shells. Phosphorus pentachloride (PCl₅), sulfur hexafluoride (SF₆), the phosphate ion (PO₄³⁻), chlorine trifluoride (ClF₃) and the triiodide ion (I₃⁻) are examples of hypervalent molecules.
Hypervalent molecules were first formally defined by Jeremy I. Musher in 1969 as molecules having central atoms of groups 15-18 in any oxidation state other than the lowest.[1]
Several specific classes of hypervalent molecules exist:
N-X-L nomenclature, introduced in 1980,[3] is often used to classify hypervalent compounds of main group elements, where:
Examples of N-X-L nomenclature include:
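The labelling scheme can be sketched in a few lines of Python. This is only an illustration and assumes the usual reading of the notation: N is the number of valence electrons formally assigned to the central atom, X is its chemical symbol, and L is the number of ligands bonded to it. The helper names `formal_valence_electrons` and `nxl_label` are hypothetical, and the electron count assumes every ligand is attached by a single bond.

```python
def formal_valence_electrons(group_valence: int, single_bonds: int, charge: int = 0) -> int:
    """Formal electron count at the central atom.

    Assumption: each ligand is singly bonded and contributes one electron,
    and any ionic charge is assigned to the central atom.
    """
    return group_valence + single_bonds - charge


def nxl_label(valence_electrons: int, central_atom: str, ligands: int) -> str:
    """Build an N-X-L label from its three parts."""
    if valence_electrons < 0 or ligands < 0:
        raise ValueError("electron and ligand counts must be non-negative")
    return f"{valence_electrons}-{central_atom}-{ligands}"


# Phosphorus pentachloride: P has 5 valence electrons and 5 single bonds.
print(nxl_label(formal_valence_electrons(5, 5), "P", 5))              # 10-P-5
# Sulfur hexafluoride: S has 6 valence electrons and 6 single bonds.
print(nxl_label(formal_valence_electrons(6, 6), "S", 6))              # 12-S-6
# Triiodide: central I has 7 valence electrons, 2 bonds, overall charge -1.
print(nxl_label(formal_valence_electrons(7, 2, charge=-1), "I", 2))   # 10-I-2
```

Species with multiply bonded ligands, such as phosphate, need a different electron-counting convention and are outside this sketch.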
The debate over the nature and classification of hypervalent molecules goes back to Gilbert N. Lewis and Irving Langmuir and their dispute over the nature of the chemical bond in the 1920s.[4] Lewis maintained the importance of the two-center two-electron (2c-2e) bond in describing hypervalence, thus allowing for expanded octets. Langmuir, on the other hand, upheld the dominance of the octet rule and preferred the use of ionic bonds to account for hypervalence without violating the rule (e.g. SF₄²⁺, F₂²⁻).
In the late 1920s and 1930s, Sugden argued for the existence of a two-center one-electron (2c-1e) bond and thus rationalized bonding in hypervalent molecules without the need for expanded octets or ionic bond character; this was poorly accepted at the time.[4] In the 1940s and 1950s, Rundle and Pimentel popularized the idea of the three-center four-electron bond, which is essentially the concept that Sugden had attempted to advance decades earlier. The three-center four-electron bond can alternatively be viewed as consisting of two collinear two-center one-electron bonds, with the remaining two nonbonding electrons localized on the ligands.[4]
The attempt to actually prepare hypervalent organic molecules began with Hermann Staudinger and Georg Wittig in the first half of the twentieth century, who sought to challenge the extant valence theory and successfully prepared nitrogen- and phosphorus-centered hypervalent molecules.[5] The theoretical basis for hypervalency was not delineated until J.I. Musher's work in 1969.[1]
In 1990, Magnusson published a seminal work definitively excluding the role of d-orbital hybridization in bonding in hypervalent compounds of second-row elements. This had long been a point of contention and confusion in describing these molecules using molecular orbital theory. Part of the confusion here originates from the fact that one must include d-functions in the basis sets used to describe these compounds (or else unreasonably high energies and distorted geometries result), and the contribution of the d-function to the molecular wavefunction is large. These facts were historically interpreted to mean that d-orbitals must be involved in bonding. However, Magnusson concludes in his work that d-orbital involvement is not implicated in hypervalency.[6]
Both the term and concept of hypervalency still fall under criticism. In 1984, in response to this general controversy, Paul von Ragué Schleyer proposed the replacement of 'hypervalency' with use of the term hypercoordination because this term does not imply any mode of chemical bonding and the question could thus be avoided altogether.[4]
The concept itself has been criticized by Ronald Gillespie, who, based on an analysis of electron localization functions, wrote in 2002 that "as there is no fundamental difference between the bonds in hypervalent and non-hypervalent (Lewis octet) molecules there is no reason to continue to use the term hypervalent."[7]
For hypercoordinated molecules with electronegative ligands, such as PF5, it has been demonstrated that the ligands can pull enough electron density away from the central atom that its net content is again 8 electrons or fewer. Consistent with this alternative view is the finding that hypercoordinated molecules based on fluorine ligands, for example PF5, do not have hydride counterparts: phosphorane (PH5), for instance, is an unstable molecule.
Even an ionic model holds up well in thermochemical calculations. It predicts favorable exothermic formation of PF₄⁺F⁻ from phosphorus trifluoride (PF₃) and fluorine (F₂), whereas a similar reaction forming PH₄⁺H⁻ is not favorable.[8]
Early considerations of the structure of hypervalent molecules returned familiar arrangements that were well explained by the VSEPR model. Accordingly, AB5 and AB6 type molecules would possess trigonal bipyramidal and octahedral geometries, respectively. However, in order to account for the observed bond angles, bond lengths and apparent violation of the Lewis octet rule, several alternative models have been proposed.[9]
In the 1950s a molecular orbital treatment of hypervalent bonding was adduced to explain the molecular architecture. In this picture, the central atom of penta- and hexacoordinated molecules would be sp3d and sp3d2 hybridized, respectively, which requires the promotion of central atom electrons to unoccupied d-orbitals. However, advances in ab initio calculations have revealed that the contribution of d-orbitals to hypervalent bonding is too small to describe the bonding properties, and this hybrid orbital description is now regarded as much less important.[6] It was shown that in the case of hexacoordinated SF6, d-orbitals are not involved in S-F bond formation; instead, charge transfer between the sulfur and fluorine atoms and the apposite resonance structures were able to account for the hypervalency.
Additional modifications to the octet rule have been attempted that introduce ionic character into hypervalent bonding. One of these, proposed in 1951, is the concept of the 3-center-4-electron (3c-4e) bond, which describes hypervalent bonding with a qualitative molecular orbital picture.[10] The 3c-4e bond is described by three molecular orbitals formed by combining a p orbital on the central atom with one orbital on each of two ligands: an occupied bonding orbital, an occupied non-bonding orbital (HOMO), and an unoccupied anti-bonding orbital (LUMO). This model, in which the octet rule is preserved, was also advocated by Musher.[4]
An example of this is the hexacoordinated SF6, which has been proposed to be composed of three 3c-4e bonds. In this model the three bonds are equivalent, linear and mutually orthogonal, with one lying along each of the x, y and z axes. These interactions are F(p¹)-S(3px²)-F(p¹), F(p¹)-S(3py²)-F(p¹), and F(p¹)-S(3pz²)-F(p¹). Together they account for both the octahedral symmetry of the molecule and its observed molecular structure.
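The three-orbital picture of a 3c-4e bond can be illustrated with a minimal Hückel-type sketch of a linear ligand-center-ligand unit. It is not taken from the cited work. It assumes, purely for simplicity, that all three orbitals have the same energy `alpha` and that only neighbouring orbitals interact through a coupling `beta`; in real hypervalent molecules the ligand orbitals lie lower in energy. The function names are hypothetical.

```python
import math


def three_center_orbitals(alpha: float = 0.0, beta: float = -1.0):
    """Analytic Hueckel solutions for a linear L-A-L chain.

    Assumption: equal site energies (alpha) and nearest-neighbour
    coupling only (beta < 0). Coefficients are ordered (L, A, L).
    """
    s = 1.0 / math.sqrt(2.0)
    return [
        ("bonding", alpha + math.sqrt(2.0) * beta, (0.5, s, 0.5)),
        ("non-bonding", alpha, (s, 0.0, -s)),
        ("anti-bonding", alpha - math.sqrt(2.0) * beta, (0.5, -s, 0.5)),
    ]


def site_populations(n_electrons: int = 4):
    """Fill the orbitals from the lowest energy up and sum electron density per site."""
    populations = [0.0, 0.0, 0.0]
    remaining = n_electrons
    for _, _, coeffs in sorted(three_center_orbitals(), key=lambda mo: mo[1]):
        occupied = min(2, remaining)
        remaining -= occupied
        for site, c in enumerate(coeffs):
            populations[site] += occupied * c * c
    return populations


for name, energy, _ in three_center_orbitals():
    print(f"{name:>12}: E = {energy:+.3f}")
print([round(p, 3) for p in site_populations(4)])  # roughly [1.5, 1.0, 1.5]
```

With four electrons, the bonding and non-bonding orbitals are filled and the anti-bonding orbital stays empty. The non-bonding orbital has no amplitude on the central atom, so its two electrons sit entirely on the ligands, which is how the model keeps the central atom's electron count down.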
A more complete description of hypervalent molecules arises from consideration of molecular orbital theory through quantum mechanical methods. In an LCAO treatment of, for example, sulfur hexafluoride, a basis set of the one sulfur 3s orbital, the three sulfur 3p orbitals, and six octahedral symmetry-adapted linear combinations (SALCs) of fluorine orbitals yields a total of ten molecular orbitals: four fully occupied bonding MOs of lowest energy, two fully occupied non-bonding MOs of intermediate energy, and four vacant antibonding MOs of highest energy. Together these provide room for all 12 valence electrons. This is a stable configuration only for SX6 molecules containing electronegative ligand atoms like fluorine, which explains why SH6 does not form. In the bonding model, the two non-bonding MOs (1eg) can involve the sulfur 3d orbitals and are then further stabilized into lower-energy bonding MOs through overlap with the two degenerate 3d orbitals of the proper symmetry (eg). However, the extent of d-orbital participation is thought to be minimal.
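The orbital bookkeeping for SF6 can be checked with a short sketch. It takes the level counts from the description above (four bonding, two non-bonding and four antibonding MOs, with 12 valence electrons) and fills them from the bottom up. The function names are hypothetical, and dividing the net bonding pairs evenly over the six S-F bonds is a simple averaging convention, not a result from the cited work.

```python
# Level scheme without d-orbitals, ordered from lowest to highest energy.
SF6_LEVELS = [("bonding", 4), ("non-bonding", 2), ("anti-bonding", 4)]


def fill_levels(levels, n_electrons):
    """Place electrons in the lowest available levels, two per orbital."""
    occupancy = {}
    remaining = n_electrons
    for kind, n_orbitals in levels:
        placed = min(2 * n_orbitals, remaining)
        occupancy[kind] = placed
        remaining -= placed
    if remaining:
        raise ValueError("not enough orbitals for the given electron count")
    return occupancy


def bond_order_per_bond(occupancy, n_bonds):
    """Net bonding electron pairs shared equally among n_bonds bonds."""
    net_pairs = (occupancy["bonding"] - occupancy["anti-bonding"]) / 2
    return net_pairs / n_bonds


occ = fill_levels(SF6_LEVELS, 12)
print(occ)  # {'bonding': 8, 'non-bonding': 4, 'anti-bonding': 0}
print(round(bond_order_per_bond(occ, 6), 3))  # 0.667
```

In this count only four electron pairs are bonding, so sulfur shares eight bonding electrons with the six fluorine atoms, and the remaining four electrons occupy ligand-based non-bonding orbitals.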
For hypervalent compounds in which the ligands are more electronegative than the central, hypervalent atom, resonance structures can be drawn with no more than four covalent electron pair bonds and completed with ionic bonds to obey the octet rule. For example, phosphorus pentafluoride’s three equatorial bonds can be formed from sp2-hybridized phosphorus orbitals. The axial bonds can then be described by two resonance forms each containing one ionic bond and one covalent bond, thus satisfying the octet rule and explaining both the observed molecular geometry and relative discrepancy between the axial and equatorial bond lengths. The axial bonds may be represented as two half-bonds (the ‘average’ of the symmetrical resonance forms) or a single 3c-4e bond. However, the magnitude of the discrepancy between the axial and equatorial bond lengths is substantially smaller than this structural model predicts.[11]
For a hexacoordinate molecule such as sulfur hexafluoride, each of the six bonds is the same length. The rationalization described above can be applied to generate resonance structures each with two covalent bonds and two 3c-4e bonds, such that the 3c-4e bond character is distributed across each of the sulfur-fluorine bonds.
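The resonance averaging described for PF5 and SF6 can be made concrete with a small sketch. The bond labels and the function `average_bond_orders` are hypothetical. For SF6 the sketch assumes that, in each resonance structure, the two ordinary covalent bonds lie along one axis and the two 3c-4e bonds along the other two, with each S-F contact in a 3c-4e bond counted as a half-bond.

```python
from fractions import Fraction


def average_bond_orders(structures):
    """Average each bond's order over equally weighted resonance structures."""
    n = len(structures)
    return {bond: sum(Fraction(s[bond]) for s in structures) / n
            for bond in structures[0]}


# PF5: three covalent equatorial bonds; the axial pair alternates
# between one covalent bond (order 1) and one ionic contact (order 0).
pf5_forms = [
    {"eq1": 1, "eq2": 1, "eq3": 1, "ax1": 1, "ax2": 0},
    {"eq1": 1, "eq2": 1, "eq3": 1, "ax1": 0, "ax2": 1},
]

# SF6: one structure per choice of the axis carrying the two covalent bonds.
half = Fraction(1, 2)
axes = ("x", "y", "z")
sf6_forms = []
for covalent_axis in axes:
    form = {}
    for axis in axes:
        order = Fraction(1) if axis == covalent_axis else half
        form[f"+{axis}"] = order
        form[f"-{axis}"] = order
    sf6_forms.append(form)

print(average_bond_orders(pf5_forms)["ax1"])   # 1/2
print(average_bond_orders(sf6_forms)["+x"])    # 2/3
```

Under these assumptions every S-F bond in SF6 averages to the same value, consistent with six equal bond lengths, while in PF5 the axial bonds come out weaker than the equatorial ones.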
Hexacoordinate phosphorus molecules involving nitrogen, oxygen, or sulfur ligands provide examples of Lewis acid-Lewis base hexacoordination.[12] For the two similar complexes shown below, the length of the C-P bond increases with decreasing length of the N-P bond; the strength of the C-P bond decreases with increasing strength of the N-P Lewis acid-Lewis base interaction.
This trend is also generally true of pentacoordinated main-group elements with one or more lone-pair-containing ligands, including the oxygen-pentacoordinated silicon examples shown below.
Interestingly, complexes such as these provide a model for the SN2 transition state; the Si-O bonds range from close to the expected van der Waals value in A (a weak bond, representing an early SN2 transition state) almost to the expected covalent single bond value in C (a strong bond, representing a late SN2 transition state).[12]
Corriu and coworkers performed early work characterizing reactions thought to proceed through a hypervalent transition state.[13] Measurements of the reaction rates of hydrolysis of tetravalent chlorosilanes incubated with catalytic amounts of water returned a rate that is first order in chlorosilane and second order in water. This indicated that two water molecules interacted with the silane during hydrolysis, and from this a binucleophilic reaction mechanism was proposed. Corriu and coworkers then measured the rates of hydrolysis in the presence of a nucleophilic catalyst (HMPT, DMSO or DMF). The rate of hydrolysis was again first order in chlorosilane, but was now first order in catalyst and first order in water. The rates of hydrolysis also depended on the magnitude of the charge on the oxygen of the nucleophile.
Taken together, these results led the group to propose a reaction mechanism in which a pre-rate-determining nucleophilic attack on the tetracoordinated silane by the nucleophile (or water) forms a hypervalent pentacoordinated silane. This is followed by nucleophilic attack on the intermediate by water in a rate-determining step, leading to a hexacoordinated species that quickly decomposes to give the hydroxysilane.
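The two observed rate laws can be written out as a short sketch. The rate constant and concentrations below are made-up illustrative numbers, not measured values, and the function names are hypothetical. The sketch shows how a reaction order is recovered by scaling one concentration and comparing rates, which is the reasoning behind "second order in water" versus "first order in water".

```python
import math

# Reaction orders as reported in the text for the two sets of conditions.
UNCATALYSED = {"chlorosilane": 1, "water": 2}
CATALYSED = {"chlorosilane": 1, "catalyst": 1, "water": 1}


def rate(k, conc, orders):
    """Power-law rate: k times the product of concentration**order."""
    result = k
    for species, order in orders.items():
        result *= conc[species] ** order
    return result


def apparent_order(k, conc, orders, species, factor=2.0):
    """Order in one species, from the rate change when its concentration is scaled."""
    scaled = dict(conc)
    scaled[species] = conc[species] * factor
    return math.log(rate(k, scaled, orders) / rate(k, conc, orders)) / math.log(factor)


# Hypothetical concentrations (mol/L) and rate constant, for illustration only.
conc = {"chlorosilane": 0.10, "water": 0.20, "catalyst": 0.05}
print(round(apparent_order(1.0, conc, UNCATALYSED, "water"), 6))  # 2.0
print(round(apparent_order(1.0, conc, CATALYSED, "water"), 6))    # 1.0
```

Doubling the water concentration quadruples the uncatalysed rate but only doubles the catalysed one, consistent with the catalyst taking the place of one of the two water molecules.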
Silane hydrolysis was further investigated by Holmes and coworkers,[14] who reacted tetracoordinated Mes₂SiF₂ (Mes = mesityl) and pentacoordinated Mes₂SiF₃⁻ with two equivalents of water. After twenty-four hours, almost no hydrolysis of the tetracoordinated silane was observed, while the pentacoordinated silane was completely hydrolyzed after fifteen minutes. Additionally, X-ray diffraction data collected for the tetraethylammonium salts of the fluorosilanes showed the formation of a hydrogen bisilonate lattice, supporting a hexacoordinated intermediate from which HF₂⁻ is quickly displaced, leading to the hydroxylated product. This reaction and the crystallographic data support the mechanism proposed by Corriu et al.
The apparent increased reactivity of hypervalent molecules, contrasted with tetravalent analogues, has also been observed for Grignard reactions. The Corriu group measured Grignard reaction half-times by NMR for related 18-crown-6 potassium salts of a variety of tetra- and pentacoordinated methylphenylfluorosilanes in the presence of catalytic amounts of nucleophile.[15]
Though the half-reaction method is imprecise, the differences in magnitude of the reaction rates allowed for a proposed reaction scheme in which a pre-rate-determining attack on the tetravalent silane by the nucleophile results in an equilibrium between the neutral tetracoordinated species and the anionic pentavalent compound. This is followed by nucleophilic coordination by two Grignard reagents, as normally seen, forming a hexacoordinated transition state and yielding the expected product.
Similar reactivity has also been observed for other hypervalent structures, such as a variety of phosphorus compounds, for which hexacoordinated transition states have been proposed. Hydrolysis of phosphoranes and oxyphosphoranes has been studied [16] and shown to be second order in water. Bel'skii et al. have proposed a pre-rate-determining nucleophilic attack by water, resulting in an equilibrium between the penta- and hexacoordinated phosphorus species, which is followed by a proton transfer involving the second water molecule in a rate-determining ring-opening step, leading to the hydroxylated product.
Alcoholysis of pentacoordinated phosphorus compounds, such as trimethoxyphospholene with benzyl alcohol, has also been postulated to occur through a similar octahedral transition state, as in hydrolysis, but without ring opening.[17]
It can be understood from these experiments that the increased reactivity observed for hypervalent molecules, contrasted with analogous nonhypervalent compounds, can be attributed to the congruence of these species to the hypercoordinated activated states normally formed during the course of the reaction.
The enhanced reactivity at pentacoordinated silicon is not fully understood. Corriu and coworkers suggested that greater electropositive character at the pentavalent silicon atom may be responsible for its increased reactivity.[18] Preliminary ab initio calculations supported this hypothesis to some degree, but used a small basis set.[19]
A software program for ab initio calculations, Gaussian 86, was used by Dieters and coworkers to compare tetracoordinated silicon and phosphorus to their pentacoordinate analogues. This ab initio approach is used as a supplement to determine why reactivity improves in nucleophilic reactions with pentacoordinated compounds. For silicon, the 6-31+G* basis set was used because the pentacoordinated species are anionic; for phosphorus, the 6-31G* basis set was used.[19]
Pentacoordinated compounds should theoretically be less electrophilic than tetracoordinated analogues due to steric hindrance and greater electron density from the ligands, yet experimentally show greater reactivity with nucleophiles than their tetracoordinated analogues. Advanced ab initio calculations were performed on a series of tetracoordinated and pentacoordinated species to further understand this reactivity phenomenon. Each series varied by degree of fluorination. Bond lengths and charge densities are shown as functions of how many hydride ligands are on the central atoms. For every new hydride, there is one fewer fluoride.[19]
Bond lengths, charge densities, and Mulliken bond overlap populations were calculated for tetra- and pentacoordinated silicon and phosphorus species by this ab initio approach.[19] Addition of a fluoride ion to tetracoordinated silicon shows an overall average increase of 0.1 electron charge, which is considered insignificant. In general, bond lengths in trigonal bipyramidal pentacoordinate species are longer than those in tetracoordinate analogues. Si-F bonds and Si-H bonds both increase in length upon pentacoordination, and related effects are seen in phosphorus species, but to a lesser degree. The greater change in bond length for silicon species than for phosphorus species is attributed to the higher effective nuclear charge at phosphorus. Therefore, silicon is concluded to be more loosely bound to its ligands.
Effects of fluorine substitution on positive charge density
In addition, Dieters and coworkers [19] show an inverse correlation between bond length and bond overlap for all series. Pentacoordinated species are concluded to be more reactive because of the looser bonds in their trigonal-bipyramidal structures.
Calculated bond length and bond overlap with degree of fluorination
The energies for addition and removal of a fluoride ion in silicon and phosphorus species were calculated.
As the table shows, tetracoordinated species have much higher energy requirements for ligand removal than do pentacoordinated species. Overall, silicon species have lower energy requirements for ligand removal than do phosphorus species, which is an indication of weaker bonds in silicon.
In conclusion, it has been shown that charge density changes are insignificant in accounting for enhanced reactivity with nucleophiles in hypercoordinated silicon and phosphorus. Instead, the enhanced reactivity is attributed to weaker bonds, particularly in the axial positions, of pentacoordinated species.[19]
The mechanistic implications of this are extended to a hexacoordinated silicon species, which is thought to be active as a transition state in reactions such as the allylation of aldehydes with allyltrifluorosilane. The reaction only proceeds with fluoride activation to the pentacoordinated state, and weakening of the bond between silicon and carbon in the hexacoordinate state drives this reaction.[20]