Implicit solvation (sometimes termed continuum solvation) is a method to represent
solvent
A solvent (s) (from the Latin '' solvō'', "loosen, untie, solve") is a substance that dissolves a solute, resulting in a solution. A solvent is usually a liquid but can also be a solid, a gas, or a supercritical fluid. Water is a solvent for ...
as a continuous medium instead of individual “explicit” solvent molecules, most often used in
molecular dynamics
Molecular dynamics (MD) is a computer simulation method for analyzing the physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamic "evolution" of the ...
simulations and in other applications of
molecular mechanics
Molecular mechanics uses classical mechanics to model molecular systems. The Born–Oppenheimer approximation is assumed valid and the potential energy of all systems is calculated as a function of the nuclear coordinates using force fields. Mo ...
. The method is often applied to estimate
free energy of
solute
In chemistry, a solution is a special type of homogeneous mixture composed of two or more substances. In such a mixture, a solute is a substance dissolved in another substance, known as a solvent. If the attractive forces between the solvent ...
-
solvent
A solvent (s) (from the Latin '' solvō'', "loosen, untie, solve") is a substance that dissolves a solute, resulting in a solution. A solvent is usually a liquid but can also be a solid, a gas, or a supercritical fluid. Water is a solvent for ...
interactions in structural and chemical processes, such as folding or
conformational transitions of
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respo ...
s,
DNA,
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
, and
polysaccharide
Polysaccharides (), or polycarbohydrates, are the most abundant carbohydrates found in food. They are long chain polymeric carbohydrates composed of monosaccharide units bound together by glycosidic linkages. This carbohydrate can react with wa ...
s, association of biological macromolecules with
ligand
In coordination chemistry, a ligand is an ion or molecule (functional group) that binds to a central metal atom to form a coordination complex. The bonding with the metal generally involves formal donation of one or more of the ligand's electr ...
s, or transport of
drug
A drug is any chemical substance that causes a change in an organism's physiology or psychology when consumed. Drugs are typically distinguished from food and substances that provide nutritional support. Consumption of drugs can be via insuffla ...
s across
biological membrane
A biological membrane, biomembrane or cell membrane is a selectively permeable membrane that separates the interior of a cell from the external environment or creates intracellular compartments by serving as a boundary between one part of the ce ...
s.
The implicit solvation model is justified in liquids, where the
potential of mean force When examining a system computationally one may be interested in knowing how the free energy changes as a function of some inter- or intramolecular coordinate (such as the distance between two atoms or a torsional angle). The free energy surface alo ...
can be applied to approximate the averaged behavior of many highly dynamic solvent molecules. However, the interfaces and the interiors of
biological membrane
A biological membrane, biomembrane or cell membrane is a selectively permeable membrane that separates the interior of a cell from the external environment or creates intracellular compartments by serving as a boundary between one part of the ce ...
s or
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respo ...
s can also be considered as media with specific
solvation
Solvation (or dissolution) describes the interaction of a solvent with dissolved molecules. Both ionized and uncharged molecules interact strongly with a solvent, and the strength and nature of this interaction influence many properties of the ...
or
dielectric
In electromagnetism, a dielectric (or dielectric medium) is an electrical insulator that can be polarised by an applied electric field. When a dielectric material is placed in an electric field, electric charges do not flow through the mate ...
properties. These media are not necessarily uniform, since their properties can be described by different analytical functions, such as “polarity profiles” of
lipid bilayer
The lipid bilayer (or phospholipid bilayer) is a thin polar membrane made of two layers of lipid molecules. These membranes are flat sheets that form a continuous barrier around all cells. The cell membranes of almost all organisms and many vir ...
s.
There are two basic types of implicit solvent methods: models based on
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measurement ...
s (ASA) that were historically the first, and more recent continuum electrostatics models, although various modifications and combinations of the different methods are possible.
The accessible surface area (ASA) method is based on experimental linear relations between
Gibbs free energy
In thermodynamics, the Gibbs free energy (or Gibbs energy; symbol G) is a thermodynamic potential that can be used to calculate the maximum amount of work that may be performed by a thermodynamically closed system at constant temperature and pr ...
of transfer and the
surface area
The surface area of a solid object is a measure of the total area that the surface of the object occupies. The mathematical definition of surface area in the presence of curved surfaces is considerably more involved than the definition of arc ...
of a
solute
In chemistry, a solution is a special type of homogeneous mixture composed of two or more substances. In such a mixture, a solute is a substance dissolved in another substance, known as a solvent. If the attractive forces between the solvent ...
molecule.
This method operates directly with free energy of
solvation
Solvation (or dissolution) describes the interaction of a solvent with dissolved molecules. Both ionized and uncharged molecules interact strongly with a solvent, and the strength and nature of this interaction influence many properties of the ...
, unlike
molecular mechanics
Molecular mechanics uses classical mechanics to model molecular systems. The Born–Oppenheimer approximation is assumed valid and the potential energy of all systems is calculated as a function of the nuclear coordinates using force fields. Mo ...
or
electrostatic
Electrostatics is a branch of physics that studies electric charges at rest (static electricity).
Since classical times, it has been known that some materials, such as amber, attract lightweight particles after rubbing. The Greek word for amber ...
methods that include only the
enthalpic
Enthalpy , a property of a thermodynamic system, is the sum of the system's internal energy and the product of its pressure and volume. It is a state function used in many measurements in chemical, biological, and physical systems at a constant p ...
component of free energy. The continuum representation of solvent also significantly improves the computational speed and reduces errors in statistical averaging that arise from incomplete sampling of solvent conformations,
so that the energy landscapes obtained with implicit and explicit solvent are different.
Although the implicit solvent model is useful for simulations of biomolecules, this is an approximate method with certain limitations and problems related to parameterization and treatment of
ionization
Ionization, or Ionisation is the process by which an atom or a molecule acquires a negative or positive charge by gaining or losing electrons, often in conjunction with other chemical changes. The resulting electrically charged atom or molecule i ...
effects.
Accessible surface area-based method
The free energy of solvation of a
solute
In chemistry, a solution is a special type of homogeneous mixture composed of two or more substances. In such a mixture, a solute is a substance dissolved in another substance, known as a solvent. If the attractive forces between the solvent ...
molecule in the simplest ASA-based method is given by:
:
where
is the
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measurement ...
of atom ''i'', and
is ''solvation parameter'' of atom ''i'', i.e., a contribution to the free energy of
solvation
Solvation (or dissolution) describes the interaction of a solvent with dissolved molecules. Both ionized and uncharged molecules interact strongly with a solvent, and the strength and nature of this interaction influence many properties of the ...
of the particular atom i per surface unit area. The needed solvation parameters for different types of atoms (
carbon
Carbon () is a chemical element with the symbol C and atomic number 6. It is nonmetallic and tetravalent
In chemistry, the valence (US spelling) or valency (British spelling) of an element is the measure of its combining capacity with o ...
(C),
nitrogen
Nitrogen is the chemical element with the symbol N and atomic number 7. Nitrogen is a nonmetal and the lightest member of group 15 of the periodic table, often called the pnictogens. It is a common element in the universe, estimated at se ...
(N),
oxygen
Oxygen is the chemical element with the symbol O and atomic number 8. It is a member of the chalcogen group in the periodic table, a highly reactive nonmetal, and an oxidizing agent that readily forms oxides with most elements as wel ...
(O),
sulfur
Sulfur (or sulphur in British English) is a chemical element with the symbol S and atomic number 16. It is abundant, multivalent and nonmetallic. Under normal conditions, sulfur atoms form cyclic octatomic molecules with a chemical formula ...
(S), etc.) are usually determined by a
least squares
The method of least squares is a standard approach in regression analysis to approximate the solution of overdetermined systems (sets of equations in which there are more equations than unknowns) by minimizing the sum of the squares of the res ...
fit of the calculated and experimental transfer free energies for a series of
organic compound
In chemistry, organic compounds are generally any chemical compounds that contain carbon-hydrogen or carbon-carbon bonds. Due to carbon's ability to catenate (form chains with other carbon atoms), millions of organic compounds are known. The ...
s. The experimental energies are determined from
partition coefficient
In the physical sciences, a partition coefficient (''P'') or distribution coefficient (''D'') is the ratio of concentrations of a compound in a mixture of two immiscible solvents at equilibrium. This ratio is therefore a comparison of the solub ...
s of these compounds between different solutions or media using standard mole concentrations of the solutes.
Notably, ''solvation energy'' is the free energy needed to transfer a solute molecule from a solvent to ''vacuum'' (gas phase). This energy can supplement the intramolecular energy in vacuum calculated in
molecular mechanics
Molecular mechanics uses classical mechanics to model molecular systems. The Born–Oppenheimer approximation is assumed valid and the potential energy of all systems is calculated as a function of the nuclear coordinates using force fields. Mo ...
. Thus, the needed atomic solvation parameters were initially derived from water-gas partition data.
However, the dielectric properties of proteins and
lipid bilayer
The lipid bilayer (or phospholipid bilayer) is a thin polar membrane made of two layers of lipid molecules. These membranes are flat sheets that form a continuous barrier around all cells. The cell membranes of almost all organisms and many vir ...
s are much more similar to those of nonpolar solvents than to vacuum. Newer parameters have thus been derived from
octanol-water partition coefficient
The ''n''-octanol-water partition coefficient, ''K''ow is a partition coefficient for the two-phase system consisting of ''n''-octanol and water. ''K''ow is also frequently referred to by the symbol P, especially in the English literature. It is a ...
s
or other similar data. Such parameters actually describe ''transfer'' energy between two condensed media or the ''difference'' of two solvation energies.
Poisson-Boltzmann
The
Poisson-Boltzmann equation (PB) describes the electrostatic environment of a solute in a solvent containing
ion
An ion () is an atom or molecule with a net electrical charge.
The charge of an electron is considered to be negative by convention and this charge is equal and opposite to the charge of a proton, which is considered to be positive by conve ...
s. It can be written in
cgs units as:
:
or (in
mks):
:
where
represents the position-dependent dielectric,
represents the electrostatic potential,
represents the charge density of the solute,
represents the concentration of the ion ''i'' at a distance of infinity from the solute,
is the valence of the ion, ''q'' is the charge of a proton, ''k'' is the
Boltzmann constant
The Boltzmann constant ( or ) is the proportionality factor that relates the average relative kinetic energy of particles in a gas with the thermodynamic temperature of the gas. It occurs in the definitions of the kelvin and the gas constant, ...
, ''T'' is the
temperature
Temperature is a physical quantity that expresses quantitatively the perceptions of hotness and coldness. Temperature is measured with a thermometer.
Thermometers are calibrated in various temperature scales that historically have relied o ...
, and
is a factor for the position-dependent accessibility of position ''r'' to the ions in solution (often set to uniformly 1). If the potential is not large, the equation can be
linearized to be solved more efficiently.
Although this equation has solid theoretical justification, it is computationally expensive to calculate without approximations. A number of numerical Poisson-Boltzmann equation solvers of varying generality and efficiency have been developed,
including one application with a specialized computer hardware platform.
However, performance from PB solvers does not yet equal that from the more commonly used generalized Born approximation.
Generalized Born model
The ''Generalized Born'' (GB) model is an approximation to the exact (linearized) Poisson-Boltzmann equation. It is based on modeling the solute as a set of spheres whose internal dielectric constant differs from the external solvent. The model has the following functional form:
:
where
:
and
where
is the
permittivity of free space
Vacuum permittivity, commonly denoted (pronounced "epsilon nought" or "epsilon zero"), is the value of the absolute dielectric permittivity of classical vacuum. It may also be referred to as the permittivity of free space, the electric consta ...
,
is the
dielectric constant
The relative permittivity (in older texts, dielectric constant) is the permittivity of a material expressed as a ratio with the electric permittivity of a vacuum. A dielectric is an insulating material, and the dielectric constant of an insulat ...
of the solvent being modeled,
is the
electrostatic charge
Electric charge is the physical property of matter that causes charged matter to experience a force when placed in an electromagnetic field. Electric charge can be ''positive'' or ''negative'' (commonly carried by protons and electrons respectiv ...
on particle ''i'',
is the distance between particles ''i'' and ''j'', and
is a quantity (with the dimension of length) termed the ''effective Born radius''.
The effective Born radius of an atom characterizes its degree of burial inside the solute; qualitatively it can be thought of as the distance from the atom to the molecular surface. Accurate estimation of the effective Born radii is critical for the GB model.
With accessible surface area
The Generalized Born (GB) model augmented with the hydrophobic solvent accessible surface area (SA) term is GBSA. It is among the most commonly used implicit solvent model combinations. The use of this model in the context of
molecular mechanics
Molecular mechanics uses classical mechanics to model molecular systems. The Born–Oppenheimer approximation is assumed valid and the potential energy of all systems is calculated as a function of the nuclear coordinates using force fields. Mo ...
is termed MM/GBSA. Although this formulation has been shown to successfully identify the
native state
In biochemistry, the native state of a protein or nucleic acid is its properly folded and/or assembled form, which is operative and functional. The native state of a biomolecule may possess all four levels of biomolecular structure, with the s ...
s of short peptides with well-defined
tertiary structure
Protein tertiary structure is the three dimensional shape of a protein. The tertiary structure will have a single polypeptide chain "backbone" with one or more protein secondary structures, the protein domains. Amino acid side chains may int ...
,
the conformational ensembles produced by GBSA models in other studies differ significantly from those produced by explicit solvent and do not identify the protein's native state.
In particular,
salt bridge
In electrochemistry, a salt bridge or ion bridge is a laboratory device used to connect the oxidation and reduction half-cells of a galvanic cell (voltaic cell), a type of electrochemical cell. It maintains electrical neutrality within the in ...
s are overstabilized, possibly due to insufficient electrostatic screening, and a higher-than-native
alpha helix
The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group hydrogen bonds to the backbone C=O group of the amino acid located four residues e ...
population was observed. Variants of the GB model have also been developed to approximate the electrostatic environment of membranes, which have had some success in folding the
transmembrane helix
A transmembrane domain (TMD) is a membrane-spanning protein domain. TMDs generally adopt an alpha helix topological conformation, although some TMDs such as those in porins can adopt a different conformation. Because the interior of the lipid bi ...
es of
integral membrane protein
An integral, or intrinsic, membrane protein (IMP) is a type of membrane protein that is permanently attached to the biological membrane. All ''transmembrane proteins'' are IMPs, but not all IMPs are transmembrane proteins. IMPs comprise a signi ...
s.
Ad hoc fast solvation models
Another possibility is to use ad hoc quick strategies to estimate solvation free energy. A first generation of fast implicit solvents is based on the calculation of a per-atom solvent accessible surface area. For each of group of atom types, a different parameter scales its contribution to solvation ("ASA-based model" described above).
Another strategy is implemented for the
CHARMM
Chemistry at Harvard Macromolecular Mechanics (CHARMM) is the name of a widely used set of force fields for molecular dynamics, and the name for the molecular dynamics simulation and analysis computer software package associated with them. The CHA ...
19 force-field and is called EEF1.
EEF1 is based on a Gaussian-shaped solvent exclusion. The solvation free energy is
:
The reference solvation free energy of ''i'' corresponds to a suitably chosen small molecule in which group i is essentially fully solvent-exposed. The integral is over the volume ''V
j'' of group ''j'' and the summation is over all groups ''j'' around ''i''. EEF1 additionally uses a distance-dependent (non-constant) dielectric, and ionic side-chains of proteins are simply neutralized. It is only 50% slower than a vacuum simulation. This model was later augmented with the hydrophobic effect and called Charmm19/SASA.
Hybrid implicit-explicit solvation models
It is possible to include a layer or sphere of water molecules around the solute, and model the bulk with an implicit solvent. Such an approach is proposed by M. J. Frisch and coworkers
and by other authors.
For instance in Ref.
the bulk solvent is modeled with a Generalized Born approach and the multi-grid method used for Coulombic pairwise particle interactions. It is reported to be faster than a full explicit solvent simulation with the particle mesh
Ewald summation Ewald summation, named after Paul Peter Ewald, is a method for computing long-range interactions (e.g. electrostatic interactions) in periodic systems. It was first developed as the method for calculating electrostatic energies of ionic crystals, an ...
(PME) method of electrostatic calculation. There are a range of hybrid methods available capable of accessing and acquiring information on solvation.
Effects unaccounted for
The hydrophobic effect
Models like PB and GB allow estimation of the mean electrostatic free energy but do not account for the (mostly)
entropic
Entropy is a scientific concept, as well as a measurable physical property, that is most commonly associated with a state of disorder, randomness, or uncertainty. The term and the concept are used in diverse fields, from classical thermodynam ...
effects arising from solute-imposed constraints on the organization of the water or solvent molecules. This is termed the
hydrophobic effect
The hydrophobic effect is the observed tendency of nonpolar substances to aggregate in an aqueous solution and exclude water molecules. The word hydrophobic literally means "water-fearing", and it describes the segregation of water and nonpolar ...
and is a major factor in the
folding
Fold, folding or foldable may refer to:
Arts, entertainment, and media
* ''Fold'' (album), the debut release by Australian rock band Epicure
* Fold (poker), in the game of poker, to discard one's hand and forfeit interest in the current pot
*Abov ...
process of
globular protein
In biochemistry, globular proteins or spheroproteins are spherical ("globe-like") proteins and are one of the common protein types (the others being fibrous, disordered and membrane proteins). Globular proteins are somewhat water-soluble (formi ...
s with
hydrophobic core
The hydrophobic effect is the observed tendency of nonpolar substances to aggregate in an aqueous solution and exclude water molecules. The word hydrophobic literally means "water-fearing", and it describes the segregation of water and nonpolar ...
s. Implicit solvation models may be augmented with a term that accounts for the hydrophobic effect. The most popular way to do this is by taking the solvent accessible surface area (SASA) as a
proxy
Proxy may refer to:
* Proxy or agent (law), a substitute authorized to act for another entity or a document which authorizes the agent so to act
* Proxy (climate), a measured variable used to infer the value of a variable of interest in climate ...
of the extent of the hydrophobic effect. Most authors place the extent of this effect between 5 and 45 cal/(Å
2 mol). Note that this surface area pertains to the solute, while the hydrophobic effect is mostly entropic in nature at physiological temperatures and occurs on the side of the solvent.
Viscosity
Implicit solvent models such as PB, GB, and SASA lack the viscosity that water molecules impart by randomly colliding and impeding the motion of solutes through their van der Waals repulsion. In many cases, this is desirable because it makes sampling of configurations and
phase space
In dynamical system theory, a phase space is a space in which all possible states of a system are represented, with each possible state corresponding to one unique point in the phase space. For mechanical systems, the phase space usually ...
much faster. This acceleration means that more configurations are visited per simulated time unit, on top of whatever CPU acceleration is achieved in comparison to explicit solvent. It can, however, lead to misleading results when kinetics are of interest.
Viscosity may be added back by using
Langevin dynamics
In physics, Langevin dynamics is an approach to the mathematical modeling of the dynamics of molecular systems. It was originally developed by French physicist Paul Langevin. The approach is characterized by the use of simplified models while acco ...
instead of
Hamiltonian mechanics
Hamiltonian mechanics emerged in 1833 as a reformulation of Lagrangian mechanics. Introduced by Sir William Rowan Hamilton, Hamiltonian mechanics replaces (generalized) velocities \dot q^i used in Lagrangian mechanics with (generalized) ''momenta ...
and choosing an appropriate damping constant for the particular solvent. In practical bimolecular simulations one can often speed-up conformational search significantly (up to 100 times in some cases) by using much lower collision frequency
. Recent work has also been done developing thermostats based on fluctuating hydrodynamics to account for momentum transfer through the solvent and related thermal fluctuations.
One should keep in mind, though, that the folding rate of proteins does not depend linearly on viscosity for all regimes.
Hydrogen bonds with solvent
Solute-solvent
hydrogen bond
In chemistry, a hydrogen bond (or H-bond) is a primarily electrostatic force of attraction between a hydrogen (H) atom which is covalently bound to a more electronegative "donor" atom or group (Dn), and another electronegative atom bearing a ...
s in the first
solvation shell
A solvation shell or solvation sheath is the solvent interface of any chemical compound or biomolecule that constitutes the solute. When the solvent is water it is called a hydration shell or hydration sphere. The number of solvent molecules sur ...
are important for solubility of organic molecules and especially
ion
An ion () is an atom or molecule with a net electrical charge.
The charge of an electron is considered to be negative by convention and this charge is equal and opposite to the charge of a proton, which is considered to be positive by conve ...
s. Their average energetic contribution can be reproduced with an implicit solvent model.
Problems and limitations
All implicit solvation models rest on the simple idea that nonpolar atoms of a
solute
In chemistry, a solution is a special type of homogeneous mixture composed of two or more substances. In such a mixture, a solute is a substance dissolved in another substance, known as a solvent. If the attractive forces between the solvent ...
tend to cluster together or occupy nonpolar media, whereas polar and charged groups of the solute tend to remain in water. However, it is important to properly balance the opposite energy contributions from different types of atoms. Several important points have been discussed and investigated over the years.
Choice of model solvent
It has been noted that wet
1-octanol solution is a poor approximation of proteins or biological membranes because it contains ~2M of water, and that
cyclohexane
Cyclohexane is a cycloalkane with the molecular formula . Cyclohexane is non-polar. Cyclohexane is a colorless, flammable liquid with a distinctive detergent-like odor, reminiscent of cleaning products (in which it is sometimes used). Cyclohexan ...
would be a much better approximation.
Investigation of passive permeability barriers for different compounds across lipid bilayers led to conclusion that 1,9-decadiene can serve as a good approximations of the bilayer interior,
whereas
1-octanol was a very poor approximation.
A set of solvation parameters derived for protein interior from
protein engineering
Protein engineering is the process of developing useful or valuable proteins. It is a young discipline, with much research taking place into the understanding of protein folding and recognition for protein design principles. It has been used to imp ...
data was also different from octanol scale: it was close to
cyclohexane
Cyclohexane is a cycloalkane with the molecular formula . Cyclohexane is non-polar. Cyclohexane is a colorless, flammable liquid with a distinctive detergent-like odor, reminiscent of cleaning products (in which it is sometimes used). Cyclohexan ...
scale for nonpolar atoms but intermediate between cyclohexane and octanol scales for polar atoms.
Thus, different atomic solvation parameters should be applied for modeling of protein folding and protein-membrane binding. This issue remains controversial. The original idea of the method was to derive all solvation parameters directly from experimental
partition coefficient
In the physical sciences, a partition coefficient (''P'') or distribution coefficient (''D'') is the ratio of concentrations of a compound in a mixture of two immiscible solvents at equilibrium. This ratio is therefore a comparison of the solub ...
s of organic molecules, which allows calculation of solvation free energy. However, some of the recently developed electrostatic models use ''ad hoc'' values of 20 or 40 cal/(Å
2 mol) for ''all'' types of atoms. The non-existent “hydrophobic” interactions of polar atoms are overridden by large electrostatic energy penalties in such models.
Solid-state applications
Strictly speaking, ASA-based models should only be applied to describe ''solvation'', i.e., energetics of transfer between
liquid
A liquid is a nearly incompressible fluid that conforms to the shape of its container but retains a (nearly) constant volume independent of pressure. As such, it is one of the four fundamental states of matter (the others being solid, gas, a ...
or uniform media. It is possible to express van der Waals interaction energies in the
solid
Solid is one of the State of matter#Four fundamental states, four fundamental states of matter (the others being liquid, gas, and Plasma (physics), plasma). The molecules in a solid are closely packed together and contain the least amount o ...
state in the surface energy units. This was sometimes done for interpreting
protein engineering
Protein engineering is the process of developing useful or valuable proteins. It is a young discipline, with much research taking place into the understanding of protein folding and recognition for protein design principles. It has been used to imp ...
and
ligand binding
In biochemistry and pharmacology, a ligand is a substance that forms a complex with a biomolecule to serve a biological purpose. The etymology stems from ''ligare'', which means 'to bind'. In protein-ligand binding, the ligand is usually a mo ...
energetics,
which leads to “solvation” parameter for
aliphatic
In organic chemistry, hydrocarbons ( compounds composed solely of carbon and hydrogen) are divided into two classes: aromatic compounds and aliphatic compounds (; G. ''aleiphar'', fat, oil). Aliphatic compounds can be saturated, like hexane, ...
carbon of ~40 cal/(Å
2 mol),
which is 2 times bigger than ~20 cal/(Å
2 mol) obtained for transfer from water to liquid hydrocarbons, because the parameters derived by such fitting represent sum of the hydrophobic energy (i.e., 20 cal/Å
2 mol) and energy of van der Waals attractions of aliphatic groups in the solid state, which corresponds to
fusion enthalpy of
alkane
In organic chemistry, an alkane, or paraffin (a historical trivial name that also has other meanings), is an acyclic saturated hydrocarbon. In other words, an alkane consists of hydrogen and carbon atoms arranged in a tree structure in which ...
s.
Unfortunately, the simplified ASA-based model cannot capture the "specific" distance-dependent interactions between different types of atoms in the solid state which are responsible for clustering of atoms with similar polarities in protein structures and molecular crystals. Parameters of such interatomic interactions, together with atomic solvation parameters for the protein interior, have been approximately derived from
protein engineering
Protein engineering is the process of developing useful or valuable proteins. It is a young discipline, with much research taking place into the understanding of protein folding and recognition for protein design principles. It has been used to imp ...
data.
The implicit solvation model breaks down when solvent molecules associate strongly with binding cavities in a protein, so that the protein and the solvent molecules form a continuous solid body.
On the other hand, this model can be successfully applied for describing transfer from water to the ''
fluid
In physics, a fluid is a liquid, gas, or other material that continuously deforms (''flows'') under an applied shear stress, or external force. They have zero shear modulus, or, in simpler terms, are substances which cannot resist any shear ...
'' lipid bilayer.
Importance of extensive testing
More testing is needed to evaluate the performance of different implicit solvation models and parameter sets. They are often tested only for a small set of molecules with very simple structure, such as hydrophobic and amphiphilic
alpha helix
The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group hydrogen bonds to the backbone C=O group of the amino acid located four residues e ...
es (α). This method was rarely tested for hundreds of protein structures.
Treatment of ionization effects
Ionization of charged groups has been neglected in continuum
electrostatic
Electrostatics is a branch of physics that studies electric charges at rest (static electricity).
Since classical times, it has been known that some materials, such as amber, attract lightweight particles after rubbing. The Greek word for amber ...
models of implicit solvation, as well as in standard
molecular mechanics
Molecular mechanics uses classical mechanics to model molecular systems. The Born–Oppenheimer approximation is assumed valid and the potential energy of all systems is calculated as a function of the nuclear coordinates using force fields. Mo ...
and
molecular dynamics
Molecular dynamics (MD) is a computer simulation method for analyzing the physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamic "evolution" of the ...
. The transfer of an ion from water to a nonpolar medium with
dielectric constant
The relative permittivity (in older texts, dielectric constant) is the permittivity of a material expressed as a ratio with the electric permittivity of a vacuum. A dielectric is an insulating material, and the dielectric constant of an insulat ...
of ~3 (lipid bilayer) or 4 to 10 (interior of proteins) costs significant energy, as follows from the
Born equation and from experiments. However, since the charged protein residues are ionizable, they simply lose their charges in the nonpolar environment, which costs relatively little at the neutral
pH: ~4 to 7 kcal/mol for Asp, Glu, Lys, and Arg
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha am ...
residues, according to the
Henderson-Hasselbalch equation, ''ΔG = 2.3RT (pH - pK)''. The low energetic costs of such ionization effects have indeed been observed for protein mutants with buried ionizable residues.
and hydrophobic α-helical peptides in membranes with a single ionizable residue in the middle.
However, all electrostatic methods, such as PB, GB, or GBSA assume that ionizable groups remain charged in the nonpolar environments, which leads to grossly overestimated electrostatic energy. In the simplest
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measurement ...
-based models, this problem was treated using different solvation parameters for charged atoms or Henderson-Hasselbalch equation with some modifications.
However even the latter approach does not solve the problem. Charged residues can remain charged even in the nonpolar environment if they are involved in intramolecular ion pairs and H-bonds. Thus, the energetic penalties can be overestimated even using the Henderson-Hasselbalch equation. More rigorous theoretical methods describing such ionization effects have been developed,
and there are ongoing efforts to incorporate such methods into the implicit solvation models.
See also
References
{{DEFAULTSORT:Implicit solvation
Molecular modelling
Computational chemistry
Molecular dynamics
Protein structure