In the
physical sciences
Physical science is a branch of natural science that studies non-living systems, in contrast to life science. It in turn has many branches, each referred to as a "physical science", together called the "physical sciences".
Definition
Phy ...
, a partition coefficient (''P'') or distribution coefficient (''D'') is the ratio of
concentration
In chemistry, concentration is the abundance of a constituent divided by the total volume of a mixture. Several types of mathematical description can be distinguished: '' mass concentration'', ''molar concentration'', ''number concentration'', an ...
s of a
compound in a mixture of two
immiscible
Miscibility () is the property of two chemical substance, substances to mix in all mixing ratio, proportions (that is, to fully dissolution (chemistry), dissolve in each other at any concentration), forming a homogeneity and heterogeneity, homoge ...
solvents at
equilibrium. This ratio is therefore a comparison of the solubilities of the solute in these two liquids. The partition coefficient generally refers to the concentration ratio of
un-ionized species of compound, whereas the distribution coefficient refers to the concentration ratio of all species of the compound (ionized plus un-ionized).
In the
chemical and
pharmaceutical sciences
Pharmacy is the science and practice of discovering, producing, preparing, dispensing, reviewing and monitoring medications, aiming to ensure the safe, effective, and affordable use of medicines. It is a miscellaneous science as it links healt ...
, both phases usually are
solvent
A solvent (s) (from the Latin '' solvō'', "loosen, untie, solve") is a substance that dissolves a solute, resulting in a solution. A solvent is usually a liquid but can also be a solid, a gas, or a supercritical fluid. Water is a solvent for ...
s.
Most commonly, one of the solvents is water, while the second is
hydrophobic
In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water.
Hydrophobic molecules tend to be nonpolar and, th ...
, such as
1-octanol
1-Octanol, also known as octan-1-ol, is the organic compound with the molecular formula CH3(CH2)7OH. It is a fatty alcohol. Many other isomers are also known generically as octanols. 1-Octanol is manufactured for the synthesis of esters for use ...
.
Hence the partition coefficient measures how
hydrophilic
A hydrophile is a molecule or other molecular entity that is attracted to water molecules and tends to be dissolved by water.Liddell, H.G. & Scott, R. (1940). ''A Greek-English Lexicon'' Oxford: Clarendon Press.
In contrast, hydrophobes are no ...
("water-loving") or
hydrophobic
In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water.
Hydrophobic molecules tend to be nonpolar and, th ...
("water-fearing") a chemical substance is. Partition coefficients are useful in estimating the
distribution Distribution may refer to:
Mathematics
*Distribution (mathematics), generalized functions used to formulate solutions of partial differential equations
* Probability distribution, the probability of a particular value or value range of a vari ...
of drugs within the body. Hydrophobic drugs with high
octanol-water partition coefficient
The ''n''-octanol-water partition coefficient, ''K''ow is a partition coefficient for the two-phase system consisting of ''n''-octanol and water. ''K''ow is also frequently referred to by the symbol P, especially in the English literature. It is a ...
s are mainly distributed to hydrophobic areas such as
lipid bilayers
The lipid bilayer (or phospholipid bilayer) is a thin polar membrane made of two layers of lipid molecules. These membranes are flat sheets that form a continuous barrier around all cell (biology), cells. The cell membranes of almost all organis ...
of cells. Conversely, hydrophilic drugs (low octanol/water partition coefficients) are found primarily in aqueous regions such as
blood serum
Serum () is the fluid and solute component of blood which does not play a role in clotting. It may be defined as blood plasma without the clotting factors, or as blood with all cells and clotting factors removed. Serum includes all proteins not u ...
.
If one of the solvents is a gas and the other a liquid, a gas/liquid partition coefficient can be determined. For example, the
blood/gas partition coefficient of a
general anesthetic
General anaesthetics (or anesthetics, see spelling differences) are often defined as compounds that induce a loss of consciousness in humans or loss of righting reflex in animals. Clinical definitions are also extended to include an induced coma ...
measures how easily the anesthetic passes from gas to blood. Partition coefficients can also be defined when one of the phases is
solid
Solid is one of the State of matter#Four fundamental states, four fundamental states of matter (the others being liquid, gas, and Plasma (physics), plasma). The molecules in a solid are closely packed together and contain the least amount o ...
, for instance, when one phase is a molten
metal
A metal (from Greek μέταλλον ''métallon'', "mine, quarry, metal") is a material that, when freshly prepared, polished, or fractured, shows a lustrous appearance, and conducts electricity and heat relatively well. Metals are typicall ...
and the second is a solid metal,
or when both phases are solids. The partitioning of a substance into a solid results in a
solid solution
A solid solution, a term popularly used for metals, is a homogenous mixture of two different kinds of atoms in solid state and have a single crystal structure. Many examples can be found in metallurgy, geology, and solid-state chemistry. The word ...
.
Partition coefficients can be measured experimentally in various ways (by shake-flask,
HPLC
High-performance liquid chromatography (HPLC), formerly referred to as high-pressure liquid chromatography, is a technique in analytical chemistry used to separate, identify, and quantify each component in a mixture. It relies on pumps to pa ...
, etc.) or estimated by calculation based on a variety of methods (fragment-based, atom-based, etc.).
If a substance is present as several
chemical species
A chemical species is a chemical substance or ensemble composed of chemically identical molecular entity, molecular entities that can explore the same set of molecular energy levels on a characteristic or delineated time scale. These energy levels ...
in the partition system due to
association
Association may refer to:
*Club (organization), an association of two or more people united by a common interest or goal
*Trade association, an organization founded and funded by businesses that operate in a specific industry
*Voluntary associatio ...
or
dissociation
Dissociation, in the wide sense of the word, is an act of disuniting or separating a complex object into parts. Dissociation may also refer to:
* Dissociation (chemistry), general process in which molecules or ionic compounds (complexes, or salts) ...
, each species is assigned its own ''K''
ow value. A related value, D, does not distinguish between different species, only indicating the concentration ratio of the substance between the two phases.
Nomenclature
Despite formal recommendation to the contrary, the term ''partition coefficient'' remains the predominantly used term in the scientific literature.
In contrast, the
IUPAC
The International Union of Pure and Applied Chemistry (IUPAC ) is an international federation of National Adhering Organizations working for the advancement of the chemical sciences, especially by developing nomenclature and terminology. It is ...
recommends that the title term no longer be used, rather, that it be replaced with more specific terms.
For example, ''partition constant'', defined as
where ''K''
D is the process
equilibrium constant
The equilibrium constant of a chemical reaction is the value of its reaction quotient at chemical equilibrium, a state approached by a dynamic chemical system after sufficient time has elapsed at which its composition has no measurable tendency ...
,
represents the concentration of solute A being tested, and "org" and "aq" refer to the organic and aqueous phases respectively. The IUPAC further recommends "partition ratio" for cases where
transfer activity coefficients can be determined, and "distribution ratio" for the ratio of total analytical concentrations of a solute between phases, regardless of chemical form.
Partition coefficient and log ''P''
The partition coefficient, abbreviated ''P'', is defined as a particular ratio of the
concentration
In chemistry, concentration is the abundance of a constituent divided by the total volume of a mixture. Several types of mathematical description can be distinguished: '' mass concentration'', ''molar concentration'', ''number concentration'', an ...
s of a
solute
In chemistry, a solution is a special type of homogeneous mixture composed of two or more substances. In such a mixture, a solute is a substance dissolved in another substance, known as a solvent. If the attractive forces between the solvent ...
between the two solvents (a biphase of liquid phases), specifically for un-
ionized
Ionization, or Ionisation is the process by which an atom or a molecule acquires a negative or positive charge by gaining or losing electrons, often in conjunction with other chemical changes. The resulting electrically charged atom or molecule ...
solutes, and the
logarithm
In mathematics, the logarithm is the inverse function to exponentiation. That means the logarithm of a number to the base is the exponent to which must be raised, to produce . For example, since , the ''logarithm base'' 10 o ...
of the ratio is thus log ''P''.
When one of the solvents is water and the other is a
non-polar solvent
A solvent (s) (from the Latin '' solvō'', "loosen, untie, solve") is a substance that dissolves a solute, resulting in a solution. A solvent is usually a liquid but can also be a solid, a gas, or a supercritical fluid. Water is a solvent for p ...
, then the log ''P'' value is a measure of
lipophilicity
Lipophilicity (from Greek λίπος "fat" and φίλος "friendly"), refers to the ability of a chemical compound to dissolve in fats, oils, lipids, and non-polar solvents such as hexane or toluene. Such non-polar solvents are themselves lipop ...
or
hydrophobicity
In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water.
Hydrophobic molecules tend to be nonpolar and, th ...
.
[ The defined precedent is for the lipophilic and ]hydrophilic
A hydrophile is a molecule or other molecular entity that is attracted to water molecules and tends to be dissolved by water.Liddell, H.G. & Scott, R. (1940). ''A Greek-English Lexicon'' Oxford: Clarendon Press.
In contrast, hydrophobes are no ...
phase types to always be in the numerator
A fraction (from la, fractus, "broken") represents a part of a whole or, more generally, any number of equal parts. When spoken in everyday English, a fraction describes how many parts of a certain size there are, for example, one-half, eight ...
and denominator respectively; for example, in a biphasic system of ''n''-octanol
Octanols are alcohols with the formula C8H17OH. A simple and important member is 1-octanol, with an unbranched chain of carbons. Other commercially important octanols are 2-octanol and 2-ethylhexanol.
There are 89 possible isomers
In ch ...
(hereafter simply "octanol") and water:
:
To a first approximation, the non-polar phase in such experiments is usually dominated by the un-ionized form of the solute, which is electrically neutral, though this may not be true for the aqueous phase. To measure the ''partition coefficient of ionizable solutes'', the pH of the aqueous phase is adjusted such that the predominant form of the compound in solution is the un-ionized, or its measurement at another pH of interest requires consideration of all species, un-ionized and ionized (see following).
A corresponding partition coefficient for ionizable compounds, abbreviated log ''P'' ''I'', is derived for cases where there are dominant ionized
Ionization, or Ionisation is the process by which an atom or a molecule acquires a negative or positive charge by gaining or losing electrons, often in conjunction with other chemical changes. The resulting electrically charged atom or molecule ...
forms of the molecule, such that one must consider partition of all forms, ionized and un-ionized, between the two phases (as well as the interaction of the two equilibria, partition and ionization). ''M'' is used to indicate the number of ionized forms; for the -th form () the logarithm of the corresponding partition coefficient, , is defined in the same manner as for the un-ionized form. For instance, for an octanol–water partition, it is
:
To distinguish between this and the standard, un-ionized, partition coefficient, the un-ionized is often assigned the symbol log ''P''0, such that the indexed expression for ionized solutes becomes simply an extension of this, into the range of values .
Distribution coefficient and log ''D''
The distribution coefficient, log ''D'', is the ratio of the sum of the concentrations of all forms of the compound (ionized plus un-ionized) in each of the two phases, one essentially always aqueous; as such, it depends on the pH of the aqueous phase, and log ''D'' = log ''P'' for non-ionizable compounds at any pH. For measurements of distribution coefficients, the pH of the aqueous phase is buffered to a specific value such that the pH is not significantly perturbed by the introduction of the compound. The value of each log ''D'' is then determined as the logarithm of a ratio—of the sum of the experimentally measured concentrations of the solute's various forms in one solvent, to the sum of such concentrations of its forms in the other solvent; it can be expressed as
:
In the above formula, the superscripts "ionized" each indicate the sum of concentrations of all ionized species in their respective phases. In addition, since log ''D'' is pH-dependent, the pH at which the log ''D'' was measured must be specified. In areas such as drug discovery—areas involving partition phenomena in biological systems such as the human body—the log ''D'' at the physiologic pH = 7.4 is of particular interest.
It is often convenient to express the log ''D'' in terms of ''P''I, defined above (which includes ''P''0 as state ), thus covering both un-ionized and ionized species. For example, in octanol–water:
:
which sums the individual partition coefficients (not their logarithms), and where indicates the pH-dependent mole fraction
In chemistry, the mole fraction or molar fraction (''xi'' or ) is defined as unit of the amount of a constituent (expressed in moles), ''ni'', divided by the total amount of all constituents in a mixture (also expressed in moles), ''n''tot. This ex ...
of the -th form (of the solute) in the aqueous phase, and other variables are defined as previously.
Example partition coefficient data
The values for the octanol-water system in the following table are from the Dortmund Data Bank
The Dortmund Data Bank (short DDB) is a factual data bank for thermodynamic and thermophysical data. Its main usage is the data supply for process simulation where experimental data are the basis for the design, analysis, synthesis, and optimizati ...
. They are sorted by the partition coefficient, smallest to largest (acetamide being hydrophilic, and 2,2',4,4',5-pentachlorobiphenyl lipophilic), and are presented with the temperature at which they were measured (which impacts the values).
Values for other compounds may be found in a variety of available reviews and monographs. Critical discussions of the challenges of measurement of log ''P'' and related computation of its estimated values (see below) appear in several reviews.
Applications
Pharmacology
A drug's distribution coefficient strongly affects how easily the drug can reach its intended target in the body, how strong an effect it will have once it reaches its target, and how long it will remain in the body in an active form. Hence, the log ''P'' of a molecule is one criterion used in decision-making by medicinal chemists in pre-clinical drug discovery, for example, in the assessment of druglikeness
Druglikeness is a qualitative concept used in drug design for how "druglike" a substance is with respect to factors like bioavailability. It is estimated from the molecular structure before the substance is even synthesized and tested. A druglike m ...
of drug candidates. Likewise, it is used to calculate lipophilic efficiency
Lipophilic efficiency (LiPE), sometimes referred to as ligand-lipophilicity efficiency (LLE) is a parameter used in drug design and drug discovery to evaluate the quality of research compounds, linking potency and lipophilicity in an attempt to es ...
in evaluating the quality of research compounds, where the efficiency for a compound is defined as its potency
Potency may refer to:
* Potency (pharmacology), a measure of the activity of a drug in a biological system
* Virility
* Cell potency, a measure of the differentiation potential of stem cells
* In homeopathic dilutions, potency is a measure of how ...
, via measured values of pIC50 or pEC50, minus its value of log ''P''.
Pharmacokinetics
In the context of pharmacokinetics
Pharmacokinetics (from Ancient Greek ''pharmakon'' "drug" and ''kinetikos'' "moving, putting in motion"; see chemical kinetics), sometimes abbreviated as PK, is a branch of pharmacology dedicated to determining the fate of substances administered ...
(how the body absorbs, metabolizes, and excretes a drug), the distribution coefficient has a strong influence on ADME
ADME is an abbreviation in pharmacokinetics and pharmacology for " absorption, distribution, metabolism, and excretion", and describes the disposition of a pharmaceutical compound within an organism. The four criteria all influence the drug le ...
properties of the drug. Hence the hydrophobicity of a compound (as measured by its distribution coefficient) is a major determinant of how drug-like it is. More specifically, for a drug to be orally absorbed, it normally must first pass through lipid bilayer
The lipid bilayer (or phospholipid bilayer) is a thin polar membrane made of two layers of lipid molecules. These membranes are flat sheets that form a continuous barrier around all cells. The cell membranes of almost all organisms and many vir ...
s in the intestinal epithelium
Epithelium or epithelial tissue is one of the four basic types of animal tissue, along with connective tissue, muscle tissue and nervous tissue. It is a thin, continuous, protective layer of compactly packed cells with a little intercellul ...
(a process known as transcellular
Transcellular transport involves the transportation of solutes by a cell ''through'' a cell. Transcellular transport can occur in three different ways active transport, passive transport, and transcytosis.
Active Transport
Main article: Active ...
transport). For efficient transport, the drug must be hydrophobic enough to partition into the lipid bilayer, but not so hydrophobic, that once it is in the bilayer, it will not partition out again. Likewise, hydrophobicity plays a major role in determining where drugs are distributed within the body after absorption and, as a consequence, in how rapidly they are metabolized and excreted.
Pharmacodynamics
In the context of pharmacodynamics
Pharmacodynamics (PD) is the study of the biochemical and physiologic effects of drugs (especially pharmaceutical drugs). The effects can include those manifested within animals (including humans), microorganisms, or combinations of organisms (fo ...
(how the drug affects the body), the hydrophobic effect
The hydrophobic effect is the observed tendency of nonpolar substances to aggregate in an aqueous solution and exclude water molecules. The word hydrophobic literally means "water-fearing", and it describes the segregation of water and nonpolar ...
is the major driving force for the binding of drugs to their receptor
Receptor may refer to:
* Sensory receptor, in physiology, any structure which, on receiving environmental stimuli, produces an informative nerve impulse
*Receptor (biochemistry), in biochemistry, a protein molecule that receives and responds to a ...
targets. On the other hand, hydrophobic drugs tend to be more toxic because they, in general, are retained longer, have a wider distribution within the body (e.g., intracellular
This glossary of biology terms is a list of definitions of fundamental terms and concepts used in biology, the study of life and of living organisms. It is intended as introductory material for novices; for more specific and technical definitions ...
), are somewhat less selective in their binding to proteins, and finally are often extensively metabolized. In some cases the metabolites may be chemically reactive. Hence it is advisable to make the drug as hydrophilic as possible while it still retains adequate binding affinity to the therapeutic protein target. For cases where a drug reaches its target locations through passive mechanisms (i.e., diffusion through membranes), the ideal distribution coefficient for the drug is typically intermediate in value (neither too lipophilic, nor too hydrophilic); in cases where molecules reach their targets otherwise, no such generalization applies.
Environmental science
The hydrophobicity of a compound can give scientists an indication of how easily a compound might be taken up in groundwater to pollute waterways, and its toxicity to animals and aquatic life. Partition coefficient can also be used to predict the mobility of radionuclides
A radionuclide (radioactive nuclide, radioisotope or radioactive isotope) is a nuclide that has excess nuclear energy, making it unstable. This excess energy can be used in one of three ways: emitted from the nucleus as gamma radiation; transfer ...
in groundwater. In the field of hydrogeology
Hydrogeology (''hydro-'' meaning water, and ''-geology'' meaning the study of the Earth) is the area of geology that deals with the distribution and movement of groundwater in the soil and rocks of the Earth's crust (commonly in aquif ...
, the octanol–water partition coefficient
The ''n''-octanol-water partition coefficient, ''K''ow is a partition coefficient for the two-phase system consisting of ''n''-octanol and water. ''K''ow is also frequently referred to by the symbol P, especially in the English literature. It is a ...
''K''ow is used to predict and model the migration of dissolved hydrophobic organic compounds in soil and groundwater.
Agrochemical research
Hydrophobic insecticides and herbicides tend to be more active. Hydrophobic agrochemicals in general have longer half-lives and therefore display increased risk of adverse environmental impact.
Metallurgy
In metallurgy
Metallurgy is a domain of materials science and engineering that studies the physical and chemical behavior of metallic elements, their inter-metallic compounds, and their mixtures, which are known as alloys.
Metallurgy encompasses both the sc ...
, the partition coefficient is an important factor in determining how different impurities are distributed between molten and solidified metal. It is a critical parameter for purification using zone melting
Zone melting (or zone refining, or floating-zone method, or floating-zone technique) is a group of similar methods of purifying crystals, in which a narrow region of a crystal is melted, and this molten zone is moved along the crystal. The molte ...
, and determines how effectively an impurity can be removed using directional solidification, described by the Scheil equation
In metallurgy, the Scheil-Gulliver equation (or Scheil equation) describes solute redistribution during solidification of an alloy.
Assumptions
Four key assumptions in Scheil analysis enable determination of phases present in a cast part. ...
.
Consumer product development
Many other industries take into account distribution coefficients for example in the formulation of make-up, topical ointments, dyes, hair colors and many other consumer products.
Measurement
A number of methods of measuring distribution coefficients have been developed, including the shake-flask, separating funnel method, reverse-phase HPLC, and pH-metric techniques.
Separating-funnel method
In this method the solid particles present into the two immiscible liquids can be easily separated by suspending those solid particles directly into these immiscible or some what miscible liquids.
Shake flask-type
The classical and most reliable method of log ''P'' determination is the ''shake-flask method'', which consists of dissolving some of the solute in question in a volume of octanol and water, then measuring the concentration of the solute in each solvent. The most common method of measuring the distribution of the solute is by UV/VIS spectroscopy.
HPLC-based
A faster method of log ''P'' determination makes use of high-performance liquid chromatography
High-performance liquid chromatography (HPLC), formerly referred to as high-pressure liquid chromatography, is a technique in analytical chemistry used to separate, identify, and quantify each component in a mixture. It relies on pumps to pa ...
. The log ''P'' of a solute can be determined by correlating its retention time
In chemical analysis, chromatography is a laboratory technique for the separation of a mixture into its components. The mixture is dissolved in a fluid solvent (gas or liquid) called the ''mobile phase'', which carries it through a system (a ...
with similar compounds with known log ''P'' values.
An advantage of this method is that it is fast (5–20 minutes per sample). However, since the value of log ''P'' is determined by linear regression
In statistics, linear regression is a linear approach for modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables). The case of one explanatory variable is call ...
, several compounds with similar structures must have known log ''P'' values, and extrapolation from one chemical class to another—applying a regression equation derived from one chemical class to a second one—may not be reliable, since each chemical classes will have its characteristic regression parameters.
pH-metric
The pH-metric set of techniques determine lipophilicity pH profiles directly from a single acid-base titration in a two-phase water–organic-solvent system. Hence, a single experiment can be used to measure the logarithms of the partition coefficient (log ''P'') giving the distribution of molecules that are primarily neutral in charge, as well as the distribution coefficient (log ''D'') of all forms of the molecule over a pH range, e.g., between 2 and 12. The method does, however, require the separate determination of the pKa value(s) of the substance.
Electrochemical
Polarized liquid interfaces have been used to examine the thermodynamics and kinetics of the transfer of charged species from one phase to another. Two main methods exist. The first is ITIES
In electrochemistry, ITIES (interface between two immiscible electrolyte solutions) is an electrochemical interface that is either polarisable or polarised. An ITIES is polarisable if one can change the Galvani potential difference, or in other wor ...
, "interfaces between two immiscible electrolyte solutions". The second is droplet experiments. Here a reaction at a triple interface between a conductive solid, droplets of a redox active liquid phase and an electrolyte
An electrolyte is a medium containing ions that is electrically conducting through the movement of those ions, but not conducting electrons. This includes most soluble salts, acids, and bases dissolved in a polar solvent, such as water. Upon dis ...
solution have been used to determine the energy required to transfer a charged species across the interface.
Prediction
There are many situations where prediction of partition coefficients prior to experimental measurement is useful. For example, tens of thousands of industrially manufactured chemicals are in common use, but only a small fraction have undergone rigorous toxicological
Toxicology is a scientific discipline, overlapping with biology, chemistry, pharmacology, and medicine, that involves the study of the adverse effects of chemical substances on living organisms and the practice of diagnosing and treating expos ...
evaluation. Hence there is a need to prioritize the remainder for testing. QSAR equations, which in turn are based on calculated partition coefficients, can be used to provide toxicity estimates. Calculated partition coefficients are also widely used in drug discovery to optimize screening libraries and to predict druglikeness
Druglikeness is a qualitative concept used in drug design for how "druglike" a substance is with respect to factors like bioavailability. It is estimated from the molecular structure before the substance is even synthesized and tested. A druglike m ...
of designed drug candidates before they are synthesized. As discussed in more detail below, estimates of partition coefficients can be made using a variety of methods, including fragment-based, atom-based, and knowledge-based that rely solely on knowledge of the structure of the chemical. Other prediction methods rely on other experimental measurements such as solubility. The methods also differ in accuracy and whether they can be applied to all molecules, or only ones similar to molecules already studied.
Atom-based
Standard approaches of this type, using atomic contributions, have been named by those formulating them with a prefix letter: AlogP, XlogP, MlogP, etc. A conventional method for predicting log ''P'' through this type of method is to parameterize the distribution coefficient contributions of various atoms to the overall molecular partition coefficient, which produces a parametric model
In statistics, a parametric model or parametric family or finite-dimensional model is a particular class of statistical models. Specifically, a parametric model is a family of probability distributions that has a finite number of parameters.
Def ...
. This parametric model can be estimated using constrained least-squares
The method of least squares is a standard approach in regression analysis to approximate the solution of overdetermined systems (sets of equations in which there are more equations than unknowns) by minimizing the sum of the squares of the res ...
estimation, using a training set
In machine learning, a common task is the study and construction of algorithms that can learn from and make predictions on data. Such algorithms function by making data-driven predictions or decisions, through building a mathematical model from ...
of compounds with experimentally measured partition coefficients. In order to get reasonable correlations, the most common elements contained in drugs (hydrogen, carbon, oxygen, sulfur, nitrogen, and halogens) are divided into several different atom types depending on the environment of the atom within the molecule. While this method is generally the least accurate, the advantage is that it is the most general, being able to provide at least a rough estimate for a wide variety of molecules.
Fragment-based
The most common of these uses a group contribution method A group-contribution method in chemistry is a technique to estimate and predict thermodynamic and other properties from molecular structures.
Introduction
In today's chemical processes hundreds of thousands of components are used. The Chemical Ab ...
and is termed cLogP. It has been shown that the log ''P'' of a compound can be determined by the sum of its non-overlapping molecular fragments (defined as one or more atoms covalently bound to each other within the molecule). Fragmentary log ''P'' values have been determined in a statistical method analogous to the atomic methods (least-squares fitting to a training set). In addition, Hammett-type corrections are included to account of electronic and steric effects
Steric effects arise from the spatial arrangement of atoms. When atoms come close together there is a rise in the energy of the molecule. Steric effects are nonbonding interactions that influence the shape ( conformation) and reactivity of ions ...
. This method in general gives better results than atomic-based methods, but cannot be used to predict partition coefficients for molecules containing unusual functional groups for which the method has not yet been parameterized (most likely because of the lack of experimental data for molecules containing such functional groups).
Knowledge-based
A typical data-mining-based prediction uses support-vector machine
In machine learning, support vector machines (SVMs, also support vector networks) are supervised learning models with associated learning algorithms that analyze data for classification and regression analysis. Developed at AT&T Bell Laboratorie ...
s, decision trees
A decision tree is a decision support tool that uses a tree-like model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility. It is one way to display an algorithm that only contains condit ...
, or neural networks
A neural network is a network or circuit of biological neurons, or, in a modern sense, an artificial neural network, composed of artificial neurons or nodes. Thus, a neural network is either a biological neural network, made up of biological ...
. This method is usually very successful for calculating log ''P'' values when used with compounds that have similar chemical structures and known log ''P'' values. Molecule mining
This page describes mining for molecules. Since molecules may be represented by molecular graphs this is strongly related to graph mining and structured data mining. The main problem is how to represent molecules while discriminating the data inst ...
approaches apply a similarity-matrix-based prediction or an automatic fragmentation scheme into molecular substructures. Furthermore, there exist also approaches using maximum common subgraph searches or molecule kernels.
Log ''D'' from log ''P'' and p''K''a
For cases where the molecule is un-ionized:
:
For other cases, estimation of log ''D'' at a given pH, from log ''P'' and the known mole fraction
In chemistry, the mole fraction or molar fraction (''xi'' or ) is defined as unit of the amount of a constituent (expressed in moles), ''ni'', divided by the total amount of all constituents in a mixture (also expressed in moles), ''n''tot. This ex ...
of the un-ionized form, , in the case where partition of ionized forms into non-polar phase can be neglected, can be formulated as
:
The following approximate expressions are valid only for monoprotic acids and bases:
:
Further approximations for when the compound is largely ionized:
* for acids with , ,
* for bases with , .
For prediction of p''K''a, which in turn can be used to estimate log ''D'', Hammett type equations have frequently been applied.
Log ''P'' from log ''S''
If the solubility, ''S'', of an organic compound is known or predicted in both water and 1-octanol, then log ''P'' can be estimated as
:
There are a variety of approaches to predict solubilities, and so log ''S''.
Octanol-water partition coefficient
The partition coefficient between ''n''-Octanol and water
Water (chemical formula ) is an inorganic, transparent, tasteless, odorless, and nearly colorless chemical substance, which is the main constituent of Earth's hydrosphere and the fluids of all known living organisms (in which it acts as a ...
is known as the ''n''-octanol-water partition coefficient , or ''K''ow. It is also frequently referred to by the symbol P, especially in the English literature. It is also known as ''n''-octanol-water partition ratio.
''K''ow, being a type of partition coefficient, serves as a measure of the relationship between lipophilicity
Lipophilicity (from Greek λίπος "fat" and φίλος "friendly"), refers to the ability of a chemical compound to dissolve in fats, oils, lipids, and non-polar solvents such as hexane or toluene. Such non-polar solvents are themselves lipop ...
(fat solubility) and hydrophilicity
A hydrophile is a molecule or other molecular entity that is intermolecular force, attracted to water molecules and tends to be dissolution (chemistry), dissolved by water.Liddell, H.G. & Scott, R. (1940). ''A Greek-English Lexicon'' Oxford: Clar ...
(water solubility) of a substance. The value is greater than one if a substance is more soluble in fat-like solvents such as n-octanol, and less than one if it is more soluble in water.
Example values
Values for log ''K''ow typically range between -3 (very hydrophilic) and +10 (extremely lipophilic/hydrophobic).
The values listed here are sorted by the partition coefficient. Acetamide is hydrophilic, and 2,2′,4,4′,5-Pentachlorobiphenyl is lipophilic.
See also
*
*
**
**
*
*
*
References
Further reading
*
*
*
*
*
*
*
*
*
*
*
*
*
External links
vcclab.org
Overview of the many logP and other physical property calculators available commercially and on-line.
{{Authority control
Physical chemistry
Equilibrium chemistry
Medicinal chemistry
Solvents