Molecular Descriptor
   HOME

TheInfoList



OR:

Molecular descriptors play a fundamental role in chemistry, pharmaceutical sciences, environmental protection policy, and health researches, as well as in quality control, being the way molecules, thought of as real bodies, are transformed into numbers, allowing some mathematical treatment of the chemical information contained in the molecule. This was defined by Todeschini and Consonni as: "''The molecular descriptor is the final result of a logic and mathematical procedure which transforms chemical information encoded within a symbolic representation of a molecule into a useful number or the result of some standardized experiment.''" By this definition, the molecular descriptors are divided into two main categories: experimental measurements, such as log P,
molar refractivity Molar refractivity,W. Foerst et.al. ''Chemie für Labor und Betrieb'', 1967, ''3'', 32-34. https://organic-btc-ilmenau.jimdo.com/app/download/9062135220/molrefraktion.pdf?t=1616948905 A, is a measure of the total polarizability of a mole of a subs ...
, dipole moment,
polarizability Polarizability usually refers to the tendency of matter, when subjected to an electric field, to acquire an electric dipole moment in proportion to that applied field. It is a property of all matter, considering that matter is made up of elementar ...
, and, in general, additive physico-chemical properties, and theoretical molecular descriptors, which are derived from a symbolic representation of the molecule and can be further classified according to the different types of molecular representation. The main classes of theoretical molecular descriptors are: 1) 0D-descriptors (i.e. constitutional descriptors, count descriptors), 2) 1D-descriptors (i.e. list of structural fragments, fingerprints),3) 2D-descriptors (i.e. graph invariants),4) 3D-descriptors (such as, for example, 3D-MoRSE descriptors, WHIM descriptors, GETAWAY descriptors, quantum-chemical descriptors, size, steric, surface and volume descriptors),5) 4D-descriptors (such as those derived from GRID or CoMFA methods, Volsurf).


Invariance properties of molecular descriptors

The invariance properties of molecular descriptors can be defined as the ability of the algorithm for their calculation to give a descriptor value that is independent of the particular characteristics of the molecular representation, such as atom numbering or labeling, spatial reference frame, molecular conformations, etc. Invariance to molecular numbering or labeling is assumed as a minimal basic requirement for any descriptor. Two other important invariance properties,
translational invariance In geometry, to translate a geometric figure is to move it from one place to another without rotating it. A translation "slides" a thing by . In physics and mathematics, continuous translational symmetry is the invariance of a system of equatio ...
and
rotational invariance In mathematics, a function defined on an inner product space is said to have rotational invariance if its value does not change when arbitrary rotations are applied to its argument. Mathematics Functions For example, the function :f(x,y) = x^2 ...
, are the invariance of a descriptor value to any translation or rotation of the molecules in the chosen reference frame. These last invariance properties are required for the 3D-descriptors.


Degeneracy of molecular descriptors

This property refers to the ability of a descriptor to avoid equal values for different molecules. In this sense, descriptors can show no degeneracy at all, low, intermediate, or high degeneracy. For example, the number of molecule atoms and the molecular weights are high degeneracy descriptors, while, usually, 3D-descriptors show low or no degeneracy at all.


Basic requirements for optimal descriptors

# Should have structural interpretation # Should have good correlation with at least one property # Should preferably discriminate among isomers # Should be possible to apply to local structure # Should be possible to generalize to "higher" descriptors # Should be simple # Should not be based on experimental properties # Should not be trivially related to other descriptors # Should be possible to construct efficiently # Should use familiar structural concepts # Should change gradually with gradual changes in structures # Should have the correct size dependence, if related to the molecule size


Software for molecular descriptors calculation

Here there is a list of a selection of commercial and free descriptor calculation tools. {, class="wikitable" , - ! Name !! Descriptors !! Fingerprints !! CLI !! GUI !! KNIME !! Comments !! License !! Website , - , alvaDesc , , 5666 , , , , , , , , , , Available for Windows, Linux and macOS , , ,
commercial Commercial may refer to: * a dose of advertising conveyed through media (such as - for example - radio or television) ** Radio advertisement ** Television advertisement * (adjective for:) commerce, a system of voluntary exchange of products and s ...
, , https://www.alvascience.com/alvadesc/ , - , Dragon , , 5270 , , , , , , , , , , Discontinued , , ,
commercial Commercial may refer to: * a dose of advertising conveyed through media (such as - for example - radio or television) ** Radio advertisement ** Television advertisement * (adjective for:) commerce, a system of voluntary exchange of products and s ...
, , https://chm.kode-solutions.net/products_dragon.php , - , Mordred , , 1826 , , , , , , , , , , Based on RDKit , ,
open source Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use the source code, design documents, or content of the product. The open-source model is a decentralized sof ...
, , https://github.com/mordred-descriptor , - , PaDEL-descriptorYap, C. W. (2011). PaDEL-descriptor: An open source software to calculate molecular descriptors and fingerprints. Journal of Computational Chemistry. https://doi.org/10.1002/jcc.21707 , , 1875 , , , , , , , , , , Based on CDK , , {{free
open source Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use the source code, design documents, or content of the product. The open-source model is a decentralized sof ...
, , http://www.yapcwsoft.com/dd/padeldescriptor/


See also

*
Mathematical chemistry Mathematical chemistry is the area of research engaged in novel applications of mathematics to chemistry; it concerns itself principally with the mathematical modeling of chemical phenomena. Mathematical chemistry has also sometimes been called co ...
*
Topological index In the fields of chemical graph theory, molecular topology, and mathematical chemistry, a topological index, also known as a connectivity index, is a type of a molecular descriptor that is calculated based on the molecular graph of a chemical co ...
* QSAR *
Applicability domain The applicability domain (AD) (for both chemistry and machine learning) of a QSAR model is the physico-chemical, structural or biological space, knowledge or information on which the training set of the model has been developed, and for which it ...
*
Chemical database A chemical database is a database specifically designed to store chemical information. This information is about chemical and crystal structures, spectra, reactions and syntheses, and thermophysical data. Types of chemical databases Bioactivit ...
*
Docking (molecular) In the field of molecular modeling, docking is a method which predicts the preferred orientation of one molecule to a second when a ligand and a target are bound to each other to form a stable complex. Knowledge of the preferred orientation in ...
* Cahn-Ingold-Prelog priority rule


References


Further reading

* Roberto Todeschini and Viviana Consonni, ''Molecular Descriptors for Chemoinformatics'' (2 volumes), Wiley-VCH, 2009. * Mati Karelson, Molecular Descriptors in QSAR/QSPR, John Wiley & Sons, 2000. * James Devillers and Alexandru T. Balaban (Eds.), ''Topological indices and related descriptors in QSAR and QSPR.'' Taylor & Francis, 2000. * Lemont Kier and Lowell Hall, ''Molecular structure description.'' Academic Press, 1999. * Alexandru T. Balaban (Ed.), ''From chemical topology to three-dimensional geometry.'' Plenum Press, 1997 Cheminformatics Mathematical chemistry