chemistry Chemistry is the science, scientific study of the properties and behavior of matter. It is a natural science that covers the Chemical element, elements that make up matter to the chemical compound, compounds made of atoms, molecules and ions ...

, a chemical formula is a way of presenting information about the chemical proportions of

atom Every atom is composed of a nucleus and one or more electrons bound to the nucleus. The nucleus is made of one or more protons and a number of neutrons. Only the most common variety of hydrogen has no neutrons. Every solid, liquid, gas, and ...

s that constitute a particular

chemical compound A chemical compound is a chemical substance composed of many identical molecules (or molecular entities) containing atoms from more than one chemical element held together by chemical bonds. A molecule consisting of atoms of only one element ...

molecule A molecule is a group of two or more atoms held together by attractive forces known as chemical bonds; depending on context, the term may or may not include ions which satisfy this criterion. In quantum physics, organic chemistry, and bioch ...

, using

chemical element A chemical element is a species of atoms that have a given number of protons in their nuclei, including the pure substance consisting only of that species. Unlike chemical compounds, chemical elements cannot be broken down into simpler sub ...

symbols, numbers, and sometimes also other symbols, such as parentheses, dashes, brackets, commas and ''plus'' (+) and ''minus'' (−) signs. These are limited to a single typographic line of symbols, which may include subscripts and superscripts. A chemical formula is not a

chemical name A chemical nomenclature is a set of rules to generate systematic names for chemical compounds. The nomenclature used most frequently worldwide is the one created and developed by the International Union of Pure and Applied Chemistry (IUPAC). The ...

, and it contains no words. Although a chemical formula may imply certain simple

chemical structure A chemical structure determination includes a chemist's specifying the molecular geometry and, when feasible and necessary, the electronic structure of the target molecule or other solid. Molecular geometry refers to the spatial arrangement of at ...

s, it is not the same as a full chemical

structural formula The structural formula of a chemical compound is a graphic representation of the molecular structure (determined by structural chemistry methods), showing how the atoms are possibly arranged in the real three-dimensional space. The chemical bondi ...

. Chemical formulae can fully specify the structure of only the simplest of molecules and

chemical substance A chemical substance is a form of matter having constant chemical composition and characteristic properties. Some references add that chemical substance cannot be separated into its constituent elements by physical separation methods, i.e., wi ...

s, and are generally more limited in power than chemical names and structural formulae. The simplest types of chemical formulae are called ''

empirical formula In chemistry, the empirical formula of a chemical compound is the simplest whole number ratio of atoms present in a compound. A simple example of this concept is that the empirical formula of sulfur monoxide, or SO, would simply be SO, as is th ...

e'', which use letters and numbers indicating the numerical ''proportions'' of atoms of each type. Molecular formulae indicate the simple numbers of each type of atom in a molecule, with no information on structure. For example, the empirical formula for

glucose Glucose is a simple sugar with the molecular formula . Glucose is overall the most abundant monosaccharide, a subcategory of carbohydrates. Glucose is mainly made by plants and most algae during photosynthesis from water and carbon dioxide, using ...

is (twice as many

hydrogen Hydrogen is the chemical element with the symbol H and atomic number 1. Hydrogen is the lightest element. At standard conditions hydrogen is a gas of diatomic molecules having the formula . It is colorless, odorless, tasteless, non-toxic, an ...

atoms as

carbon Carbon () is a chemical element with the symbol C and atomic number 6. It is nonmetallic and tetravalent In chemistry, the valence (US spelling) or valency (British spelling) of an element is the measure of its combining capacity with o ...

and

oxygen Oxygen is the chemical element with the symbol O and atomic number 8. It is a member of the chalcogen group in the periodic table, a highly reactive nonmetal, and an oxidizing agent that readily forms oxides with most elements as wel ...

), while its molecular formula is (12 hydrogen atoms, six carbon and oxygen atoms). Sometimes a chemical formula is complicated by being written as a

condensed formula The structural formula of a chemical compound is a graphic representation of the molecular structure (determined by structural chemistry methods), showing how the atoms are possibly arranged in the real three-dimensional space. The chemical bond ...

(or condensed molecular formula, occasionally called a "semi-structural formula"), which conveys additional information about the particular ways in which the atoms are chemically bonded together, either in

covalent bond A covalent bond is a chemical bond that involves the sharing of electrons to form electron pairs between atoms. These electron pairs are known as shared pairs or bonding pairs. The stable balance of attractive and repulsive forces between atoms ...

ionic bond Ionic bonding is a type of chemical bonding that involves the electrostatic attraction between oppositely charged ions, or between two atoms with sharply different electronegativities, and is the primary interaction occurring in ionic compounds. ...

s, or various combinations of these types. This is possible if the relevant bonding is easy to show in one dimension. An example is the condensed molecular/chemical formula for

ethanol Ethanol (abbr. EtOH; also called ethyl alcohol, grain alcohol, drinking alcohol, or simply alcohol) is an organic compound. It is an Alcohol (chemistry), alcohol with the chemical formula . Its formula can be also written as or (an ethyl ...

, which is or . However, even a condensed chemical formula is necessarily limited in its ability to show complex bonding relationships between atoms, especially atoms that have bonds to four or more different

substituent A substituent is one or a group of atoms that replaces (one or more) atoms, thereby becoming a moiety in the resultant (new) molecule. (In organic chemistry and biochemistry, the terms ''substituent'' and ''functional group'', as well as ''side ...

s. Since a chemical formula must be expressed as a single line of chemical

element symbol Chemical symbols are the abbreviations used in chemistry for chemical elements, functional groups and chemical compounds. Element symbols for chemical elements normally consist of one or two letters from the Latin alphabet and are written with th ...

s, it often cannot be as informative as a true structural formula, which is a graphical representation of the spatial relationship between atoms in chemical compounds (see for example the figure for butane structural and chemical formulae, at right). For reasons of structural complexity, a single condensed chemical formula (or semi-structural formula) may correspond to different molecules, known as

isomer In chemistry, isomers are molecules or polyatomic ions with identical molecular formulae – that is, same number of atoms of each element – but distinct arrangements of atoms in space. Isomerism is existence or possibility of isomers. Iso ...

s. For example, glucose shares its

molecular formula In chemistry, a chemical formula is a way of presenting information about the chemical proportions of atoms that constitute a particular chemical compound or molecule, using chemical element symbols, numbers, and sometimes also other symbols, ...

with a number of other

sugar Sugar is the generic name for sweet-tasting, soluble carbohydrates, many of which are used in food. Simple sugars, also called monosaccharides, include glucose, fructose, and galactose. Compound sugars, also called disaccharides or double ...

s, including

fructose Fructose, or fruit sugar, is a Ketose, ketonic monosaccharide, simple sugar found in many plants, where it is often bonded to glucose to form the disaccharide sucrose. It is one of the three dietary monosaccharides, along with glucose and galacto ...

galactose Galactose (, '' galacto-'' + '' -ose'', "milk sugar"), sometimes abbreviated Gal, is a monosaccharide sugar that is about as sweet as glucose, and about 65% as sweet as sucrose. It is an aldohexose and a C-4 epimer of glucose. A galactose molec ...

and

mannose Mannose is a sugar monomer of the aldohexose series of carbohydrates. It is a C-2 epimer of glucose. Mannose is important in human metabolism, especially in the glycosylation of certain proteins. Several congenital disorders of glycosylation ...

. Linear equivalent chemical ''names'' exist that can and do specify uniquely any complex structural formula (see

chemical nomenclature A chemical nomenclature is a set of rules to generate systematic names for chemical compounds. The nomenclature used most frequently worldwide is the one created and developed by the International Union of Pure and Applied Chemistry (IUPAC). The ...

), but such names must use many terms (words), rather than the simple element symbols, numbers, and simple typographical symbols that define a chemical formula. Chemical formulae may be used in

chemical equation A chemical equation is the symbolic representation of a chemical reaction in the form of symbols and chemical formulas. The reactant entities are given on the left-hand side and the product entities on the right-hand side with a plus sign between ...

s to describe

chemical reaction A chemical reaction is a process that leads to the IUPAC nomenclature for organic transformations, chemical transformation of one set of chemical substances to another. Classically, chemical reactions encompass changes that only involve the pos ...

s and other chemical transformations, such as the dissolving of ionic compounds into solution. While, as noted, chemical formulae do not have the full power of structural formulae to show chemical relationships between atoms, they are sufficient to keep track of numbers of atoms and numbers of electrical charges in chemical reactions, thus balancing chemical equations so that these equations can be used in chemical problems involving conservation of atoms, and conservation of electric charge.

Overview

A chemical formula identifies each constituent element by its

chemical symbol Chemical symbols are the abbreviations used in chemistry for chemical elements, functional groups and chemical compounds. Element symbols for chemical elements normally consist of one or two letters from the Latin alphabet and are written with th ...

and indicates the proportionate number of atoms of each element. In empirical formulae, these proportions begin with a key element and then assign numbers of atoms of the other elements in the compound, by ratios to the key element. For molecular compounds, these ratio numbers can all be expressed as whole numbers. For example, the empirical formula of

may be written because the molecules of ethanol all contain two carbon atoms, six hydrogen atoms, and one oxygen atom. Some types of ionic compounds, however, cannot be written with entirely whole-number empirical formulae. An example is boron carbide, whose formula of is a variable non-whole number ratio with n ranging from over 4 to more than 6.5. When the chemical compound of the formula consists of simple

s, chemical formulae often employ ways to suggest the structure of the molecule. These types of formulae are variously known as ''molecular formulae'' and '' condensed formulae''. A molecular formula enumerates the number of atoms to reflect those in the molecule, so that the molecular formula for

is rather than the glucose empirical formula, which is . However, except for very simple substances, molecular chemical formulae lack needed structural information, and are ambiguous. For simple molecules, a condensed (or semi-structural) formula is a type of chemical formula that may fully imply a correct structural formula. For example, ethanol may be represented by the condensed chemical formula , and

dimethyl ether Dimethyl ether (DME; also known as methoxymethane) is the organic compound with the formula CH3OCH3, (sometimes ambiguously simplified to C2H6O as it is an isomer of ethanol). The simplest ether, it is a colorless gas that is a useful precursor ...

by the condensed formula . These two molecules have the same empirical and molecular formulae (), but may be differentiated by the condensed formulae shown, which are sufficient to represent the full structure of these simple

organic compound In chemistry, organic compounds are generally any chemical compounds that contain carbon-hydrogen or carbon-carbon bonds. Due to carbon's ability to catenate (form chains with other carbon atoms), millions of organic compounds are known. The ...

s. Condensed chemical formulae may also be used to represent ionic compounds that do not exist as discrete molecules, but nonetheless do contain covalently bound clusters within them. These

polyatomic ion A polyatomic ion, also known as a molecular ion, is a covalent bonded set of two or more atoms, or of a metal complex, that can be considered to behave as a single unit and that has a net charge that is not zero. The term molecule may or may no ...

s are groups of atoms that are covalently bound together and have an overall ionic charge, such as the

sulfate The sulfate or sulphate ion is a polyatomic anion with the empirical formula . Salts, acid derivatives, and peroxides of sulfate are widely used in industry. Sulfates occur widely in everyday life. Sulfates are salts of sulfuric acid and many ar ...

ion. Each polyatomic ion in a compound is written individually in order to illustrate the separate groupings. For example, the compound

dichlorine hexoxide Dichlorine hexoxide is the chemical compound with the molecular formula , which is correct for its gaseous state. However, in liquid or solid form, this chlorine oxide ionizes into the dark red ionic compound chloryl perchlorate , which may be tho ...

has an empirical formula , and molecular formula , but in liquid or solid forms, this compound is more correctly shown by an ionic condensed formula , which illustrates that this compound consists of ions and ions. In such cases, the condensed formula only need be complex enough to show at least one of each ionic species. Chemical formulae as described here are distinct from the far more complex chemical systematic names that are used in various systems of

. For example, one systematic name for glucose is (2''R'',3''S'',4''R'',5''R'')-2,3,4,5,6-pentahydroxyhexanal. This name, interpreted by the rules behind it, fully specifies glucose's structural formula, but the name is not a chemical formula as usually understood, and uses terms and words not used in chemical formulae. Such names, unlike basic formulae, may be able to represent full structural formulae without graphs.

Types

Empirical formula

, the

of a chemical is a simple expression of the relative number of each type of atom or ratio of the elements in the compound. Empirical formulae are the standard for ionic compounds, such as , and for macromolecules, such as . An empirical formula makes no reference to

ism, structure, or absolute number of atoms. The term ''empirical'' refers to the process of

elemental analysis Elemental analysis is a process where a sample of some material (e.g., soil, waste or drinking water, bodily fluids, minerals, chemical compounds) is analyzed for its elemental and sometimes isotopic composition. Elemental analysis can be qualita ...

, a technique of

analytical chemistry Analytical chemistry studies and uses instruments and methods to separate, identify, and quantify matter. In practice, separation, identification or quantification may constitute the entire analysis or be combined with another method. Separati ...

used to determine the relative percent composition of a pure chemical substance by element. For example,

hexane Hexane () is an organic compound, a straight-chain alkane with six carbon atoms and has the molecular formula C6H14. It is a colorless liquid, odorless when pure, and with boiling points approximately . It is widely used as a cheap, relatively ...

has a molecular formula of , and (for one of its isomers, n-hexane) a structural formula , implying that it has a chain structure of 6

atoms, and 14

atoms. However, the empirical formula for hexane is . Likewise the empirical formula for

hydrogen peroxide Hydrogen peroxide is a chemical compound with the formula . In its pure form, it is a very pale blue liquid that is slightly more viscous than water. It is used as an oxidizer, bleaching agent, and antiseptic, usually as a dilute solution (3%� ...

, , is simply , expressing the 1:1 ratio of component elements.

Formaldehyde Formaldehyde ( , ) (systematic name methanal) is a naturally occurring organic compound with the formula and structure . The pure compound is a pungent, colourless gas that polymerises spontaneously into paraformaldehyde (refer to section F ...

and

acetic acid Acetic acid , systematically named ethanoic acid , is an acidic, colourless liquid and organic compound with the chemical formula (also written as , , or ). Vinegar is at least 4% acetic acid by volume, making acetic acid the main component ...

have the same empirical formula, . This is the actual chemical formula for formaldehyde, but acetic acid has double the number of atoms.

Molecular formula

Molecular formulae indicate the simple numbers of each type of atom in a molecule of a molecular substance. They are the same as empirical formulae for molecules that only have one atom of a particular type, but otherwise may have larger numbers. An example of the difference is the empirical formula for glucose, which is (''ratio'' 1:2:1), while its molecular formula is (''number of atoms'' 6:12:6). For water, both formulae are . A molecular formula provides more information about a molecule than its empirical formula, but is more difficult to establish. A molecular formula shows the number of elements in a molecule, and determines whether it is a

binary compound In materials chemistry, a binary phase or binary compound is a chemical compound containing two different elements. Some binary phase compounds are molecular, e.g. carbon tetrachloride (CCl4). More typically binary phase refers to extended soli ...

ternary compound In inorganic chemistry and materials chemistry, a ternary compound or ternary phase is a chemical compound containing three different elements. While some ternary compounds are molecular, ''e.g.'' chloroform (), more typically ternary phases r ...

quaternary compound In chemistry, a quaternary compound is a compound consisting of exactly four chemical elements. In another use of the term in organic chemistry, a quaternary compound is or has a cation consisting of a central positively charged atom with four ...

, or has even more elements.

Structural formula

In addition to quantitative description of a molecule, a structural formula captures how the atoms are organized, and shows (or implies) the

chemical bond A chemical bond is a lasting attraction between atoms or ions that enables the formation of molecules and crystals. The bond may result from the electrostatic force between oppositely charged ions as in ionic bonds, or through the sharing of ...

s between the atoms. There are multiple types of structural formulas focused on different aspects of the molecular structure. The two diagrams show two molecules which are

structural isomer In chemistry, a structural isomer (or constitutional isomer in the IUPAC nomenclature) of a chemical compound, compound is another compound whose molecule has the same number of atoms of each element, but with logically distinct chemical bond, b ...

s of each other, since they both have the same molecular formula , but they have different structural formulas as shown.

Condensed formula

The

connectivity Connectivity may refer to: Computing and technology * Connectivity (media), the ability of the social media to accumulate economic capital from the users connections and activities * Internet connectivity, the means by which individual terminal ...

of a molecule often has a strong influence on its physical and chemical properties and behavior. Two molecules composed of the same numbers of the same types of atoms (i.e. a pair of

s) might have completely different chemical and/or physical properties if the atoms are connected differently or in different positions. In such cases, a

is useful, as it illustrates which atoms are bonded to which other ones. From the connectivity, it is often possible to deduce the approximate shape of the molecule. A condensed (or semi-structural) formula may represent the types and spatial arrangement of bonds in a simple chemical substance, though it does not necessarily specify

s or complex structures. For example,

ethane Ethane ( , ) is an organic chemical compound with chemical formula . At standard temperature and pressure, ethane is a colorless, odorless gas. Like many hydrocarbons, ethane is isolated on an industrial scale from natural gas and as a petr ...

consists of two carbon atoms single-bonded to each other, with each carbon atom having three hydrogen atoms bonded to it. Its chemical formula can be rendered as . In

ethylene Ethylene (IUPAC name: ethene) is a hydrocarbon which has the formula or . It is a colourless, flammable gas with a faint "sweet and musky" odour when pure. It is the simplest alkene (a hydrocarbon with carbon-carbon double bonds). Ethylene i ...

there is a double bond between the carbon atoms (and thus each carbon only has two hydrogens), therefore the chemical formula may be written: , and the fact that there is a double bond between the carbons is implicit because carbon has a valence of four. However, a more explicit method is to write or less commonly . The two lines (or two pairs of dots) indicate that a

double bond In chemistry, a double bond is a covalent bond between two atoms involving four bonding electrons as opposed to two in a single bond. Double bonds occur most commonly between two carbon atoms, for example in alkenes. Many double bonds exist betwee ...

connects the atoms on either side of them. A

triple bond A triple bond in chemistry is a chemical bond between two atoms involving six bonding electrons instead of the usual two in a covalent single bond. Triple bonds are stronger than the equivalent single bonds or double bonds, with a bond order o ...

may be expressed with three lines () or three pairs of dots (), and if there may be ambiguity, a single line or pair of dots may be used to indicate a single bond. Molecules with multiple

functional group In organic chemistry, a functional group is a substituent or moiety in a molecule that causes the molecule's characteristic chemical reactions. The same functional group will undergo the same or similar chemical reactions regardless of the rest ...

s that are the same may be expressed by enclosing the repeated group in round brackets. For example,

isobutane Isobutane, also known as ''i''-butane, 2-methylpropane or methylpropane, is a chemical compound with molecular formula HC(CH3)3. It is an isomer of butane. Isobutane is a colourless, odourless gas. It is the simplest alkane with a tertiary carbon a ...

may be written . This condensed structural formula implies a different connectivity from other molecules that can be formed using the same atoms in the same proportions (

s). The formula implies a central carbon atom connected to one hydrogen atom and three

methyl group In organic chemistry, a methyl group is an alkyl derived from methane, containing one carbon atom bonded to three hydrogen atoms, having chemical formula . In formulas, the group is often abbreviated as Me. This hydrocarbon group occurs in many ...

s (). The same number of atoms of each element (10 hydrogens and 4 carbons, or ) may be used to make a straight chain molecule, ''n''-

butane Butane () or ''n''-butane is an alkane with the formula C4H10. Butane is a gas at room temperature and atmospheric pressure. Butane is a highly flammable, colorless, easily liquefied gas that quickly vaporizes at room temperature. The name but ...

: .

Law of composition

In any given chemical compound, the elements always combine in the same proportion with each other. This is the

law of constant composition In chemistry, the law of definite proportions, sometimes called Proust's law, or law of constant composition states that a given chemical compound always contains its component elements in fixed ratio (by mass) and does not depend on its source an ...

. The law of constant composition says that, in any particular chemical compound, all samples of that compound will be made up of the same elements in the same proportion or ratio. For example, any water molecule is always made up of two hydrogen atoms and one oxygen atom in a 2:1 ratio. If we look at the relative masses of oxygen and hydrogen in a water molecule, we see that 94% of the mass of a water molecule is accounted for by oxygen and the remaining 6% is the mass of hydrogen. This mass proportion will be the same for any water molecule.

Chemical names in answer to limitations of chemical formulae

The alkene called but-2-ene has two isomers, which the chemical formula does not identify. The relative position of the two methyl groups must be indicated by additional notation denoting whether the methyl groups are on the same side of the double bond (''cis'' or ''Z'') or on the opposite sides from each other (''trans'' or ''E''). As noted above, in order to represent the full structural formulae of many complex organic and inorganic compounds,

may be needed which goes well beyond the available resources used above in simple condensed formulae. See

IUPAC nomenclature of organic chemistry In chemical nomenclature, the IUPAC nomenclature of organic chemistry is a method of naming organic chemical compounds as recommended by the International Union of Pure and Applied Chemistry (IUPAC). It is published in the ''Nomenclature of Or ...

and

IUPAC nomenclature of inorganic chemistry 2005 Nomenclature of Inorganic Chemistry, IUPAC Recommendations 2005 is the 2005 version of '' Nomenclature of Inorganic Chemistry'' (which is informally called the Red Book). It is a collection of rules for naming inorganic compounds, as recommended by ...

for examples. In addition, linear naming systems such as

International Chemical Identifier The International Chemical Identifier (InChI or ) is a textual identifier for chemical substances, designed to provide a standard way to encode molecular information and to facilitate the search for such information in databases and on the we ...

(InChI) allow a computer to construct a structural formula, and

simplified molecular-input line-entry system The simplified molecular-input line-entry system (SMILES) is a specification in the form of a line notation for describing the structure of chemical species using short ASCII strings. SMILES strings can be imported by most molecule editors f ...

(SMILES) allows a more human-readable ASCII input. However, all these nomenclature systems go beyond the standards of chemical formulae, and technically are chemical naming systems, not formula systems.

Polymers in condensed formulae

For

polymer A polymer (; Greek '' poly-'', "many" + ''-mer'', "part") is a substance or material consisting of very large molecules called macromolecules, composed of many repeating subunits. Due to their broad spectrum of properties, both synthetic a ...

s in condensed chemical formulae, parentheses are placed around the repeating unit. For example, a

hydrocarbon In organic chemistry, a hydrocarbon is an organic compound consisting entirely of hydrogen and carbon. Hydrocarbons are examples of group 14 hydrides. Hydrocarbons are generally colourless and hydrophobic, and their odors are usually weak or ex ...

molecule that is described as , is a molecule with fifty repeating units. If the number of repeating units is unknown or variable, the letter ''n'' may be used to indicate this formula: .

Ions in condensed formulae

For

ion An ion () is an atom or molecule with a net electrical charge. The charge of an electron is considered to be negative by convention and this charge is equal and opposite to the charge of a proton, which is considered to be positive by conve ...

s, the charge on a particular atom may be denoted with a right-hand superscript. For example, , or . The total charge on a charged molecule or a

may also be shown in this way, such as for

hydronium In chemistry, hydronium (hydroxonium in traditional British English) is the common name for the aqueous cation , the type of oxonium ion produced by protonation of water. It is often viewed as the positive ion present when an Arrhenius acid is d ...

, , or

, . Note that + and - are used in place of +1 and -1, respectively. For more complex ions, brackets are often used to enclose the ionic formula, as in , which is found in compounds such as caesium dodecaborate, . Parentheses ( ) can be nested inside brackets to indicate a repeating unit, as in

Hexamminecobalt(III) chloride Hexaamminecobalt(III) chloride is the chemical compound with the formula o(NH3)6l3. It is the chloride salt of the coordination complex o(NH3)6sup>3+, which is considered an archetypal "Werner complex", named after the pioneer of coordination c ...

, . Here, indicates that the ion contains six

ammine group In coordination chemistry, metal ammine complexes are metal complexes containing at least one ammonia () ligand. "Ammine" is spelled this way due to historical reasons; in contrast, alkyl or aryl bearing ligands are spelt with a single "m". Almost ...

s () bonded to

cobalt Cobalt is a chemical element with the symbol Co and atomic number 27. As with nickel, cobalt is found in the Earth's crust only in a chemically combined form, save for small deposits found in alloys of natural meteoric iron. The free element, pr ...

, and encloses the entire formula of the ion with charge +3. This is strictly optional; a chemical formula is valid with or without ionization information, and Hexamminecobalt(III) chloride may be written as or . Brackets, like parentheses, behave in chemistry as they do in mathematics, grouping terms together they are not specifically employed only for ionization states. In the latter case here, the parentheses indicate 6 groups all of the same shape, bonded to another group of size 1 (the cobalt atom), and then the entire bundle, as a group, is bonded to 3 chlorine atoms. In the former case, it is clearer that the bond connecting the chlorines is ionic, rather than

covalent A covalent bond is a chemical bond that involves the sharing of electrons to form electron pairs between atoms. These electron pairs are known as shared pairs or bonding pairs. The stable balance of attractive and repulsive forces between atoms ...

Isotopes

Although

isotope Isotopes are two or more types of atoms that have the same atomic number (number of protons in their nuclei) and position in the periodic table (and hence belong to the same chemical element), and that differ in nucleon numbers (mass numbers) ...

s are more relevant to

nuclear chemistry Nuclear chemistry is the sub-field of chemistry dealing with radioactivity, nuclear processes, and transformations in the nuclei of atoms, such as nuclear transmutation and nuclear properties. It is the chemistry of radioactive elements such as t ...

stable isotope The term stable isotope has a meaning similar to stable nuclide, but is preferably used when speaking of nuclides of a specific element. Hence, the plural form stable isotopes usually refers to isotopes of the same element. The relative abundanc ...

chemistry than to conventional chemistry, different isotopes may be indicated with a prefixed superscript in a chemical formula. For example, the phosphate ion containing radioactive phosphorus-32 is . Also a study involving stable isotope ratios might include the molecule . A left-hand subscript is sometimes used redundantly to indicate the

atomic number The atomic number or nuclear charge number (symbol ''Z'') of a chemical element is the charge number of an atomic nucleus. For ordinary nuclei, this is equal to the proton number (''n''p) or the number of protons found in the nucleus of every ...

. For example, for dioxygen, and for the most abundant isotopic species of dioxygen. This is convenient when writing equations for

nuclear reaction In nuclear physics and nuclear chemistry, a nuclear reaction is a process in which two nuclei, or a nucleus and an external subatomic particle, collide to produce one or more new nuclides. Thus, a nuclear reaction must cause a transformatio ...

s, in order to show the balance of charge more clearly.

Trapped atoms

The @ symbol (

at sign The at sign, , is normally read aloud as "at"; it is also commonly called the at symbol, commercial at, or address sign. It is used as an accounting and invoice abbreviation meaning "at a rate of" (e.g. 7 widgets @ £2 per widget = £14), but ...

) indicates an atom or molecule trapped inside a cage but not chemically bound to it. For example, a

buckminsterfullerene Buckminsterfullerene is a type of fullerene with the formula C60. It has a cage-like fused-ring structure (truncated icosahedron) made of twenty hexagons and twelve pentagons, and resembles a soccer ball. Each of its 60 carbon atoms is bonded ...

() with an atom (M) would simply be represented as regardless of whether M was inside the fullerene without chemical bonding or outside, bound to one of the carbon atoms. Using the @ symbol, this would be denoted if M was inside the carbon network. A non-fullerene example is , an ion in which one

arsenic Arsenic is a chemical element with the symbol As and atomic number 33. Arsenic occurs in many minerals, usually in combination with sulfur and metals, but also as a pure elemental crystal. Arsenic is a metalloid. It has various allotropes, but ...

(As) atom is trapped in a cage formed by the other 32 atoms. This notation was proposed in 1991 with the discovery of

fullerene A fullerene is an allotrope of carbon whose molecule consists of carbon atoms connected by single and double bonds so as to form a closed or partially closed mesh, with fused rings of five to seven atoms. The molecule may be a hollow sphere, ...

cages (

endohedral fullerene Endohedral fullerenes, also called endofullerenes, are fullerenes that have additional atoms, ions, or clusters enclosed within their inner spheres. The first lanthanum C60 complex called La@C60 was synthesized in 1985. The @ (at sign) in the ...

s), which can trap atoms such as La to form, for example, or . The choice of the symbol has been explained by the authors as being concise, readily printed and transmitted electronically (the at sign is included in

ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of ...

, which most modern character encoding schemes are based on), and the visual aspects suggesting the structure of an endohedral fullerene.

Non-stoichiometric chemical formulae

Chemical formulae most often use

integer An integer is the number zero (), a positive natural number (, , , etc.) or a negative integer with a minus sign (−1, −2, −3, etc.). The negative numbers are the additive inverses of the corresponding positive numbers. In the language ...

s for each element. However, there is a class of compounds, called

non-stoichiometric compound In chemistry, non-stoichiometric compounds are chemical compounds, almost always solid inorganic compounds, having elemental composition whose proportions cannot be represented by a ratio of small natural numbers (i.e. an empirical formula); m ...

s, that cannot be represented by small integers. Such a formula might be written using

decimal fraction The decimal numeral system (also called the base-ten positional numeral system and denary or decanary) is the standard system for denoting integer and non-integer numbers. It is the extension to non-integer numbers of the Hindu–Arabic num ...

s, as in , or it might include a variable part represented by a letter, as in , where ''x'' is normally much less than 1.

General forms for organic compounds

A chemical formula used for a series of compounds that differ from each other by a constant unit is called a ''general formula''. It generates a

homologous series In organic chemistry, a homologous series is a sequence of compounds with the same functional group and similar chemical properties in which the members of the series can be branched or unbranched, or differ by molecular formula of and molecu ...

of chemical formulae. For example,

alcohols In chemistry, an alcohol is a type of organic compound that carries at least one hydroxyl () functional group bound to a saturated carbon atom. The term ''alcohol'' originally referred to the primary alcohol ethanol (ethyl alcohol), which is ...

may be represented by the formula (''n'' ≥ 1), giving the homologs

methanol Methanol (also called methyl alcohol and wood spirit, amongst other names) is an organic chemical and the simplest aliphatic alcohol, with the formula C H3 O H (a methyl group linked to a hydroxyl group, often abbreviated as MeOH). It is a ...

propanol There are two isomers of propanol. * 1-Propanol, ''n''-propanol, or propan-1-ol : CH3CH2CH2OH, the most common meaning *2-Propanol, Isopropyl alcohol, isopropanol, or propan-2-ol : (CH3)2CHOH See also * Propanal (propionaldehyde) differs in sp ...

for 1 ≤ ''n'' ≤ 3.

Hill system

The Hill system (or Hill notation) is a system of writing empirical chemical formulae, molecular chemical formulae and components of a condensed formula such that the number of

s in a

is indicated first, the number of

atoms next, and then the number of all other

s subsequently, in

alphabetical order Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is t ...

of the

chemical symbols A chemical substance is a form of matter having constant chemical composition and characteristic properties. Some references add that chemical substance cannot be separated into its constituent elements by physical separation methods, i.e., wit ...

. When the formula contains no carbon, all the elements, including hydrogen, are listed alphabetically. By sorting formulae according to the number of atoms of each element present in the formula according to these rules, with differences in earlier elements or numbers being treated as more significant than differences in any later element or number—like sorting text strings into

lexicographical order In mathematics, the lexicographic or lexicographical order (also known as lexical order, or dictionary order) is a generalization of the alphabetical order of the dictionaries to sequences of ordered symbols or, more generally, of elements of a ...

—it is possible to collate chemical formulae into what is known as Hill system order. The Hill system was first published by Edwin A. Hill of the

United States Patent and Trademark Office The United States Patent and Trademark Office (USPTO) is an agency in the U.S. Department of Commerce that serves as the national patent office and trademark registration authority for the United States. The USPTO's headquarters are in Alexa ...

in 1900. It is the most commonly used system in chemical databases and printed indexes to sort lists of compounds.Wiggins, Gary. (1991). ''Chemical Information Sources.'' New York: McGraw Hill. p. 120. A list of formulae in Hill system order is arranged alphabetically, as above, with single-letter elements coming before two-letter symbols when the symbols begin with the same letter (so "B" comes before "Be", which comes before "Br"). The following example formulae are written using the Hill system, and listed in Hill order: * BrI * BrClH₂Si * CCl₄ * CH₃I * C₂H₅Br * H₂O₄S

Notes

References

External links

*
Hill notation example
from the University of Massachusetts Lowell libraries, including how to sort into Hill system order
Molecular formula calculation applying Hill notation
The library calculating Hill notation i
available on npm
{{DEFAULTSORT:Chemical Formula Chemical nomenclature Notation