Goldbach's comet is the name given to a plot of the function
, the so-called Goldbach function . The function, studied in relation to
Goldbach's conjecture
Goldbach's conjecture is one of the oldest and best-known unsolved problems in number theory and all of mathematics. It states that every even natural number greater than 2 is the sum of two prime numbers.
The conjecture has been shown to hold ...
, is defined for all even integers
to be the number of different ways in which ''E'' can be expressed as the sum of two primes. For example,
since 22 can be expressed as the sum of two primes in three different ways (
).
The coloring of points in the above image is based on the value of
modulo 3 with red points corresponding to 0 mod 3, blue points corresponding to 1 mod 3 and green points corresponding to 2 mod 3. In other words, the red points are multiples of 6, the blue points are multiples of 6 plus 2, and the green points are multiples of 6 plus 4.
Anatomy of the Goldbach Comet
An illuminating way of presenting the comet data is as a
histogram
A histogram is an approximate representation of the distribution of numerical data. The term was first introduced by Karl Pearson. To construct a histogram, the first step is to " bin" (or "bucket") the range of values—that is, divide the ent ...
. The function
can be normalized by dividing by the locally averaged value of ''g'', g
av, taken over perhaps 1000 neighboring values of the even number ''E''. The histogram can then be accumulated over a range of up to about 10% either side of a central ''E''.
Such a histogram appears on the right. A series of well-defined peaks is evident. Each of these peaks can be identified as being formed by a set of values of
which have certain smallest factors. The major peaks correspond to lowest factors of 3, 5, 7 ... as labeled. As the lowest factors become higher the peaks move left and eventually merge to give the lowest value primary peak.
There is in fact a hierarchy of peaks; the main peaks are composed of subsidiary peaks, with a succession of second smallest factors of
. This hierarchy continues until all factors are exhausted.
The magnified section shows the succession of subsidiary peaks in more detail.
The relative location of the peaks follows from the form developed by Hardy and Littlewood:
G. H. Hardy
Godfrey Harold Hardy (7 February 1877 – 1 December 1947) was an English mathematician, known for his achievements in number theory and mathematical analysis. In biology, he is known for the Hardy–Weinberg principle, a basic principle of pop ...
and J. E. Littlewood
John Edensor Littlewood (9 June 1885 – 6 September 1977) was a British mathematician. He worked on topics relating to mathematical analysis, analysis, number theory, and differential equations, and had lengthy collaborations with G. H. H ...
, "Some problems of 'partitio numerorum'; III: on the expression of a number as a sum of primes", Acta Mathematica
''Acta Mathematica'' is a peer-reviewed open-access scientific journal covering research in all fields of mathematics.
According to Cédric Villani, this journal is "considered by many to be the most prestigious of all mathematical research journ ...
, vol. 44, pp. 1-70, 1922.
:
where the product is taken over all primes ''p'' that are factors of
. The factor on the right is
Hardy–Littlewood's twin prime constant
:
Here the product is taken over all primes greater than 2.
Of particular interest is the peak formed by selecting only values of
that are prime. The product factor in equation (1) is then very close to 1. The peak is very close to a Gaussian form (shown in gray). For this range of ''E'' values, the peak location is within 0.03% of the ideal
.
When histograms are formed for different average values of ''E'', the width of this (primes only) peak is found to be proportional to
. However, it is a factor of about 1.85 less than the value
that would be expected from a hypothesis of totally
random
In common usage, randomness is the apparent or actual lack of pattern or predictability in events. A random sequence of events, symbols or steps often has no :wikt:order, order and does not follow an intelligible pattern or combination. Ind ...
occurrence of prime-pair matching. This may be expected, since there are
correlation
In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistics ...
s that give rise to the separated peaks in the total histogram.
Returning to the full range of
rather than just primes, it is seen that other peaks associated with specified lowest factors of
can also be
fitted by a
Gaussian
Carl Friedrich Gauss (1777–1855) is the eponym of all of the topics listed below.
There are over 100 topics all named after this German mathematician and scientist, all in the fields of mathematics, physics, and astronomy. The English eponymo ...
, but only on their lower shoulder. The upper shoulder, being formed by an aggregate of subsidiary peaks, lies above the simple Gaussian form.
The relative heights of the peaks in the total histogram are representative of the populations of various types of
having differing factors. The heights are approximately inversely proportional to
, the products of the lowest factors. Thus the height of the peak marked (3,5) in the overall histogram is about 1/15 of the main peak. Heights may vary from this by about 20%; their exact value is a complex function of the way in which the peaks are constituted from their components and of their varying width.
It is interesting to speculate on the possibility of any number ''E'' having zero prime pairs, taking these Gaussian forms as
probabilities, and assuming it is legitimate to
extrapolate
In mathematics, extrapolation is a type of estimation, beyond the original observation range, of the value of a variable on the basis of its relationship with another variable. It is similar to interpolation, which produces estimates between kn ...
to the zero-pair point. If this is done, the probability of zero pairs for any one ''E'', in the range considered here, is of order 10
−3700. The integrated probability over all ''E'' to infinity, taking into account the narrowing of the peak width, is not much larger. Any search for violation of the Goldbach conjecture may reasonably be expected to have these odds to contend with.
References
{{reflist
Prime numbers