Transcription (music)
   HOME

TheInfoList



OR:

In
music Music is the arrangement of sound to create some combination of Musical form, form, harmony, melody, rhythm, or otherwise Musical expression, expressive content. Music is generally agreed to be a cultural universal that is present in all hum ...
, transcription is the practice of notating a piece or a sound which was previously unnotated and/or unpopular as a written music, for example, a
jazz improvisation Jazz improvisation is the spontaneous invention of melodic solo lines or accompaniment parts in a performance of jazz music. It is one of the defining elements of jazz. Improvisation is composing on the spot, when a singer or instrumentalist inv ...
or a
video game soundtrack Video game music (VGM) is the soundtrack that accompanies video games. Early video game music was once limited to sounds of early sound chips, such as programmable sound generators (PSG) or FM synthesis chips. These limitations have led to the s ...
. When a musician is tasked with creating
sheet music Sheet music is a handwritten or printed form of musical notation that uses musical symbols to indicate the pitches, rhythms, or chords of a song or instrumental musical piece. Like its analogs – printed Book, books or Pamphlet, pamphlets ...
from a recording and they write down the notes that make up the piece in
music notation Musical notation is any system used to visually represent music. Systems of notation generally represent the elements of a piece of music that are considered important for its performance in the context of a given musical tradition. The proces ...
, it is said that they created a ''musical transcription'' of that recording. Transcription may also mean rewriting a piece of music, either solo or ensemble, for another instrument or other instruments than which it was originally intended. The Beethoven Symphonies transcribed for solo piano by
Franz Liszt Franz Liszt (22 October 1811 – 31 July 1886) was a Hungarian composer, virtuoso pianist, conductor and teacher of the Romantic music, Romantic period. With a diverse List of compositions by Franz Liszt, body of work spanning more than six ...
are an example. Transcription in this sense is sometimes called ''
arrangement In music, an arrangement is a musical adaptation of an existing composition. Differences from the original composition may include reharmonization, melodic paraphrasing, orchestration, or formal development. Arranging differs from orchestr ...
'', although strictly speaking transcriptions are faithful adaptations, whereas arrangements change significant aspects of the original piece. Further examples of music transcription include
ethnomusicological Ethnomusicology is the multidisciplinary study of music in its cultural context. The discipline investigates social, cognitive, biological, comparative, and other dimensions. Ethnomusicologists study music as a reflection of culture and investiga ...
notation of
oral tradition Oral tradition, or oral lore, is a form of human communication in which knowledge, art, ideas and culture are received, preserved, and transmitted orally from one generation to another.Jan Vansina, Vansina, Jan: ''Oral Tradition as History'' (19 ...
s of folk music, such as
Béla Bartók Béla Viktor János Bartók (; ; 25 March 1881 – 26 September 1945) was a Hungarian composer, pianist and ethnomusicologist. He is considered one of the most important composers of the 20th century; he and Franz Liszt are regarded as Hunga ...
's and
Ralph Vaughan Williams Ralph Vaughan Williams ( ; 12 October 1872– 26 August 1958) was an English composer. His works include operas, ballets, chamber music, secular and religious vocal pieces and orchestral compositions including nine symphonies, written over ...
' collections of the national folk music of
Hungary Hungary is a landlocked country in Central Europe. Spanning much of the Pannonian Basin, Carpathian Basin, it is bordered by Slovakia to the north, Ukraine to the northeast, Romania to the east and southeast, Serbia to the south, Croatia and ...
and
England England is a Countries of the United Kingdom, country that is part of the United Kingdom. It is located on the island of Great Britain, of which it covers about 62%, and List of islands of England, more than 100 smaller adjacent islands. It ...
respectively. The French
composer A composer is a person who writes music. The term is especially used to indicate composers of Western classical music, or those who are composers by occupation. Many composers are, or were, also skilled performers of music. Etymology and def ...
Olivier Messiaen Olivier Eugène Prosper Charles Messiaen (, ; ; 10 December 1908 – 27 April 1992) was a French composer, organist, and ornithology, ornithologist. One of the major composers of the 20th-century classical music, 20th century, he was also an ou ...
transcribed
birdsong Bird vocalization includes both bird calls and bird songs. In non-technical use, bird songs (often simply ''birdsong'') are the bird sounds that are melodious to the human ear. In ornithology and birding, songs (relatively complex vocalization ...
in the wild, and incorporated it into many of his compositions, for example his ''
Catalogue d'oiseaux ''Catalogue d'oiseaux'' ("Catalogue of birds") is a work for piano solo by Olivier Messiaen consisting of thirteen pieces, written between October 1956 and September 1958. It is devoted to birds and dedicated to his second wife Yvonne Loriod. Pr ...
'' for solo piano. Transcription of this nature involves scale degree recognition and harmonic analysis, both of which the transcriber will need
relative Relative may refer to: General use *Kinship and family, the principle binding the most basic social units of society. If two people are connected by circumstances of birth, they are said to be ''relatives''. Philosophy *Relativism, the concept t ...
or
perfect pitch Perfect commonly refers to: * Perfection; completeness, and excellence * Perfect (grammar), a grammatical category in some languages Perfect may also refer to: Film and television * ''Perfect'' (1985 film), a romantic drama * ''Perfect'' (20 ...
to perform. In popular music and rock, there are two forms of transcription. Individual performers copy a note-for-note guitar solo or other melodic line. As well, music publishers transcribe entire recordings of guitar solos and bass lines and sell the sheet music in bound books. Music publishers also publish PVG (piano/vocal/guitar) transcriptions of popular music, where the melody line is transcribed, and then the accompaniment on the recording is arranged as a piano part. The guitar aspect of the PVG label is achieved through guitar chords written above the melody. Lyrics are also included below the melody.


Adaptation

Some composers have rendered homage to other composers by creating "identical" versions of the earlier composers' pieces while adding their own creativity through the use of completely new sounds arising from the difference in instrumentation. The most widely known example of this is
Ravel Joseph Maurice Ravel (7 March 1875 – 28 December 1937) was a French composer, pianist and conductor. He is often associated with Impressionism in music, Impressionism along with his elder contemporary Claude Debussy, although both composer ...
's arrangement for orchestra of
Mussorgsky Modest Petrovich Mussorgsky (; ; ; – ) was a Russian composer, one of the group known as "The Five (composers), The Five." He was an innovator of Music of Russia, Russian music in the Romantic music, Romantic period and strove to achieve a ...
's piano piece ''
Pictures at an Exhibition ''Pictures at an Exhibition'' is a piano suite in ten movements, plus a recurring and varied Promenade theme, written in 1874 by Russian composer Modest Mussorgsky. It is a musical depiction of a tour of an exhibition of works by architect and ...
''.
Webern Anton Webern (; 3 December 1883 – 15 September 1945) was an Austrian composer, conductor, and musicologist. His music was among the most radical of its milieu in its lyric poetry, lyrical, poetic concision and use of then novel atonality, aton ...
used his transcription for orchestra of the six-part
ricercar A ricercar ( , ) or ricercare ( , ) is a type of late Renaissance and mostly early Baroque instrumental composition. The term ''ricercar'' derives from the Italian verb , which means "to search out; to seek"; many ricercars serve a preludial func ...
from
Bach Johann Sebastian Bach (German: joːhan zeˈbasti̯an baχ ( – 28 July 1750) was a German composer and musician of the late Baroque period. He is known for his prolific output across a variety of instruments and forms, including the or ...
's ''
The Musical Offering ''The Musical Offering'' (German: or ), BWV 1079, is a collection of keyboard canons and fugues and other pieces of music by Johann Sebastian Bach, all based on a single musical theme given to him by Frederick the Great (King Frederick II of Prus ...
'' to analyze the structure of the Bach piece, by using different instruments to play different subordinate motifs of Bach's themes and melodies. In transcription of this form, the new piece can simultaneously imitate the original sounds while recomposing them with all the technical skills of an expert composer in such a way that it seems that the piece was originally written for the new medium. But some transcriptions and arrangements have been done for purely pragmatic or contextual reasons. For example, in
Mozart Wolfgang Amadeus Mozart (27 January 1756 – 5 December 1791) was a prolific and influential composer of the Classical period (music), Classical period. Despite his short life, his rapid pace of composition and proficiency from an early age ...
's time, the overtures and songs from his popular operas were transcribed for small wind ensemble simply because such ensembles were common ways of providing popular entertainment in public places. Mozart himself did this in his opera ''
Don Giovanni ''Don Giovanni'' (; K. 527; full title: , literally ''The Rake Punished, or Don Giovanni'') is an opera in two acts with music by Wolfgang Amadeus Mozart to an Italian libretto by Lorenzo Da Ponte. Its subject is a centuries-old Spanish legen ...
'', transcribing for small wind ensemble several arias from other operas, including one from his own opera ''
The Marriage of Figaro ''The Marriage of Figaro'' (, ), K. 492, is a ''commedia per musica'' (opera buffa) in four acts composed in 1786 by Wolfgang Amadeus Mozart, with an Italian libretto written by Lorenzo Da Ponte. It premiered at the Burgtheater in Vienn ...
''. A more contemporary example is
Stravinsky Igor Fyodorovich Stravinsky ( – 6 April 1971) was a Russian composer and conductor with French citizenship (from 1934) and American citizenship (from 1945). He is widely considered one of the most important and influential composers of ...
´s transcription for four hands piano of ''
The Rite of Spring ''The Rite of Spring'' () is a ballet and orchestral concert work by the Russian composer Igor Stravinsky. It was written for the 1913 Paris season of Sergei Diaghilev's Ballets Russes company; the original choreography was by Vaslav Nijinsky ...
'', to be used on the ballet's rehearsals. Today musicians who play in cafes or restaurants will sometimes play transcriptions or arrangements of pieces written for a larger group of instruments. Other examples of this type of transcription include
Bach Johann Sebastian Bach (German: joːhan zeˈbasti̯an baχ ( – 28 July 1750) was a German composer and musician of the late Baroque period. He is known for his prolific output across a variety of instruments and forms, including the or ...
's arrangement of
Vivaldi Antonio Lucio Vivaldi (4 March 1678 – 28 July 1741) was an Italian composer, virtuoso violinist, impresario of Baroque music and Roman Catholic priest. Regarded as one of the greatest Baroque composers, Vivaldi's influence during his lif ...
's four-violin concerti for four keyboard instruments and orchestra; Mozart's arrangement of some Bach
fugue In classical music, a fugue (, from Latin ''fuga'', meaning "flight" or "escape""Fugue, ''n''." ''The Concise Oxford English Dictionary'', eleventh edition, revised, ed. Catherine Soanes and Angus Stevenson (Oxford and New York: Oxford Universit ...
s from ''
The Well-Tempered Clavier ''The Well-Tempered Clavier'', BWV 846–893, consists of two sets of preludes and fugues in all 24 major and minor keys for keyboard by Johann Sebastian Bach. In the composer's time ''clavier'' referred to a variety of keyboard instruments, ...
'' for string trio; Beethoven's arrangement of his '' Große Fuge'', originally written for
string quartet The term string quartet refers to either a type of musical composition or a group of four people who play them. Many composers from the mid-18th century onwards wrote string quartets. The associated musical ensemble consists of two Violin, violini ...
, for
piano A piano is a keyboard instrument that produces sound when its keys are depressed, activating an Action (music), action mechanism where hammers strike String (music), strings. Modern pianos have a row of 88 black and white keys, tuned to a c ...
duet, and his arrangement of his
Violin Concerto A violin concerto is a concerto for solo violin (occasionally, two or more violins) and instrumental ensemble (customarily orchestra). Such works have been written since the Baroque period, when the solo concerto form was first developed, up thro ...
as a
piano concerto A piano concerto, a type of concerto, is a solo composition in the classical music genre which is composed for piano accompanied by an orchestra or other large ensemble. Piano concertos are typically virtuosic showpieces which require an advance ...
;
Franz Liszt Franz Liszt (22 October 1811 – 31 July 1886) was a Hungarian composer, virtuoso pianist, conductor and teacher of the Romantic music, Romantic period. With a diverse List of compositions by Franz Liszt, body of work spanning more than six ...
's piano arrangements of the works of many composers, including the symphonies of Beethoven;
Tchaikovsky Pyotr Ilyich Tchaikovsky ( ; 7 May 1840 – 6 November 1893) was a Russian composer during the Romantic period. He was the first Russian composer whose music made a lasting impression internationally. Tchaikovsky wrote some of the most popular ...
's arrangement of four Mozart piano pieces into an
orchestral suite A suite, in Western classical music, is an ordered set of instrumental or orchestral/concert band pieces. It originated in the late 14th century as a pairing of dance tunes; and grew in scope so that by the early 17th century it comprised up to f ...
called " Mozartiana";
Mahler Gustav Mahler (; 7 July 1860 – 18 May 1911) was an Austro-Bohemian Romantic composer, and one of the leading conductors of his generation. As a composer he acted as a bridge between the 19th-century Austro-German tradition and the modernism ...
's re-orchestration of
Schumann Robert Schumann (; ; 8 June 181029 July 1856) was a German composer, pianist, and music critic of the early Romantic music, Romantic era. He composed in all the main musical genres of the time, writing for solo piano, voice and piano, chamber ...
symphonies; and Schoenberg's arrangement for orchestra of
Brahms Johannes Brahms (; ; 7 May 1833 – 3 April 1897) was a German composer, virtuoso pianist, and conductor of the mid- Romantic period. His music is noted for its rhythmic vitality and freer treatment of dissonance, often set within studied ye ...
's piano quintet and Bach's "St. Anne" Prelude and Fugue for organ. Since the piano became a popular instrument, a large literature has sprung up of transcriptions and arrangements for piano of works for orchestra or chamber music ensemble. These are sometimes called "
piano reduction In music, a reduction is an arrangement or transcription of an existing score or composition in which complexity is lessened to make analysis, performance, or practice easier or clearer; the number of parts may be reduced or rhythm may be ...
s", because the multiplicity of orchestral parts—in an orchestral piece there may be as many as two dozen separate instrumental parts being played simultaneously—has to be reduced to what a single pianist (or occasionally two pianists, on one or two pianos, such as the different arrangements for
George Gershwin George Gershwin (; born Jacob Gershwine; September 26, 1898 – July 11, 1937) was an American composer and pianist whose compositions spanned jazz, popular music, popular and classical music. Among his best-known works are the songs "Swan ...
's ''
Rhapsody in Blue ''Rhapsody in Blue'' is a 1924 musical composition for solo piano and jazz band by George Gershwin. Commissioned by bandleader Paul Whiteman, the work combines elements of classical music with jazz-influenced effects and premiered in a concer ...
'') can manage to play. Piano reductions are frequently made of orchestral accompaniments to choral works, for the purposes of rehearsal or of performance with keyboard alone. Many orchestral pieces have been transcribed for
concert band A concert band, also called a wind band, wind ensemble, wind symphony, wind orchestra, symphonic band, the symphonic winds, or symphonic wind ensemble, is a performing ensemble consisting of members of the woodwind instrument, woodwind, brass ...
.


Transcription aids


Notation software

Since the advent of desktop publishing, musicians can acquire
music notation software A scorewriter, or music notation program is software for creating, editing and printing sheet music. A scorewriter is to music notation what a word processor is to text, in that they typically provide flexible editing and automatic layout, and p ...
, which can receive the user's mental analysis of notes and then store and format those notes into standard music notation for personal printing or professional publishing of sheet music. Some notation software can accept a Standard
MIDI Musical Instrument Digital Interface (; MIDI) is an American-Japanese technical standard that describes a communication protocol, digital interface, and electrical connectors that connect a wide variety of electronic musical instruments, ...
File (SMF) or MIDI performance as input instead of manual note entry. These notation applications can export their scores in a variety of formats like
EPS An extended play (EP) is a Sound recording and reproduction, musical recording that contains more tracks than a Single (music), single but fewer than an album. Contemporary EPs generally contain up to eight tracks and have a playing time of 1 ...
, PNG, and SVG. Often the software contains a sound library that allows the user's score to be played aloud by the application for verification.


Slow-down software

Prior to the invention of digital transcription aids, musicians would slow down a record or a tape recording to be able to hear the melodic lines and chords at a slower, more digestible pace. The problem with this approach was that it also changed the pitches, so once a piece was transcribed, it would then have to be transposed into the correct key. Software designed to slow down the tempo of music without changing the pitch of the music can be very helpful for recognizing pitches, melodies, chords, rhythms and lyrics when transcribing music. However, unlike the slow-down effect of a record player, the pitch and original octave of the notes will stay the same, and not descend in pitch. This technology is simple enough that it is available in many free software applications. The software generally goes through a two-step process to accomplish this. First, the audio file is played back at a lower sample rate than that of the original file. This has the same effect as playing a tape or vinyl record at slower speed – the pitch is lowered meaning the music can sound like it is in a different key. The second step is to use Digital Signal Processing (or DSP) to shift the pitch back up to the original pitch level or musical key.


Pitch tracking software

As mentioned in the Automatic music transcription section, some commercial software can roughly track the pitch of dominant melodies in polyphonic musical recordings. The note scans are not exact, and often need to be manually edited by the user before saving to file in either a proprietary file format or in Standard
MIDI Musical Instrument Digital Interface (; MIDI) is an American-Japanese technical standard that describes a communication protocol, digital interface, and electrical connectors that connect a wide variety of electronic musical instruments, ...
File Format. Some pitch tracking software also allows the scanned note lists to be animated during audio playback.


Automatic music transcription

The term "automatic music transcription" was first used by audio researchers James A. Moorer, Martin Piszczalski, and Bernard Galler in 1977. With their knowledge of digital audio engineering, these researchers believed that a computer could be programmed to analyze a
digital recording In digital recording, an audio signal, audio or video signal is converted into a stream of discrete numbers representing the changes over time in air pressure for audio, or Color, chroma and luminance values for video. This number stream is s ...
of music such that the pitches of melody lines and chord patterns could be detected, along with the rhythmic accents of percussion instruments. The task of automatic music transcription concerns two separate activities: making an analysis of a musical piece, and printing out a score from that analysis. This was not a simple goal, but one that would encourage academic research for at least another three decades. Because of the close scientific relationship of speech to music, much academic and commercial research that was directed toward the more financially resourced
speech recognition Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also ...
technology would be recycled into research about music recognition technology. While many musicians and educators insist that manually doing transcriptions is a valuable exercise for developing musicians, the motivation for automatic music transcription remains the same as the motivation for sheet music: musicians who do not have intuitive transcription skills will search for sheet music or a chord chart, so that they may quickly learn how to play a song. A collection of tools created by this ongoing research could be of great aid to musicians. Since much recorded music does not have available sheet music, an automatic transcription device could also offer transcriptions that are otherwise unavailable in sheet music. To date, no software application can yet completely fulfill James Moorer’s definition of automatic music transcription. However, the pursuit of automatic music transcription has spawned the creation of many software applications that can aid in manual transcription. Some can slow down music while maintaining original pitch and octave, some can track the pitch of melodies, some can track the chord changes, and others can track the beat of music. Automatic transcription most fundamentally involves identifying the pitch and duration of the performed notes. This entails tracking pitch and identifying note onsets. After capturing those physical measurements, this information is mapped into traditional music notation, i.e., the sheet music.
Digital Signal Processing Digital signal processing (DSP) is the use of digital processing, such as by computers or more specialized digital signal processors, to perform a wide variety of signal processing operations. The digital signals processed in this manner are a ...
is the branch of engineering that provides software engineers with the tools and algorithms needed to analyze a digital recording in terms of pitch (note detection of melodic instruments), and the energy content of un-pitched sounds (detection of percussion instruments). Musical recordings are sampled at a given recording rate and its frequency data is stored in any digital wave format in the computer. Such format represents sound by digital sampling.


Pitch detection

Pitch detection is often the detection of individual
note Note, notes, or NOTE may refer to: Music and entertainment * Musical note, a pitched sound (or a symbol for a sound) in music * ''Notes'' (album), a 1987 album by Paul Bley and Paul Motian * ''Notes'', a common (yet unofficial) shortened versi ...
s that might make up a
melody A melody (), also tune, voice, or line, is a linear succession of musical tones that the listener perceives as a single entity. In its most literal sense, a melody is a combination of Pitch (music), pitch and rhythm, while more figurativel ...
in music, or the notes in a chord. When a single key is pressed upon a piano, what we hear is not just ''one''
frequency Frequency is the number of occurrences of a repeating event per unit of time. Frequency is an important parameter used in science and engineering to specify the rate of oscillatory and vibratory phenomena, such as mechanical vibrations, audio ...
of sound vibration, but a ''composite'' of multiple sound vibrations occurring at different mathematically related frequencies. The elements of this composite of vibrations at differing frequencies are referred to as
harmonic In physics, acoustics, and telecommunications, a harmonic is a sinusoidal wave with a frequency that is a positive integer multiple of the ''fundamental frequency'' of a periodic signal. The fundamental frequency is also called the ''1st har ...
s or partials. For instance, if the note A3 (220 Hz) is played, the individual
frequencies Frequency is the number of occurrences of a repeating event per unit of time. Frequency is an important parameter used in science and engineering to specify the rate of oscillatory and vibratory phenomena, such as mechanical vibrations, audio ...
of the composite's harmonic series will start at 220 Hz as the
fundamental frequency The fundamental frequency, often referred to simply as the ''fundamental'' (abbreviated as 0 or 1 ), is defined as the lowest frequency of a Periodic signal, periodic waveform. In music, the fundamental is the musical pitch (music), pitch of a n ...
: 440 Hz would be the second harmonic, 660 Hz would be the third harmonic, 880 Hz would be the fourth harmonic, etc.) These are integer multiples of the fundamental frequency (for example, two times 220 is 440, the second harmonic). While only about eight harmonics are really needed to audibly recreate the note, the total number of harmonics in this mathematical series can be large, although the higher the harmonic's numeral the weaker the magnitude and contribution of that harmonic. Contrary to intuition, a musical recording at its lowest physical level is not a collection of individual
note Note, notes, or NOTE may refer to: Music and entertainment * Musical note, a pitched sound (or a symbol for a sound) in music * ''Notes'' (album), a 1987 album by Paul Bley and Paul Motian * ''Notes'', a common (yet unofficial) shortened versi ...
s, but is really a collection of individual harmonics. That is why very similar-sounding recordings can be created with differing collections of instruments and their assigned notes. As long as the total harmonics of the recording are recreated to some degree, it does not really matter which instruments or which notes were used. A first step in the detection of notes is the transformation of the sound file's digital data from the
time domain In mathematics and signal processing, the time domain is a representation of how a signal, function, or data set varies with time. It is used for the analysis of mathematical functions, physical signals or time series of economic or environmental ...
into the
frequency domain In mathematics, physics, electronics, control systems engineering, and statistics, the frequency domain refers to the analysis of mathematical functions or signals with respect to frequency (and possibly phase), rather than time, as in time ser ...
, which enables the measurement of various frequencies over time. The graphic image of an audio recording in the frequency domain is called a
spectrogram A spectrogram is a visual representation of the spectrum of frequencies of a signal as it varies with time. When applied to an audio signal, spectrograms are sometimes called sonographs, voiceprints, or voicegrams. When the data are represen ...
or sonogram. A musical note, as a composite of various
harmonic In physics, acoustics, and telecommunications, a harmonic is a sinusoidal wave with a frequency that is a positive integer multiple of the ''fundamental frequency'' of a periodic signal. The fundamental frequency is also called the ''1st har ...
s, appears in a spectrogram like a vertically placed ''comb'', with the individual teeth of the comb representing the various harmonics and their differing frequency values. A
Fourier Transform In mathematics, the Fourier transform (FT) is an integral transform that takes a function as input then outputs another function that describes the extent to which various frequencies are present in the original function. The output of the tr ...
is the mathematical procedure that is used to create the spectrogram from the sound file’s digital data. The task of many note detection algorithms is to search the
spectrogram A spectrogram is a visual representation of the spectrum of frequencies of a signal as it varies with time. When applied to an audio signal, spectrograms are sometimes called sonographs, voiceprints, or voicegrams. When the data are represen ...
for the occurrence of such ''comb patterns'' (a composite of harmonics) caused by individual notes. Once the pattern of a note's particular comb shape of
harmonic In physics, acoustics, and telecommunications, a harmonic is a sinusoidal wave with a frequency that is a positive integer multiple of the ''fundamental frequency'' of a periodic signal. The fundamental frequency is also called the ''1st har ...
s is detected, the note's pitch can be measured by the vertical position of the comb pattern upon the
spectrogram A spectrogram is a visual representation of the spectrum of frequencies of a signal as it varies with time. When applied to an audio signal, spectrograms are sometimes called sonographs, voiceprints, or voicegrams. When the data are represen ...
. There are basically two different types of music which create very different demands for a pitch detection algorithm: ''monophonic'' music and ''polyphonic'' music. Monophonic music is a passage with only one instrument playing one note at a time, while polyphonic music can have multiple instruments and vocals playing at once. Pitch detection upon a monophonic recording was a relatively simple task, and its technology enabled the invention of guitar tuners in the 1970s. However, pitch detection upon polyphonic music becomes a much more difficult task because the image of its
spectrogram A spectrogram is a visual representation of the spectrum of frequencies of a signal as it varies with time. When applied to an audio signal, spectrograms are sometimes called sonographs, voiceprints, or voicegrams. When the data are represen ...
now appears as a vague cloud due to a multitude of overlapping comb patterns, caused by each note's multiple
harmonic In physics, acoustics, and telecommunications, a harmonic is a sinusoidal wave with a frequency that is a positive integer multiple of the ''fundamental frequency'' of a periodic signal. The fundamental frequency is also called the ''1st har ...
s. Another method of pitch detection was invented by Martin Piszczalski in conjunction with Bernard Galler in the 1970s and has since been widely followed. It targets monophonic music. Central to this method is how pitch is determined by the human
ear In vertebrates, an ear is the organ that enables hearing and (in mammals) body balance using the vestibular system. In humans, the ear is described as having three parts: the outer ear, the middle ear and the inner ear. The outer ear co ...
. The process attempts to roughly mimic the biology of the human inner
ear In vertebrates, an ear is the organ that enables hearing and (in mammals) body balance using the vestibular system. In humans, the ear is described as having three parts: the outer ear, the middle ear and the inner ear. The outer ear co ...
by finding only but a few of the loudest
harmonic In physics, acoustics, and telecommunications, a harmonic is a sinusoidal wave with a frequency that is a positive integer multiple of the ''fundamental frequency'' of a periodic signal. The fundamental frequency is also called the ''1st har ...
s at a given instant. That small set of found
harmonic In physics, acoustics, and telecommunications, a harmonic is a sinusoidal wave with a frequency that is a positive integer multiple of the ''fundamental frequency'' of a periodic signal. The fundamental frequency is also called the ''1st har ...
s are in turn compared against all the possible resultant pitches' harmonic-sets, to hypothesize what the most probable pitch could be given that particular set of harmonics. To date, the complete note detection of polyphonic recordings remains a mystery to audio engineers, although they continue to make progress by inventing algorithms which can partially detect some of the notes of a polyphonic recording, such as a
melody A melody (), also tune, voice, or line, is a linear succession of musical tones that the listener perceives as a single entity. In its most literal sense, a melody is a combination of Pitch (music), pitch and rhythm, while more figurativel ...
or bass line.


Beat detection

Beat tracking is the determination of a repeating time interval between perceived pulses in music. Beat can also be described as 'foot tapping' or 'hand clapping' in time with the music. The beat is often a predictable basic unit in time for the musical piece, and may only vary slightly during the performance. Songs are frequently measured for their Beats Per Minute (BPM) in determining the tempo of the music, whether it be fast or slow. Since notes frequently begin on a beat, or a simple subdivision of the beat's time interval, beat tracking software has the potential to better resolve note onsets that may have been detected in a crude fashion. Beat tracking is often the first step in the detection of percussion instruments. Despite the intuitive nature of 'foot tapping' of which most humans are capable, developing an algorithm to detect those beats is difficult. Most of the current software algorithms for beat detection use a group competing hypothesis for beats-per-minute, as the algorithm progressively finds and resolves local peaks in volume, roughly corresponding to the foot-taps of the music.


How automatic music transcription works

To transcribe music automatically, several problems must be solved: 1. Notes must be recognized – this is typically done by changing from the time domain into the frequency domain. This can be accomplished through the
Fourier transform In mathematics, the Fourier transform (FT) is an integral transform that takes a function as input then outputs another function that describes the extent to which various frequencies are present in the original function. The output of the tr ...
. Computer algorithms for doing this are common. The
fast Fourier transform A fast Fourier transform (FFT) is an algorithm that computes the discrete Fourier transform (DFT) of a sequence, or its inverse (IDFT). A Fourier transform converts a signal from its original domain (often time or space) to a representation in ...
algorithm computes the frequency content of a signal, and is useful in processing musical excerpts. 2. A beat and tempo need to be detected (
Beat detection In signal analysis, beat detection is using computer software or computer hardware to detect the beat of a musical score. There are many methods available and beat detection is always a tradeoff between accuracy and speed. Beat detectors are comm ...
)- this is a difficult, many-faceted problem. The method proposed in Costantini et al. 2009 focuses on note events and their main characteristics: the attack instant, the pitch and the final instant.
Onset detection Onset refers to the beginning of a musical note or other sound. It is related to (but different from) the concept of a transient: all musical notes have an onset, but do not necessarily include an initial transient. Onset detection In signal pro ...
exploits a binary time-frequency representation of the audio signal. Note classification and offset detection are based on constant Q transform (CQT) and
support vector machines In machine learning, support vector machines (SVMs, also support vector networks) are supervised max-margin models with associated learning algorithms that analyze data for classification and regression analysis. Developed at AT&T Bell Laborato ...
(SVMs). This in turn leads to a “pitch contour” namely a continuously time-varying line that corresponds to what humans refer to as melody. The next step is to segment this continuous melodic stream to identify the beginning and end of each note. After that, each “note unit” is expressed in physical terms (e.g., 4402 Hz, .52 seconds). The final step is then to map this physical information into familiar music-notation-like terms for each note (e.g., an A4, quarter note).


Detailed computer steps behind automatic music transcription

In terms of actual computer processing, the principal steps are to 1) digitize the performed, analog music, 2) do successive short-term,
fast Fourier transform A fast Fourier transform (FFT) is an algorithm that computes the discrete Fourier transform (DFT) of a sequence, or its inverse (IDFT). A Fourier transform converts a signal from its original domain (often time or space) to a representation in ...
(FFTs) to obtain the time-varying spectra, 3) identify the peaks in each spectrum, 4) analyze the spectral peaks to get pitch candidates, 5) connect the strongest individual pitch candidates to get the most likely time-varying, pitch contour, 6) map this physical data into the closest music-notation terms. These fundamental steps, originated by Piszczalski in the 1970s, became the foundation of automatic music transcription. The most controversial and difficult step in this process is detecting pitch . The most successful pitch methods operate in the frequency domain, not the time domain. While time-domain methods have been proposed, they can break down for real-world musical instruments played in typically reverberant rooms. The pitch-detection method invented by Piszczalski again mimics human hearing. It follows how only certain sets of partials “fuse” together in human listening. These are the sets that create the perception of a single pitch only. Fusion occurs only when two partials are within 1.5% of being a perfect, harmonic pair (i.e., their frequencies approximate a low-integer pair set such as 1:2, 5:8, etc.) This near harmonic match is required of all the partials in order for a human to hear them as only a single pitch.


See also

*
Orchestration Orchestration is the study or practice of writing music for an orchestra (or, more loosely, for any musical ensemble, such as a concert band) or of adapting music composed for another medium for an orchestra. Also called "instrumentation", orch ...
*
Timbre In music, timbre (), also known as tone color or tone quality (from psychoacoustics), is the perceived sound of a musical note, sound or tone. Timbre distinguishes sounds according to their source, such as choir voices and musical instrument ...
*
Composer tributes (classical music) A composer is a person who writes music. The term is especially used to indicate composers of Western classical music, or those who are composers by occupation. Many composers are, or were, also skilled performers of music. Etymology and defi ...
* :Scorewriters *
Reduction (music) In music, a reduction is an arrangement or transcription of an existing score or composition in which complexity is lessened to make analysis, performance, or practice easier or clearer; the number of parts may be reduced or rhythm may be ...


References

{{DEFAULTSORT:Transcription (Music) Musical notation Musical tributes