HOME
*





Sound Recognition
Sound recognition is a technology, which is based on both traditional pattern recognition theories and audio signal analysis methods. Sound recognition technologies contain preliminary data processing, feature extraction and classification algorithms. Sound recognition can classify feature vectors. Feature vectors are created as a result of preliminary data processing and linear predictive coding. Sound recognition technologies are used for: * Music recognition * Speech recognition * Automatic alarm detection and identification for surveillance, monitoring systems, based on the acoustic environment * Assistance to disabled or elderly people affected in their hearing capabilities * Identifying species of animals such as fish and mammals, e.g. in acoustical oceanography Security In monitoring and security, an important contribution to alarm detection and alarm verification can be supplied, using sound recognition techniques. In particular, these methods could be helpful for intr ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Pattern Recognition
Pattern recognition is the automated recognition of patterns and regularities in data. It has applications in statistical data analysis, signal processing, image analysis, information retrieval, bioinformatics, data compression, computer graphics and machine learning. Pattern recognition has its origins in statistics and engineering; some modern approaches to pattern recognition include the use of machine learning, due to the increased availability of big data and a new abundance of processing power. These activities can be viewed as two facets of the same field of application, and they have undergone substantial development over the past few decades. Pattern recognition systems are commonly trained from labeled "training" data. When no labeled data are available, other algorithms can be used to discover previously unknown patterns. KDD and data mining have a larger focus on unsupervised methods and stronger connection to business use. Pattern recognition focuses more on the s ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Feature Extraction
In machine learning, pattern recognition, and image processing, feature extraction starts from an initial set of measured data and builds derived values (features) intended to be informative and non-redundant, facilitating the subsequent learning and generalization steps, and in some cases leading to better human interpretations. Feature extraction is related to dimensionality reduction. When the input data to an algorithm is too large to be processed and it is suspected to be redundant (e.g. the same measurement in both feet and meters, or the repetitiveness of images presented as pixels), then it can be transformed into a reduced set of features (also named a feature vector). Determining a subset of the initial features is called feature selection. The selected features are expected to contain the relevant information from the input data, so that the desired task can be performed by using this reduced representation instead of the complete initial data. General Feature extractio ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Linear Predictive Coding
Linear predictive coding (LPC) is a method used mostly in audio signal processing and speech processing for representing the spectral envelope of a digital signal of speech in compressed form, using the information of a linear predictive model. LPC is the most widely used method in speech coding and speech synthesis. It is a powerful speech analysis technique, and a useful method for encoding good quality speech at a low bit rate. Overview LPC starts with the assumption that a speech signal is produced by a buzzer at the end of a tube (for voiced sounds), with occasional added hissing and popping sounds (for voiceless sounds such as sibilants and plosives). Although apparently crude, this Source–filter model is actually a close approximation of the reality of speech production. The glottis (the space between the vocal folds) produces the buzz, which is characterized by its intensity (loudness) and frequency (pitch). The vocal tract (the throat and mouth) forms the tube, ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Music Recognition
Music information retrieval (MIR) is the interdisciplinary science of retrieving information from music. MIR is a small but growing field of research with many real-world applications. Those involved in MIR may have a background in academic musicology, psychoacoustics, psychology, signal processing, informatics, machine learning, optical music recognition, computational intelligence or some combination of these. Applications MIR is being used by businesses and academics to categorize, manipulate and even create music. Music classification One of the classical MIR research topic is genre classification, which is categorizing music items into one of pre-defined genres such as classical, jazz, rock, etc. Mood classification, artist classification, instrument identification, and music tagging are also popular topics. Recommender systems Several recommender systems for music already exist, but surprisingly few are based upon MIR techniques, instead making use of similarity bet ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Speech Recognition
Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers with the main benefit of searchability. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis. Some speech recognition systems require "training" (also called "enrollment") where an individual speaker reads text or isolated vocabulary into the system. The system analyzes the person's specific voice and uses it to fine-tune the recognition of that person's speech, resulting in increased accuracy. Systems that do not use training are called "speaker-independent" systems. Systems that use training are called "speaker dependent". Speech recognition ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Species Identification
Identification in biology is the process of assigning a pre-existing taxon name to an individual organism. Identification of organisms to individual scientific names (or codes) may be based on individualistic natural body features, experimentally created individual markers (e.g., color dot patterns), or natural individualistic molecular markers (similar to those used in maternity or paternity identification tests). Individual identification is used in ecology, wildlife management and conservation biology. The more common form of identification is the identification of organisms to common names (e. g., "lion") or scientific name (e. g., "''Panthera leo''"). By necessity this is based on inherited features ("characters") of the sexual organisms, the inheritance forming the basis of defining a class. The features may, e. g., be morphological, anatomical, physiological, behavioral, or molecular. The term "determination" may occasionally be used as a synonym for identification (e. ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Acoustical Oceanography
Hydroacoustics is the study and application of sound in water. Hydroacoustics, using sonar technology, is most commonly used for monitoring of underwater physical and biological characteristics. Hydroacoustics can be used to detect the depth of a water body (bathymetry), as well as the presence or absence, abundance, distribution, size, and behavior of underwater plants and animals. Hydroacoustic sensing involves "passive acoustics" (listening for sounds) or '' active acoustics'' making a sound and listening for the echo, hence the common name for the device, echo sounder or echosounder. There are a number of different causes of noise from shipping. These can be subdivided into those caused by the propeller, those caused by machinery, and those caused by the movement of the hull through the water. The relative importance of these three different categories will depend, amongst other things, on the ship type One of the main causes of hydro acoustic noise from fully submerged ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Audio Analytic
Audio Analytic is a British company headquartered in Cambridge, England that has developed a patented sound recognition software framework called ai3 which provides technology with the ability to understand context through sound. This framework includes an embeddable software platform that can react to a range of sounds such as smoke alarms and carbon monoxide alarms, window breakage, infant crying and dogs barking. History The company was based on founder Christopher Mitchell's doctoral research from Anglia Ruskin University, with seed investment from EEDA (East of England Development Agency) and local Cambridge Angels investors. In 2022 Audio Analytic was bought by Facebook and Instagram owner Meta. Products Audio Analytic sells ai3, a software package that is embedded on a device, along with an assortment of sound profiles that the software can recognise, including warning alarms, window breakage, an infant crying, and voice activity. Audio Analytic developed the Polyph ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Acoustic Fingerprint
An acoustic fingerprint is a condensed digital summary, a fingerprint, deterministically generated from an audio signal, that can be used to identify an audio sample or quickly locate similar items in an audio database. Practical uses of acoustic fingerprinting include identifying songs, melodies, tunes, or advertisements; sound effect library management; and video file identification. Media identification using acoustic fingerprints can be used to monitor the use of specific musical works and performances on radio broadcast, records, CDs, streaming media and peer-to-peer networks. This identification has been used in copyright compliance, licensing, and other monetization schemes. Attributes A robust acoustic fingerprint algorithm must take into account the perceptual characteristics of the audio. If two files sound alike to the human ear, their acoustic fingerprints should match, even if their binary representations are quite different. Acoustic fingerprints are not h ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]