Confusion Matrix

	Confusion Matrix In the field of machine learning and specifically the problem of statistical classification, a confusion matrix, also known as error matrix, is a specific table layout that allows visualization of the performance of an algorithm, typically a supervised learning one; in unsupervised learning it is usually called a matching matrix. Each row of the matrix represents the instances in an actual class while each column represents the instances in a predicted class, or vice versa – both variants are found in the literature. The diagonal of the matrix therefore represents all instances that are correctly predicted. The name stems from the fact that it makes it easy to see whether the system is confusing two classes (i.e. commonly mislabeling one as another). It is a special kind of contingency table, with two dimensions ("actual" and "predicted"), and identical sets of "classes" in both dimensions (each combination of dimension and class is a variable in the contingency table). ...
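The row and column layout described in the entry above can be illustrated with a minimal sketch. The function name `confusion_matrix`, the labels, and the sample data are all hypothetical; the sketch assumes the convention in which rows are actual classes and columns are predicted classes.

```python
from collections import Counter

def confusion_matrix(actual, predicted, labels):
    """Build a matrix whose rows are actual classes and whose columns are predicted classes."""
    # Count every (actual, predicted) pair once.
    counts = Counter(zip(actual, predicted))
    # Missing pairs default to 0 because Counter returns 0 for unseen keys.
    return [[counts[(a, p)] for p in labels] for a in labels]

# Illustrative data only (not taken from any real classifier).
labels = ["cat", "dog"]
actual = ["cat", "cat", "cat", "dog", "dog", "dog", "dog"]
predicted = ["cat", "cat", "dog", "dog", "dog", "dog", "cat"]

matrix = confusion_matrix(actual, predicted, labels)

# The diagonal holds the correctly predicted instances.
correct = sum(matrix[i][i] for i in range(len(labels)))

for label, row in zip(labels, matrix):
    print(label, row)
print("correct:", correct)
```

Off-diagonal cells show which classes are being confused: here one actual "cat" was labelled "dog" and one actual "dog" was labelled "cat".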
	Machine Learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of statistical algorithms that can learn from data and generalise to unseen data, and thus perform tasks without explicit instructions. Within machine learning, advances in the subdiscipline of deep learning have allowed neural networks, a class of statistical algorithms, to surpass many previous machine learning approaches in performance. ML finds application in many fields, including natural language processing, computer vision, speech recognition, email filtering, agriculture, and medicine. The application of ML to business problems is known as predictive analytics. Statistics and mathematical optimisation (mathematical programming) methods comprise the foundations of machine learning. Data mining is a related field of study, focusing on exploratory data analysi ...
	Sensitivity (test) In medicine and statistics, sensitivity and specificity mathematically describe the accuracy of a test that reports the presence or absence of a medical condition. If individuals who have the condition are considered "positive" and those who do not are considered "negative", then sensitivity is a measure of how well a test can identify true positives and specificity is a measure of how well a test can identify true negatives: sensitivity (true positive rate) is the probability of a positive test result, conditioned on the individual truly being positive, and specificity (true negative rate) is the probability of a negative test result, conditioned on the individual truly being negative. If the true status of the condition cannot be known, sensitivity and specificity can be defined relative to a "gold standard test" which is assumed correct. For all testing, both diagnoses and screening, there is usually a trade-off between sensitivity and specificity, such that higher sensiti ...
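The two conditional probabilities defined in the entry above can be estimated from confusion-matrix counts. The following is a minimal sketch; the function names and the counts are hypothetical, and it assumes that at least one truly positive and one truly negative individual are present, so the denominators are nonzero.

```python
def sensitivity(tp, fn):
    """True positive rate: share of truly positive individuals who test positive."""
    return tp / (tp + fn)

def specificity(tn, fp):
    """True negative rate: share of truly negative individuals who test negative."""
    return tn / (tn + fp)

# Illustrative counts only: 100 individuals with the condition, 200 without.
tp, fn = 90, 10
tn, fp = 160, 40

sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
```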
	Positive And Negative Predictive Values The positive and negative predictive values (PPV and NPV respectively) are the proportions of positive and negative results in statistics and diagnostic tests that are true positive and true negative results, respectively. The PPV and NPV describe the performance of a diagnostic test or other statistical measure. A high result can be interpreted as indicating the accuracy of such a statistic. The PPV and NPV are not intrinsic to the test (as true positive rate and true negative rate are); they depend also on the prevalence. Both PPV and NPV can be derived using Bayes' theorem. Although sometimes used synonymously, a positive predictive value generally refers to what is established by control groups, while a post-test probability refers to a probability for an individual. Still, if the individual's pre-test probability of the target condition is the same as the prevalence in the control group used to establish the positive predictive value, the two are numerically equal ...
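The dependence of PPV and NPV on prevalence noted in the entry above can be made concrete with Bayes' theorem. This is a minimal sketch under stated assumptions: the function names and the numeric inputs are hypothetical, and sensitivity, specificity, and prevalence are treated as known probabilities strictly between 0 and 1.

```python
def ppv(sens, spec, prevalence):
    """P(condition | positive test), obtained from Bayes' theorem."""
    true_pos = sens * prevalence
    false_pos = (1 - spec) * (1 - prevalence)
    return true_pos / (true_pos + false_pos)

def npv(sens, spec, prevalence):
    """P(no condition | negative test), obtained from Bayes' theorem."""
    true_neg = spec * (1 - prevalence)
    false_neg = (1 - sens) * prevalence
    return true_neg / (true_neg + false_neg)

# Same hypothetical test (sensitivity 0.9, specificity 0.8) at two prevalences.
for prevalence in (0.01, 0.5):
    print(prevalence, round(ppv(0.9, 0.8, prevalence), 4), round(npv(0.9, 0.8, prevalence), 4))
```

With the test held fixed, lowering the prevalence lowers the PPV and raises the NPV, which is why neither value is intrinsic to the test.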
	Principle Of Maximum Entropy The principle of maximum entropy states that the probability distribution which best represents the current state of knowledge about a system is the one with largest entropy, in the context of precisely stated prior data (such as a proposition that expresses testable information). Another way of stating this: Take precisely stated prior data or testable information about a probability distribution function. Consider the set of all trial probability distributions that would encode the prior data. According to this principle, the distribution with maximal information entropy is the best choice. History: The principle was first expounded by E. T. Jaynes in two papers in 1957, where he emphasized a natural correspondence between statistical mechanics and information theory. In particular, Jaynes argued that the Gibbsian method of statistical mechanics is sound by also arguing that the entropy of statistical mechanics and the information entropy of information theory are the same ...
	Transportation Theory (mathematics) In mathematics and economics, transportation theory or transport theory is a name given to the study of optimal transportation and allocation of resources. The problem was formalized by the French mathematician Gaspard Monge in 1781 (G. Monge, Mémoire sur la théorie des déblais et des remblais. Histoire de l’Académie Royale des Sciences de Paris, avec les Mémoires de Mathématique et de Physique pour la même année, pages 666–704, 1781). In the 1920s A.N. Tolstoi was one of the first to study the transportation problem mathematically. In 1930, in the collection Transportation Planning Volume I for the National Commissariat of Transportation of the Soviet Union, he published a paper "Methods of Finding the Minimal Kilometrage in Cargo-transportation in space". Major advances were made in the field during World War II by the Soviet mathematician and economist Leonid Kantorovich (L. Kantorovich, On the translocation of masses, C.R. (Doklady) Acad. Sci. URSS (N.S ...
	Multi-label Classification In machine learning, multi-label classification or multi-output classification is a variant of the classification problem where multiple nonexclusive labels may be assigned to each instance. Multi-label classification is a generalization of multiclass classification, which is the single-label problem of categorizing instances into precisely one of several (greater than or equal to two) classes. In the multi-label problem the labels are nonexclusive and there is no constraint on how many of the classes the instance can be assigned to. The formulation of multi-label learning was first introduced by Shen et al. in the context of Semantic Scene Classification (Xipeng Shen, Matthew Boutell, Jiebo Luo, and Christopher Brown, Multi-label Machine Learning and Its Application to Semantic Scene Classification, in Proceedings of IS&T/SPIE's Sixteenth Annual Symposium on Electronic Imaging: Science and Technology (EI 2004), San Jose, California, USA, January 2004, pag ...
	Whistled Language Whistled speech is a form of speech surrogacy in which whistling is used to mimic speech. Speakers of more than 80 languages have been found to practice various degrees of whistled speech, most of them in rugged topography or dense forests, where movement to carry messages is challenging, and whistling expands the distance of communication. The practice is generally threatened by increased modernization and faster roads, but successful conservation efforts are recorded. Definition: A whistled language is a system of whistled communication which allows fluent whistlers to transmit and comprehend a potentially unlimited number of messages over long distances. Whistled languages are different in this respect from free associative whistling, which may be done to simulate music, to attract attention, or, in the case of herders or animal trainers, to transmit simple messages or instructions to animal companions. Generally, whistled languages emulate the tones or vowel formants of ...
	Matthews Correlation Coefficient In statistics, the phi coefficient, or mean square contingency coefficient, denoted by φ or r_φ, is a measure of association for two binary variables. In machine learning, it is known as the Matthews correlation coefficient (MCC) and used as a measure of the quality of binary (two-class) classifications, introduced by biochemist Brian W. Matthews in 1975. Introduced by Karl Pearson (Cramer, H. (1946). Mathematical Methods of Statistics. Princeton: Princeton University Press, p. 282 (second paragraph). https://archive.org/details/in.ernet.dli.2015.223699) and also known as the Yule phi coefficient from its introduction by Udny Yule in 1912, this measure is similar to the Pearson correlation coefficient in its interpretation. In meteorology, the phi coefficient, or its square (the latter aligning with M. H. Doolittle's original proposition from 1885), is referred to as the Doolittle Skill Score or the Doolittle Measure of Association. Definition: A Pear ...
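The excerpt above is cut off before its definition, so the following sketch relies on the commonly stated form of the MCC for a 2×2 confusion matrix, MCC = (TP·TN − FP·FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)). The function name `mcc` is hypothetical, and returning 0.0 when the denominator vanishes is an assumed convention rather than part of the entry.

```python
import math

def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from binary confusion-matrix counts."""
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        # Assumed convention: report 0.0 when any marginal total is empty.
        return 0.0
    return (tp * tn - fp * fn) / denominator

# Illustrative counts only.
print(mcc(50, 50, 0, 0))    # every prediction correct
print(mcc(0, 0, 50, 50))    # every prediction wrong
print(mcc(25, 25, 25, 25))  # predictions unrelated to the actual classes
```

Like the Pearson correlation coefficient, the value lies between −1 and +1, with 0 indicating no association between predicted and actual classes.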
	Informedness Youden's J statistic (also called Youden's index) is a single statistic that captures the performance of a dichotomous diagnostic test. In meteorology, this statistic is referred to as Peirce Skill Score (PSS), Hanssen–Kuipers Discriminant (HKD), or True Skill Statistic (TSS). (Bookmaker) Informedness is its generalization to the multiclass case and estimates the probability of an informed decision. Definition: Youden's J statistic is J = \text{sensitivity} + \text{specificity} - 1 = \text{recall}_1 + \text{recall}_0 - 1, with the two right-hand quantities being sensitivity and specificity. Thus the expanded formula is: J = \frac{TP}{TP + FN} + \frac{TN}{TN + FP} - 1 = \frac{TP \times TN - FP \times FN}{(TP + FN)(TN + FP)}. In this equation, TP is the number of true positives, TN the number of true negatives, FP the number of false positives and FN the number of false negatives. The index was suggested by W. J. Youden in 1950 as a way of summarising the performance of a diagnostic test; however, the formula was earlier published in Science by C. S. Peirce in 1884. Its value range ...
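The two forms of Youden's J given in the entry above (sensitivity plus specificity minus one, and the single-fraction expansion in TP, TN, FP and FN) can be checked against each other numerically. This is a minimal sketch; the function names and the counts are hypothetical, and both denominators are assumed to be nonzero.

```python
def youden_j(tp, tn, fp, fn):
    """J as sensitivity + specificity - 1."""
    return tp / (tp + fn) + tn / (tn + fp) - 1

def youden_j_expanded(tp, tn, fp, fn):
    """J as the single fraction (TP*TN - FP*FN) / ((TP+FN)(TN+FP))."""
    return (tp * tn - fp * fn) / ((tp + fn) * (tn + fp))

# Illustrative counts only.
tp, fn, tn, fp = 90, 10, 160, 40
j_sum = youden_j(tp, tn, fp, fn)
j_frac = youden_j_expanded(tp, tn, fp, fn)
print(round(j_sum, 6), round(j_frac, 6))
```

For these counts sensitivity is 0.9 and specificity is 0.8, so both forms give J = 0.7 up to floating-point rounding.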
	F1 Score In statistical analysis of binary classification and information retrieval systems, the F-score or F-measure is a measure of predictive performance. It is calculated from the precision and recall of the test, where the precision is the number of true positive results divided by the number of all samples predicted to be positive, including those not identified correctly, and the recall is the number of true positive results divided by the number of all samples that should have been identified as positive. Precision is also known as positive predictive value, and recall is also known as sensitivity in diagnostic binary classification. The F1 score is the harmonic mean of the precision and recall. It thus symmetrically represents both precision and recall in one metric. The more generic F_\beta score applies additional weights, valuing one of precision or recall more than the other. The highest possible value of an F-score is 1.0, indicating perfect precision and recall, and the ...
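The definitions of precision, recall, and their harmonic mean in the entry above can be sketched as follows. The function names and counts are hypothetical. The weighted form used here, F_\beta = (1 + \beta^2) · precision · recall / (\beta^2 · precision + recall), is the commonly stated one and is an assumption in the sense that the excerpt does not spell it out; precision and recall are assumed not to both be zero.

```python
def precision(tp, fp):
    """True positives divided by all samples predicted positive."""
    return tp / (tp + fp)

def recall(tp, fn):
    """True positives divided by all samples that are actually positive."""
    return tp / (tp + fn)

def f_beta(p, r, beta=1.0):
    """Weighted harmonic mean of precision p and recall r; beta=1 gives the F1 score."""
    b2 = beta * beta
    return (1 + b2) * p * r / (b2 * p + r)

# Illustrative counts only.
tp, fp, fn = 90, 40, 10
p = precision(tp, fp)
r = recall(tp, fn)
f1 = f_beta(p, r)
f2 = f_beta(p, r, beta=2.0)  # beta > 1 weights recall more heavily
print(round(p, 4), round(r, 4), round(f1, 4), round(f2, 4))
```

Because recall exceeds precision in this example, the recall-weighted F2 score comes out higher than F1.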
	Predictive Analytics Predictive analytics encompasses a variety of statistical techniques from data mining, predictive modeling, and machine learning that analyze current and historical facts to make predictions about future or otherwise unknown events. In business, predictive models exploit patterns found in historical and transactional data to identify risks and opportunities. Models capture relationships among many factors to allow assessment of risk or potential associated with a particular set of conditions, guiding decision-making for candidate transactions. The defining functional effect of these technical approaches is that predictive analytics provides a predictive score (probability) for each individual (customer, employee, healthcare patient, product SKU, vehicle, component, machine, or other organizational unit) in order to determine, inform, or influence organizational processes that pertain across large numbers of individuals, such as ...