Concept Class
   HOME

TheInfoList



OR:

In
computational learning theory In computer science, computational learning theory (or just learning theory) is a subfield of artificial intelligence devoted to studying the design and analysis of machine learning algorithms. Overview Theoretical results in machine learning m ...
in
mathematics Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...
, a concept over a domain ''X'' is a total
Boolean function In mathematics, a Boolean function is a function whose arguments and result assume values from a two-element set (usually , or ). Alternative names are switching function, used especially in older computer science literature, and truth function ( ...
over ''X''. A concept class is a class of concepts. Concept classes are a subject of
computational learning theory In computer science, computational learning theory (or just learning theory) is a subfield of artificial intelligence devoted to studying the design and analysis of machine learning algorithms. Overview Theoretical results in machine learning m ...
. Concept class terminology frequently appears in
model theory In mathematical logic, model theory is the study of the relationship between formal theories (a collection of sentences in a formal language expressing statements about a mathematical structure), and their models (those structures in which the s ...
associated with
probably approximately correct In computational learning theory, probably approximately correct (PAC) learning is a framework for mathematical analysis of machine learning. It was proposed in 1984 by Leslie Valiant.L. Valiant. A theory of the learnable.' Communications of the A ...
(PAC) learning.Chase, H., & Freitag, J. (2018). ''Model Theory and Machine Learning''. arXiv preprint arXiv:1801.06566
In this setting, if one takes a set ''Y'' as a set of (classifier output) labels, and ''X'' is a set of examples, the map c: X\to Y, i.e. from examples to classifier labels (where Y = \ and where ''c'' is a subset of ''X''), ''c'' is then said to be a ''concept''. A ''concept class'' C is then a collection of such concepts. Given a class of concepts ''C'', a subclass ''D'' is reachable if there exists a sample ''s'' such that ''D'' contains exactly those concepts in ''C'' that are extensions to ''s''. Not every subclass is reachable.


Background

A ''sample'' s is a
partial function In mathematics, a partial function from a set to a set is a function from a subset of (possibly itself) to . The subset , that is, the domain of viewed as a function, is called the domain of definition of . If equals , that is, if is de ...
from X to \. Identifying a concept with its characteristic function mapping X to \, it is a special case of a sample. Two samples are ''consistent'' if they agree on the intersection of their domains. A sample s' ''extends'' another sample s if the two are consistent and the domain of s is contained in the domain of s'.


Examples

Suppose that C = S^+(X). Then: * the subclass \ is reachable with the sample s = \; * the subclass S^+(Y) for Y\subseteq X are reachable with a sample that maps the elements of X - Y to zero; * the subclass S(X), which consists of the singleton sets, is ''not'' reachable.


Applications

Let C be some concept class. For any concept c\in C, we call this concept 1/d''-good'' for a positive integer d if, for all x\in X, at least 1/d of the concepts in C agree with c on the classification of x. The ''fingerprint dimension'' FD(C) of the entire concept class C is the least positive integer d such that every reachable subclass C'\subseteq C contains a concept that is 1/d-good for it. This quantity can be used to bound the minimum number of equivalence queries needed to learn a class of concepts according to the following
inequality Inequality may refer to: Economics * Attention inequality, unequal distribution of attention across users, groups of people, issues in etc. in attention economy * Economic inequality, difference in economic well-being between population groups * ...
:FD(C) - 1\leq \#EQ(C)\leq \lceil FD(C)\ln(, C, )\rceil.


References

Computational learning theory {{Compu-AI-stub