Boolean analysis was introduced by Flament (1976).
[Flament, C. (1976). "L'analyse booleenne de questionnaire", Paris: Mouton.] The goal of a Boolean analysis is to detect
deterministic
Determinism is a philosophical view, where all events are determined completely by previously existing causes. Deterministic theories throughout the history of philosophy have developed from diverse and sometimes overlapping motives and consi ...
dependencies between the items of a
questionnaire
A questionnaire is a research instrument that consists of a set of questions (or other types of prompts) for the purpose of gathering information from respondents through survey or statistical study. A research questionnaire is typically a mix of ...
or similar data-structures in observed response patterns. These deterministic dependencies have the form of
logical formulas connecting the items. Assume, for example, that a questionnaire contains items ''i'', ''j'', and ''k''. Examples of such deterministic dependencies are then ''i'' → ''j'', ''i'' ∧ ''j'' → ''k'', and ''i'' ∨ ''j'' → ''k''.
Since the basic work of Flament (1976) a number of different methods for Boolean analysis have been developed. See, for example, Buggenhaut and Degreef (1987), Duquenne (1987),
item tree analysis Item tree analysis (ITA) is a data analytical method which allows constructing a
hierarchical structure on the items of a questionnaire or test from observed response
patterns. Assume that we have a questionnaire with ''m'' items and that subjects ...
Leeuwe (1974), Schrepp (1999), or Theuns (1998). These methods share the goal to derive deterministic dependencies between the items of a questionnaire from data, but differ in the algorithms to reach this goal.
Boolean analysis is an
explorative method to detect deterministic dependencies between items. The detected dependencies must be confirmed in subsequent research. Methods of Boolean analysis do not assume that the detected dependencies describe the data completely. There may be other probabilistic dependencies as well. Thus, a Boolean analysis tries to detect interesting deterministic structures in the data, but has not the goal to uncover all structural aspects in the data set. Therefore, it makes sense to use other methods, like for example
latent class analysis, together with a Boolean analysis.
Application areas
The investigation of deterministic dependencies has some tradition in
educational psychology
Educational psychology is the branch of psychology concerned with the scientific study of human learning. The study of learning processes, from both cognitive and behavioral perspectives, allows researchers to understand individual differences in ...
. The items represent in this area usually skills or cognitive abilities of subjects. Bart and Airasian (1974) use Boolean analysis to establish logical implications on a set of
Piagetian
Jean William Fritz Piaget (, , ; 9 August 1896 – 16 September 1980) was a Swiss psychologist known for his work on child development. Piaget's theory of cognitive development and epistemological view are together called "genetic epistemology". ...
tasks. Other examples in this tradition are the learning hierarchies of Gagné (1968) or the theory of structural learning of Scandura (1971).
There are several attempts to use boolean analysis, especially
item tree analysis Item tree analysis (ITA) is a data analytical method which allows constructing a
hierarchical structure on the items of a questionnaire or test from observed response
patterns. Assume that we have a questionnaire with ''m'' items and that subjects ...
to construct
knowledge space In mathematical psychology and education theory, a knowledge space is a combinatorial structure used to formulate mathematical models describing the progression of a human learner. Knowledge spaces were introduced in 1985 by Jean-Paul Doignon and ...
s from data. Examples can be found in Held and Korossy (1998), or Schrepp (2002).
Methods of Boolean analysis are used in a number of
social science
Social science is one of the branches of science, devoted to the study of societies and the relationships among individuals within those societies. The term was formerly used to refer to the field of sociology, the original "science of soc ...
studies to get insight into the structure of
dichotomous
A dichotomy is a partition of a whole (or a set) into two parts (subsets). In other words, this couple of parts must be
* jointly exhaustive: everything must belong to one part or the other, and
* mutually exclusive: nothing can belong simul ...
data. Bart and Krus (1973) use, for example, Boolean analysis to establish a hierarchical order on items that describe socially unaccepted behavior. Janssens (1999) used a method of Boolean analysis to investigate the integration process of minorities into the value system of the dominant culture.
Romme (1995a) introduced Boolean comparative analysis to the management sciences, and applied it in a study of self-organizing processes in management teams (Romme 1995b).
Relations to other areas
Boolean analysis has some relations to other research areas. There is a close connection between Boolean analysis and
knowledge space In mathematical psychology and education theory, a knowledge space is a combinatorial structure used to formulate mathematical models describing the progression of a human learner. Knowledge spaces were introduced in 1985 by Jean-Paul Doignon and ...
s. The theory of knowledge spaces provides a theoretical framework for the formal description of human knowledge. A knowledge domain is in this approach represented by a set ''Q'' of problems. The knowledge of a subject in the domain is then described by the subset of problems from ''Q'' he or she is able to solve. This set is called the ''knowledge state'' of the subject. Because of dependencies between the items (for example, if solving item ''j'' implies solving item ''i'') not all elements of the power set of ''Q'' will, in general, be possible knowledge states. The set of all possible knowledge states is called the ''knowledge structure''. Methods of Boolean analysis can be used to construct a knowledge structure from data (for example, Theuns, 1998 or Schrepp, 1999). The main difference between both research areas is that Boolean analysis concentrates on the extraction of structures from data while knowledge space theory focus on the structural properties of the relation between a knowledge structure and the logical formulas which describe it.
Closely related to knowledge space theory is
formal concept analysis (Ganter and Wille, 1996). Similar to knowledge space theory this approach concentrates on the formal description and visualization of existing dependencies. Formal concept analysis offers very effective ways to construct such dependencies from data, with a focus on if-then expressions ("
implications"). There is even a method, called attribute exploration,
[Ganter, Bernhard and Obiedkov, Sergei (2016) ''Conceptual Exploration''. Springer, ] for extracting all implications from hard-to-access data.
Another related field is
data mining. Data mining deals with the extraction of knowledge from large databases. Several data mining algorithms extract dependencies of the form j → i (called
association rules
Association rule learning is a rule-based machine learning method for discovering interesting relations between variables in large databases. It is intended to identify strong rules discovered in databases using some measures of interestingness.P ...
) from the database.
The main difference between Boolean analysis and the extraction of association rules in data mining is the interpretation of the extracted implications. The goal of a Boolean analysis is to extract implications from the data which are (with the exception of random errors in the response behavior) true for all rows in the data set. For data mining applications it is sufficient to detect implications which fulfill a predefined level of accuracy.
It is, for example in a marketing scenario, of interest to find implications which are true for more than x% of the rows in the data set. An online bookshop may be interested, for example, to search for implications of the form ''If a customer orders book A he also orders book B'' if they are fulfilled by more than 10% of the available customer data.
References
{{reflist
* Flament, C. (1976). L’analyse booleenne de questionnaire. Paris: Mouton.
* Buggenhaut, J., & Degreef, E. (1987). On dichotomization methods in Boolean analysis of questionnaires. In E. E. Roskam & R. Suck (Eds.), Mathematical psychology in progress (pp. 447–453). Amsterdam, NY: North Holland.
* Duquenne, V. (1987). Conceptual implications between attributes and some representation properties for finite lattices. In B. Ganter, R. Wille & K. E. Wolff (Eds.), Beiträge zur Begriffsanalyse: Vorträge der Arbeitstagung Begriffsanalyse, Darmstadt 1986 (pp. 213–239). Mannheim: BI Wissenschafts-Verlag.
* Leeuwe, J. F. J. van (1974). Item tree analysis. Nederlands Tijdschrift voor de Psychologie,29, 475–484.
* Schrepp, M. (1999). On the empirical construction of implications on bi-valued test items. Journal of Mathematical Social Sciences, 38(3), 361–375.
* Theuns, P (1998). Building a knowledge space via Boolean analysis of co-occurrence data. In C. E. Dowling,
F. S. Roberts, and P. Theuns (Eds.), Recent Progress in Mathematical Psychology (pp. 173–194). Hillsdale, NJ: Erlbaum.
* Bart, W. A., & Airasian P. W. (1974). Determination of the ordering among seven Piagetian tasks by an ordering-theoretic method. Journal of Educational Psychology, 66(2), 277–284.
* Gagné, R. M. (1968). Learning hierarchies. Educational Psychology, 6, 1–9.
* Scandura J. M. (1971). Deterministic theorizing in structural learning: Three levels of empiricism. Journal of Structural Learning, 3, 21–53.
* Bart, W. M., & Krus, D. J. (1973). An ordering-theoretic method to determine hierarchies among items. Educational and psychological measurement, 33, 291–300.
* Janssens, R. (1999). A Boolean approach to the measurement of group processes and attitudes. The concept of integration as an example. Mathematical Social Sciences, 38, 275–293.
* Held, T., & Korossy, K. (1998). Data-analysis as heuristic for establishing theoretically founded item structures. Zeitschrift für Psychologie, 206, 169–188.
* Ganter, B., & Wille, R. (1996). Formale Begriffsanalyse: Mathematische Grundlagen. Berlin: Springer.
* Romme, A.G.L. (1995). Boolean comparative analysis of qualitative data. Quality and Quantity, 29, 317-329.
* Romme, A.G.L. (1995). A Self-organizing processes in top management teams: a Boolean comparative approach. Journal of Business Research, 34, 11-34.
* Schrepp, M. (2003). A method for the analysis of hierarchical dependencies between items of a questionnaire. Methods of Psychological Research — Online, 19, 43–79.
Statistical analysis
Logic and statistics