Uncertain inference was first described by C. J. van Rijsbergen as a way to formally define a query and document relationship in

Information retrieval Information retrieval (IR) in computing and information science is the task of identifying and retrieving information system resources that are relevant to an Information needs, information need. The information need can be specified in the form ...

. This formalization is a

logical implication Logic is the study of correct reasoning. It includes both formal and informal logic. Formal logic is the study of deductively valid inferences or logical truths. It examines how conclusions follow from premises based on the structure of ...

with an attached measure of uncertainty.

Definitions

Rijsbergen proposes that the measure of

uncertainty Uncertainty or incertitude refers to situations involving imperfect or unknown information. It applies to predictions of future events, to physical measurements that are already made, or to the unknown, and is particularly relevant for decision ...

of a document ''d'' to a query ''q'' be the probability of its logical implication, i.e.: :

P(d \to q)

A user's query can be interpreted as a set of assertions about the desired document. It is the system's task to

infer Inferences are steps in logical reasoning, moving from premises to logical consequences; etymologically, the word '' infer'' means to "carry forward". Inference is theoretically traditionally divided into deduction and induction, a distinctio ...

, given a particular document, if the query assertions are true. If they are, the document is retrieved. In many cases the contents of documents are not sufficient to assert the queries. A

knowledge base In computer science, a knowledge base (KB) is a set of sentences, each sentence given in a knowledge representation language, with interfaces to tell new sentences and to ask questions about what is known, where either of these interfaces migh ...

of facts and rules is needed, but some of them may be uncertain because there may be a probability associated to using them for inference. Therefore, we can also refer to this as ''plausible inference''. The

plausibility In sociology and especially the sociological study of religion, plausibility structures are the sociocultural contexts for systems of meaning within which these meanings make sense, or are made plausible. Beliefs and meanings held by individuals a ...

of an inference

d \to q

is a function of the plausibility of each query assertion. Rather than retrieving a document that exactly matches the query we should rank the documents based on their plausibility in regards to that query. Since ''d'' and ''q'' are both generated by users, they are error prone; thus

d \to q

is uncertain. This will affect the plausibility of a given query. By doing this it accomplishes two things: * Separate the processes of revising probabilities from the logic * Separate the treatment of relevance from the treatment of requests

Multimedia Multimedia is a form of communication that uses a combination of different content forms, such as Text (literary theory), writing, Sound, audio, images, animations, or video, into a single presentation. T ...

documents, like images or videos, have different inference properties for each datatype. They are also different from text document properties. The framework of plausible inference allows us to measure and combine the probabilities coming from these different properties. Uncertain inference generalizes the notions of autoepistemic logic, where truth values are either known or unknown, and when known, they are true or false.

Example

If we have a query of the form: :

q = A \wedge B \wedge C

where A, B and C are query assertions, then for a document D we want the probability: :

P (D \to (A \wedge B \wedge C))

If we transform this into the

conditional probability In probability theory, conditional probability is a measure of the probability of an Event (probability theory), event occurring, given that another event (by assumption, presumption, assertion or evidence) is already known to have occurred. This ...

P ((A \wedge B \wedge C) ,  D)

and if the query assertions are independent we can calculate the overall probability of the implication as the product of the individual assertions probabilities.

Further work

Croft and Krovetz applied uncertain inference to an information retrieval system for office documents they called ''OFFICER''. In office documents the independence assumption is valid since the query will focus on their individual attributes. Besides analysing the content of documents one can also query about the author, size, topic or collection for example. They devised methods to compare document and query attributes, infer their plausibility and combine it into an overall rating for each document. Besides that uncertainty of document and query contents also had to be addressed. Probabilistic logic networks is a system for performing uncertain inference; crisp true/false truth values are replaced not only by a probability, but also by a confidence level, indicating the certitude of the probability.

Markov logic network A Markov logic network (MLN) is a probabilistic logic which applies the ideas of a Markov network to first-order logic, defining probability distributions on possible worlds on any given domain. History In 2002, Ben Taskar, Pieter Abbeel and ...

s allow uncertain inference to be performed; uncertainties are computed using the maximum entropy principle, in analogy to the way that

Markov chain In probability theory and statistics, a Markov chain or Markov process is a stochastic process describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event. Informally ...

s describe the uncertainty of

finite-state machine A finite-state machine (FSM) or finite-state automaton (FSA, plural: ''automata''), finite automaton, or simply a state machine, is a mathematical model of computation. It is an abstract machine that can be in exactly one of a finite number o ...

References

{{reflist Fuzzy logic Information retrieval techniques Inference

Definitions

Example

Further work

See also

References