Pathfinder Network
   HOME

TheInfoList



OR:

A pathfinder network is a
psychometric Psychometrics is a field of study within psychology concerned with the theory and technique of measurement. Psychometrics generally refers to specialized fields within psychology and education devoted to testing, measurement, assessment, and ...
scaling method based on
graph theory In mathematics, graph theory is the study of ''graphs'', which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of '' vertices'' (also called ''nodes'' or ''points'') which are conn ...
and used in the study of expertise, knowledge acquisition,
knowledge engineering Knowledge engineering (KE) refers to all technical, scientific and social aspects involved in building, maintaining and using knowledge-based systems. Background Expert systems One of the first examples of an expert system was MYCIN, an appli ...
, scientific citation patterns, information retrieval, and data visualization. Pathfinder networks are potentially applicable to any problem addressed by network theory.


Overview

Several psychometric scaling methods start from proximity data and yield structures revealing the underlying organization of the data.
Data clustering Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). It is a main task of ...
and
multidimensional scaling Multidimensional scaling (MDS) is a means of visualizing the level of similarity of individual cases of a dataset. MDS is used to translate "information about the pairwise 'distances' among a set of n objects or individuals" into a configurati ...
are two such methods. Network scaling represents another method based on
graph theory In mathematics, graph theory is the study of ''graphs'', which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of '' vertices'' (also called ''nodes'' or ''points'') which are conn ...
. Pathfinder networks are derived from proximities for pairs of entities. Proximities can be obtained from similarities, correlations, distances, conditional probabilities, or any other measure of the relationships among entities. The entities are often concepts of some sort, but they can be anything with a pattern of relationships. In the pathfinder network, the entities correspond to the nodes of the generated network, and the links in the network are determined by the patterns of proximities. For example, if the proximities are similarities, links will generally connect nodes of high similarity. The links in the network will be undirected if the proximities are symmetrical for every pair of entities. Symmetrical proximities mean that the order of the entities is not important, so the proximity of ''i'' and ''j'' is the same as the proximity of ''j'' and ''i'' for all pairs ''i,j''. If the proximities are not symmetrical for every pair, the links will be directed.


Example

Here is an example of an undirected pathfinder network derived from average similarity ratings of a group of biology graduate students. The students rated the relatedness of all pairs of the terms shown, and the mean rating for each pair was computed. The network shown is the PFnet(2, ∞).


Algorithm

The pathfinder algorithm uses two parameters. #The ''q'' parameter constrains the number of indirect proximities examined in generating the network. The ''q'' parameter is an integer value between 2 and ''n'' − 1, inclusive where ''n'' is the number of nodes or items. #The ''r'' parameter defines the metric used for computing the distance of paths (cf. the
Minkowski distance The Minkowski distance or Minkowski metric is a metric in a normed vector space which can be considered as a generalization of both the Euclidean distance and the Manhattan distance. It is named after the German mathematician Hermann Minkowski. ...
). The ''r'' parameter is a real number between 1 and ''infinity'', inclusive. A network generated with particular values of ''q'' and ''r'' is called a PFnet(''q'', ''r''). Both of the parameters have the effect of decreasing the number of links in the network as their values are increased. The network with the minimum number of links is obtained when ''q'' = ''n'' − 1 and ''r'' = ∞, i.e., PFnet(''n'' − 1, ∞). With ordinal-scale data (see level of measurement), the r-parameter should be infinity because the same PFnet would result from any positive
monotonic transformation In mathematics, a monotonic function (or monotone function) is a function (mathematics), function between List of order structures in mathematics, ordered sets that preserves or reverses the given order relation, order. This concept first aro ...
of the proximity data. Other values of ''r'' require data measured on a ratio scale. The ''q'' parameter can be varied to yield the desired number of links in the network. Essentially, pathfinder networks preserve the shortest possible paths given the data so links are eliminated when they are not on shortest paths. The PFnet(''n'' − 1, ∞) will be the
minimum spanning tree A minimum spanning tree (MST) or minimum weight spanning tree is a subset of the edges of a connected, edge-weighted undirected graph that connects all the vertices together, without any cycles and with the minimum possible total edge weight. T ...
for the links defined by the proximity data if a unique minimum spanning tree exists. In general, the PFnet(''n'' − 1, ∞) includes all of the links in any minimum spanning tree.


References

Further information on pathfinder networks and several examples of the application of PFnets to a variety of problems can be found in: * Schvaneveldt, R. W. (Ed.) (1990) ''Pathfinder Associative Networks: Studies in Knowledge Organization.'' Norwood, NJ: Ablex. The book is out of print. A zipped copy of pdf chapters can be downloaded
zip
A shorter article summarizing pathfinder networks: * Schvaneveldt, R. W., Durso, F. T., & Dearholt, D. W. (1989). Network structures in proximity data. In G. Bower (Ed.), The ''psychology of learning and motivation: Advances in research and theory'', Vol. 24 (pp. 249–284). New York: Academic Press
pdf
Three papers describing fast implementations of pathfinder networks: * * * {{cite journal , doi = 10.1002/asi.20904 , last1 = Quirin , first1 = A. , last2 = Cordón , first2 = O. , last3 = Guerrero-Bote , first3 = V. P. , last4 = Vargas-Quesada , first4 = B. , last5 = Moya-Anegón , first5 = F. , year = 2008 , title = A Quick MST-based Algorithm to Obtain Pathfinder Networks , journal = Journal of the American Society for Information Science and Technology , volume = 59 , issue = 12, pages = 1912–1924 , citeseerx = 10.1.1.331.1548 (The two variants by Quirin et al. are significantly faster. While the former can be applied with ''q'' = 2 or ''q'' = ''n'' − 1 and any value for ''r'', the latter can only be applied in cases where ''q'' = ''n'' − 1 and ''r'' = ∞.)


External links




Pathfinder Software Download Site

Implementation of the original, Binary, Fast and MST variants of the algorithm in C


Psychometrics