The primary value learned value (PVLV)
model
A model is an informative representation of an object, person or system. The term originally denoted the plans of a building in late 16th-century English, and derived via French and Italian ultimately from Latin ''modulus'', a measure.
Models c ...
is a possible explanation for the reward-predictive firing properties of
dopamine (DA) neurons. It simulates behavioral and neural data on
Pavlovian conditioning
Classical conditioning (also known as Pavlovian or respondent conditioning) is a behavioral procedure in which a biologically potent stimulus (e.g. food) is paired with a previously neutral stimulus (e.g. a triangle). It also refers to the learni ...
and the
midbrain
The midbrain or mesencephalon is the forward-most portion of the brainstem and is associated with vision, hearing, motor control, sleep and wakefulness, arousal (alertness), and temperature regulation. The name comes from the Greek ''mesos'', " ...
dopaminergic neurons that fire in proportion to unexpected rewards. It is an alternative to the
temporal-differences (TD) algorithm.
It is used as part of
Leabra Leabra stands for local, error-driven and associative, biologically realistic algorithm. It is a model of learning which is a balance between Hebbian and error-driven learning with other network-derived characteristics. This model is used to mathe ...
.
References
Computational neuroscience
Machine learning algorithms
{{neuroscience-stub