PVLV
Encyclopedia
The primary value learned value (PVLV) model
is a possible explanation for the reward-predictive firing properties of dopamine
(DA) neurons. It simulates behavioral and neural data on Pavlovian conditioning
and the midbrain dopaminergic neurons that fire in proportion to unexpected rewards. It is an alternative to the temporal-differences (TD) algorithm
.
It is used as part of Leabra
.
Computer simulation
A computer simulation, a computer model, or a computational model is a computer program, or network of computers, that attempts to simulate an abstract model of a particular system...
is a possible explanation for the reward-predictive firing properties of dopamine
Dopamine
Dopamine is a catecholamine neurotransmitter present in a wide variety of animals, including both vertebrates and invertebrates. In the brain, this substituted phenethylamine functions as a neurotransmitter, activating the five known types of dopamine receptors—D1, D2, D3, D4, and D5—and their...
(DA) neurons. It simulates behavioral and neural data on Pavlovian conditioning
Classical conditioning
Classical conditioning is a form of conditioning that was first demonstrated by Ivan Pavlov...
and the midbrain dopaminergic neurons that fire in proportion to unexpected rewards. It is an alternative to the temporal-differences (TD) algorithm
Temporal difference learning
Temporal difference learning is a prediction method. It has been mostly used for solving the reinforcement learning problem. "TD learning is a combination of Monte Carlo ideas and dynamic programming ideas." TD resembles a Monte Carlo method because it learns by sampling the environment according...
.
It is used as part of Leabra
Leabra
Leabra stands for "Local, Error-driven and Associative, Biologically Realistic Algorithm". It is a model of learning which is a balance between Hebbian and error-driven learning with other network-derived characteristics. This model is used to mathematically predict outcomes based on inputs and...
.