PVLV - AbsoluteAstronomy.com

The primary value learned value (PVLV) model is a possible explanation for the reward-predictive firing properties of dopamine (DA) neurons. It simulates behavioral and neural data on Pavlovian conditioning and the midbrain dopaminergic neurons that fire in proportion to unexpected rewards. It is an alternative to the temporal-differences (TD) algorithm.
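To make "firing in proportion to unexpected rewards" concrete, the following minimal sketch computes the reward prediction error of the standard TD(0) rule, the algorithm PVLV is positioned as an alternative to. It is not an implementation of PVLV, whose mechanism is not described here. The three-step trial layout (cue, delay, reward), the function names, and the learning rate `alpha` and discount `gamma` are all illustrative assumptions.

```python
def td_errors(values, rewards, gamma=1.0):
    """Return the TD(0) prediction error for each time step of one trial.

    values[t]  -- current value estimate for the state at step t
    rewards[t] -- reward received on leaving the state at step t
    The value after the final step is taken to be 0 (end of trial).
    """
    deltas = []
    for t, r in enumerate(rewards):
        next_v = values[t + 1] if t + 1 < len(values) else 0.0
        deltas.append(r + gamma * next_v - values[t])
    return deltas


def run_trials(n_trials, rewards, alpha=0.1, gamma=1.0):
    """Repeat the same trial, updating values by alpha * error after each one.

    Returns the final value estimates and the per-trial list of errors.
    """
    values = [0.0] * len(rewards)
    history = []
    for _ in range(n_trials):
        deltas = td_errors(values, rewards, gamma)
        history.append(deltas)
        for t, d in enumerate(deltas):
            values[t] += alpha * d
    return values, history


# Hypothetical Pavlovian trial: cue, delay, then a reward of 1.
values, history = run_trials(500, rewards=[0.0, 0.0, 1.0])
first_trial_error_at_reward = history[0][2]
last_trial_error_at_reward = history[-1][2]
```

On the first trial the reward is entirely unexpected, so the error at the reward step equals the reward itself. After many pairings the reward is fully predicted and the error at that step falls toward zero, which is the qualitative pattern both TD and PVLV aim to account for.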

It is used as part of Leabra.

The source of this article is Wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.