Which brain areas are involved in computing value of options and states of world
How can we study value across domains
Assign values to different states of world with goal of maximizing value of things we do
Pfc = where these value computations happen
Striatum = arbitration between value and basing behaviour towards choosing a type of value over another
perceptual discrimination task
Changing difficulty of stimulus to understand transformation from sensory perception to decision formation and evaluation
#of dots, coherence, gratings etc
Then identify and characterize —> brain areas, neurons over stages (cognitive sensory)
Free choice task
Decision rely on subjective value between options
Internally generated
How do we construct such subjective value representations and how do we arbitrate between them
Assign value -can we put on common axis
Which brain circuits underly tehse computations
describe Iowa gambling task
4 decks
A-b = 100 reward p=1, 250 loss p=0.5 (more risky, lose money on avg in long run)
C-d = 50 reward p=1, 50 loss p=0.5 (safer option, win money in long run )
what does Iowa gambling task measure
ability to integrate rewards and losses in a probabalisitic setting
measuring decision differences across subjects and patients = compare Iowa and Wisconsin
Lesion vmpfc
No impairment in. Control tasks and in Wisconsin task (match to rule- changes and have to figure out rule change, error means rule switched)
In Iowa task = challenge is about estimating the average long term reward in a probabilistic setting rather than the rule that provides a guaranteed reward
Obvious impairments in daily life
Describe what happens when vmpfc lesion in Iowa task = control
Start by choosing risk decks before reverting to choosing safe decks bc have positive avg reward in long run
Describe what happens when vmpfc lesion in Iowa task = lesion group
Did not switch behaviour
Choose high risk more often and cannot switch
Domain of info diff = integrate probabilities over time to be able to determine that one set of decks is better than other
Describe what happens when vmpfc lesion in Iowa task= conclusions
Simailr to patients in Wisconsin sortinggame where patients had an inability to change their behaviour with new info
But here domain diff = estimating probabilistic rewards rather than rules
what do we have to consider = patient cohort to investigate localization and laterality
Cohort of patients wirh lesions across pfc areas
Group patients by overlap of lesionned areas with regions of interest - some overlap, some large, soma small
Also study effect of laterality = bi vs uni lateral
Important to consider heterogeneity of the origin of the lesion across the cohort= this, blunt force trauma, stroke
describe results patient cohort to investigate localization and laterality= all patients
Typical subjects start with risky decks and switch to safer ones
Frontal patients do switch but then return to risky option
describe results patient cohort to investigate localization and laterality= left vs right control lesions
Score deficits appear regardless of lesionned hemisphere
Same results for right vs left frontal hemispheres = not lateralzied
describe results patient cohort to investigate localization and laterality= localizaiton of lesion
Dorsolateral, venteromedial, extended (both areas)
No significant effect of prefrontal area lesionned
describe results patient cohort to investigate localization and laterality= conslusions
Vmpfc involved but not unique contribution = if lesion whole Pfc will still have same impairments in value based decision making
broad activation across pfc in task = compare what
Compare relative activation between Iowa gambling task and a control task where the outcome of each deck is explicitly given
= task structure the same but gambling and payoff estimation under uncertainty component is removed
= no building probabailsitic represnettaion = remove subjective valuation
broad activation across pfc in task = results
Monitoring approach
Broad activation of Pfc and other areas including ventral striatum (downstream action selection)
What guides decisions
perception and valuation
- identify something more specific to vmpfc
What types of value representations are being done where
stages of value based decision making = gen
Represent possible states of world
Assign value to each state - how to compare across states
Select action
Evaluate outcomes
Update valuation of options
stages of value based decision making = stage 1
Representation of states of world
Set of feasible actions, internal and external states
stages of value based decision making = stage 2
Valuation
What is value of each action given internal and external states - like if thirsty, water bottle has more value
stages of value based decision making = stage 3
Action selection
Choose actions based on valuations
Picks best option to pick another option to try out diff things
stages of value based decision making = stage 4
Outcome evolution = how desirable are the outcomes and states that followed the action - confidence and meta cognition, is outcome good
Then —> learning = update representation, valuation and action selection processes = adapt
pavlovian valuation system
Associated with something good, direct reward linked to action we did
Appetitive= eat food, food reward
Avoidance= cross street when see dangerous person, avoid potential harm
Habitual valuation system
Do bc used to doing it, learned behaviour from many actions, habit, not direct rewards
Appetitive = coffee in morning, stimulant reward
Avoidance = drive usual route to work, avoid traffic