Value Based Decision Making II Flashcards

Question

Muscimol devaluation task = inactivation (temporary lesion) of area 13

Answer 1

Behaviour is impaired if area 13 is lesioned during devaluation = deficit in updating value. Strong impairment: behaviour does not shift if the lesion is made while the animal is eating the berries.

Answer 2

Behaviour is impaired if area 11 is lesioned after devaluation = deficit in goal selection. In the test phase, the animal cannot easily use value to switch behaviour.

Answer 3

Current worth of the available options - what we measure with a common currency - domain general

Answer 4

Representations of value important for learning and value updating

Answer 5

Needed for choices based on value comparison. Transforms value representations into a common currency, which makes it possible to compare values.

Answer 6

Chance that the chosen item is obtained or that the goal of the action is realized - vlPFC

Answer 7

Estimation of the availability of the different options: what is the chance that each option can be obtained?

Answer 8

Choose what we want - not always choosing the best option = might want to explore the world. Saw a dissociation between building a valuation and using that valuation for action. Basal ganglia and striatum: involved in bias one way or another; they compute the value of actions and bias choices by changing specific components of value.

Answer 9

Direct - activation promotes action by selecting the intended motor program; more active neurons = promotes action = accelerator. Indirect - activation suppresses competing motor programs, suppressing what is being encoded = brake/inhibit/repress.

Answer 10

Similar to bandit tasks. The value of an option depends on the reward history. One port is not rewarded; the other port is rewarded 75% of the time. Contingencies change during the task, so mice have to evaluate whether the rewarded port location changed or whether it is just an unrewarded trial. Integrate over history: which port is best, i.e. rewarded most of the time.
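The task structure on this card can be sketched as a small simulation. This is only an illustration: the fixed block length, the random chooser, and the function name `simulate_block_task` are assumptions, since the card only gives the 0% / 75% reward contingencies and says that they change during the task.

```python
import random

def simulate_block_task(n_trials=200, block_len=50, p_reward=0.75, seed=0):
    """Simulate a two-port task in which the rewarded port swaps between blocks.

    Assumptions (not from the card): fixed block length and a random chooser.
    Returns a list of (choice, rewarded_port, reward) tuples.
    """
    rng = random.Random(seed)
    rewarded_port = "left"
    trials = []
    for t in range(n_trials):
        # Swap the rewarded port at each block boundary.
        if t > 0 and t % block_len == 0:
            rewarded_port = "right" if rewarded_port == "left" else "left"
        choice = rng.choice(["left", "right"])
        # Only the currently rewarded port pays out, and only 75% of the time.
        reward = int(choice == rewarded_port and rng.random() < p_reward)
        trials.append((choice, rewarded_port, reward))
    return trials
```

An unrewarded trial at the previously good port is ambiguous in this simulation too: it may be one of the 25% unrewarded trials or the first sign of a block change.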

Answer 11

Compute the reward history that leads the mouse to get reward. Mice persevere at a port following rewarded trials. They tend to switch preference after 2 unrewarded trials. They use reward history to assess whether an unrewarded trial is one of the 25% unrewarded trials or is due to a change in the block.
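The behavioural pattern described here (stay after reward, switch after about 2 unrewarded trials) can be written as a simple rule. This is a hypothetical heuristic for illustration, not the model used in the study; the function name `next_choice` and the starting port are assumptions.

```python
def next_choice(choices, rewards, n_unrewarded_to_switch=2):
    """Pick the next port from the trial history.

    choices: list of "left"/"right"; rewards: list of 0/1 (same length).
    Stays at the current port unless the last `n_unrewarded_to_switch`
    trials at that port were all unrewarded.
    """
    if not choices:
        return "left"  # arbitrary starting port (assumption)
    last = choices[-1]
    # Count consecutive unrewarded trials at the current port, most recent first.
    streak = 0
    for c, r in zip(reversed(choices), reversed(rewards)):
        if c != last or r == 1:
            break
        streak += 1
    if streak >= n_unrewarded_to_switch:
        return "right" if last == "left" else "left"
    return last
```

A single unrewarded trial is treated as one of the expected 25% misses; only a run of misses is taken as evidence that the block changed.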

Answer 12

Value is not given by the stimulus but has to be inferred from trial history. Build a computational model = how much reward in the past influences the current estimate of value; estimate the relative contribution of previous rewarded and unrewarded trials as a function of their recency. This approach gives an action value for each trial = similar to the decision variable in perceptual decision making. Assess relative value: mice don't always pick the best option, there is a graded shift in variability. Base action on reward history = which port is most likely rewarded.
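A minimal sketch of such a model, assuming exponentially decaying weights on past trials and a logistic choice rule. The weights `w_rewarded` and `w_unrewarded`, the time constant `tau`, and the slope `beta` are illustrative values, not fitted parameters from the study.

```python
import math

def relative_action_value(choices, rewards,
                          w_rewarded=1.0, w_unrewarded=-0.5, tau=3.0):
    """Relative action value (left minus right) before the next trial.

    choices: +1 for left, -1 for right; rewards: 0/1 (same length).
    Each past trial contributes according to its outcome and its recency.
    """
    value = 0.0
    n = len(choices)
    for i, (c, r) in enumerate(zip(choices, rewards)):
        lag = n - i  # 1 = most recent trial
        weight = w_rewarded if r else w_unrewarded
        # Rewarded trials pull value towards the chosen side,
        # unrewarded trials push it away; older trials count less.
        value += c * weight * math.exp(-(lag - 1) / tau)
    return value

def p_left(relative_value, beta=1.0):
    """Graded (logistic) mapping from relative action value to p(choose left)."""
    return 1.0 / (1.0 + math.exp(-beta * relative_value))
```

Because the choice rule is graded, the modelled animal does not always pick the higher-valued port, which matches the point on the card about not always picking the best one.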

Answer 13

Bias - the animal might be biased. Exploration - the task is probabilistic, so the animal needs to explore to see where the reward is. Variability - the task is variable. Laziness - hard to know whether the animal is doing the task and trying; at the extremes, when the stimulus is easy, consistently picking one side over the other is a good sign it is actually doing the task.

Answer 14

Microstimulation in MT leverages anatomy - add value in one part and see if behaviour changes - only possible because the neurons are organized that way. Not the same in D1/D2 = neurons are intermingled, so one type cannot be stimulated specifically.

Answer 15

Insert a light-sensitive ion channel: the gene coding for the protein can be inserted into a specified subset of cells. Shine light = activate or inhibit those neurons. Allows targeted manipulation of a genetically defined subset of cells with millisecond temporal precision - very precise.

Answer 16

With light = can activate one specific cell type even though the types are not anatomically segregated. Can deliver laser pulses to 1 hemisphere only. Activate each of the pathways independently to understand their contribution to action selection = see how it affects behaviour.

Answer 17

Right striatum (DMS) - D1 direct = increases left action value, D2 indirect = inhibits left action value. Left striatum (DMS) - D1 direct = increases right action value, D2 indirect = inhibits right action value. Define relative AV = left AV - right AV. Positive relative AV means left AV is higher than right AV - psychometric curves will therefore have p(left) increasing with higher relative action value.
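The four cases on this card reduce to a sign rule on relative AV (left AV - right AV). The sketch below encodes that rule only; the function name `stimulation_effect_sign` is hypothetical and the return values are signs, not magnitudes.

```python
def stimulation_effect_sign(hemisphere, cell_type):
    """Predicted sign of the change in relative AV (left AV - right AV).

    hemisphere: "left" or "right" DMS; cell_type: "D1" (direct) or "D2" (indirect).
    Returns +1 if stimulation should raise relative AV, -1 if it should lower it.
    """
    if hemisphere not in ("left", "right") or cell_type not in ("D1", "D2"):
        raise ValueError("unknown hemisphere or cell type")
    # Each hemisphere acts on the contralateral action value.
    target = "left" if hemisphere == "right" else "right"
    # D1 (direct) increases the target action value; D2 (indirect) decreases it.
    delta_target = 1 if cell_type == "D1" else -1
    # Raising left AV raises (left - right); raising right AV lowers it.
    return delta_target if target == "left" else -delta_target
```

For example, `stimulation_effect_sign("right", "D2")` gives -1: left action value is inhibited, so relative AV falls and choices shift to the right (ipsilateral) side.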

Answer 18

Optical stimulation is delivered at a specific time (when the mouse enters the center port and the lights are on) on 6% of trials. It is done on a subset of trials to activate a specific subpopulation - D1 or D2 in one hemisphere. Can manipulate each of the 4 pathways independently to test this model of action selection. Biases behaviour in a crude sense, not super precise.

Answer 19

Direct pathway = biases towards the contralateral side - more towards the right

Answer 20

Indirect pathway = biases towards the ipsilateral side - more towards the left

Answer 21

Do the same in a sensory decision task. Inducing an increase in the perceived stimulus in one direction means a subjective effect that is negative in the other direction. The manipulation provides a negative signal, so the value from the environment now has to be positive for the choice to be perceived as 50/50. Something is happening with the manipulation that provides negative information.

Answer 22

Bias the pathways and look at how behaviour changes. Compute relative action value = left AV - right AV from reward history and plot the probability of choosing the left side. Inhibiting left action value = shift to the right; the manipulation adds a negative action value and decreases relative action value. Stimulating D2 neurons in right DMS decreases left action value and therefore the subjective relative action value. With stimulation = need a positive relative AV to observe p(left choice) = 0.5. More precise control with optogenetics.
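The shift of the psychometric curve can be sketched by adding a stimulation term to the relative action value inside a logistic choice function. The logistic form, the slope `beta`, and the name `stim_shift` are assumptions for illustration.

```python
import math

def p_left(relative_av, beta=1.0, stim_shift=0.0):
    """p(choose left) given relative AV and an additive stimulation effect.

    stim_shift < 0 models stimulation that lowers left action value
    (e.g. D2 neurons in right DMS).
    """
    return 1.0 / (1.0 + math.exp(-beta * (relative_av + stim_shift)))

def indifference_point(stim_shift):
    """Relative AV from reward history at which p(left) = 0.5."""
    return -stim_shift
```

With `stim_shift = -1.0`, `p_left(0.0, stim_shift=-1.0)` falls below 0.5 and the indifference point moves to +1.0: the reward history now has to favour left before the animal chooses both sides equally often.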

Answer 23

Bias increases with more stimulation = more powerful. The amount of bias is correlated with the intensity of stimulation. Faster stimulation frequency leads to a larger bias in action value. Graded effect as the strength of stimulation increases: the more it reduces left action value - now only if rewarded every time. Strong evidence the model is correct = the manipulation quantitatively affects behaviour.

Answer 24

Fit the model and assign a subjective value to the stimulation = put a number on it. Can be compared to equivalent added motion coherence. The model can provide an estimate of the equivalent change in action value for a given stimulation level. A graded effect that can be controlled experimentally strengthens the evidence for the model = quantified as an equivalent amount of subjective value. Acts downstream of value building = value is built from reward history, then pick which to act on - can bias behaviour in a specific and precise way by activating one of the output pathways, i.e. picking which to act on given the representation.
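One way to "put a number on it", assuming a logistic choice function with a known slope `beta`, is to invert that function: ask what additive change in relative action value would explain the choice probability observed under stimulation. This is a simplified sketch, not the fitting procedure from the study; `equivalent_action_value` is a hypothetical helper.

```python
import math

def equivalent_action_value(p_left_stim, relative_av, beta):
    """Additive shift in relative AV that explains p(left) under stimulation.

    Assumes p(left) = 1 / (1 + exp(-beta * (relative_av + shift))).
    p_left_stim must lie strictly between 0 and 1.
    """
    if not 0.0 < p_left_stim < 1.0:
        raise ValueError("p_left_stim must be strictly between 0 and 1")
    logit = math.log(p_left_stim / (1.0 - p_left_stim))
    return logit / beta - relative_av
```

Repeating this at several stimulation levels would give a dose-response relation between stimulation strength and equivalent action value, which is the graded effect the card refers to.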

(48 cards)