Learning Flashcards

(61 cards)

1
Q

Representation

A

How we represent the world so we can make decisions about it
Sensory representations
Uncertainty estimation
Bayesian integration

2
Q

valuation

A

Value representations in PFC
Availability/desirability
Assign value, encode different aspects of value

3
Q

action selection

A

Striatum = bias process
Action selection in the striatum

4
Q

outcome evaluation

A

Confidence/metacognition

5
Q

learning

A

Key part = feedback changes preferences
If incorrect = something is wrong = estimate how much you should update your beliefs, representation, valuation, or action selection to get better

6
Q

mice adapt to changes in reward contingencies

A

Adaptation in learning
Use reward history to assess whether an unrewarded trial is unrewarded because the action is wrong or because the block changed
Invert the contingencies of the task
Lesions = like the gambling task, impairment in the ability to learn to adapt to the environment

7
Q

what are the ingredients to build an intelligent agent

A

Computational approaches = control everything coming in and out of the system and the learning process
Need all the various components to learn/be flexible in the environment
What part of the learning process is important to learn a certain behaviour

8
Q

computational approach to cognition

A

Agent interacts with the world
Goal = maximize reward
Aspects of the environment are represented in different brain areas
How do observations, rewards, and actions change the internal model of an agent
What are the different types of learning in situations where you are fed observations vs when you are really interacting with the world and receiving reward

9
Q

what is evidence that a behaviour is learned

A

Action, conditioning, habit
Learning of new skills
Improving accuracy = signature of learning
Is the behaviour innate or learned = motor control

10
Q

nature vs nurture

A

A giraffe baby can walk immediately
A human baby can't = learns over time
Complexity of a behaviour is not an indication of learned vs innate behaviour = lots of structure exists in the networks that can enable complex innate behaviours

11
Q

implicit vs explicit

A

Learning can be =
Explicit = taught, like reading a book about physics
Implicit = Jenga = acquire understanding at some point, instinctive = an intuitive understanding of physics (gravity)

12
Q

motor skill learning and memory

A

Recall = HM
Acquired a specific motor skill but doesn't remember acquiring it = improved over days, based on implicit learning, and doesn't transfer to the other hand

13
Q

learning to see and to navigate

A

Across many domains = many brain processes involved, but we can think about some key concepts that are needed for learning
Is learning in a given cortex domain-general or domain-specific (sensory/motor learning)

14
Q

learning to locate a place

A

First trial = rat in pool, swimming
If swimming = doesn't see the hidden platform, so it has to explore the space and encounter the platform
After 10 trials = has built a map of the space and can travel directly to the platform = has learned where it is
= learns a cognitive map of the space

15
Q

learning to discriminate

A

Have to compare = butterflies
If you do the task over and over = you will get better

16
Q

what is an essential component of learning

A

Memory
Repetition = test learning
Feedback = internal or external
Adaptation

17
Q

name different types of tasks

A

2 main goals = classification or regression
We do them every day

18
Q

describe classification

A

Each data point has a label
Goal of the algorithm = infer the label of new data points
Is it A or B?
Will it snow tomorrow?

19
Q

describe regression

A

Each data point has a value
Goal of the algorithm = infer the value of new data points
More continuous prediction of value
What will the temperature be at 12pm tomorrow?
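The classification and regression cards above can be sketched with the same toy nearest-neighbour predictor; the data points and temperatures below are made up for illustration.

```python
# Minimal sketch: one nearest-neighbour idea used both for
# classification (predict a label) and regression (predict a value).
# All data are made-up illustrations.

def nearest(x, data):
    # Return the training point whose input is closest to x.
    return min(data, key=lambda point: abs(point[0] - x))

# Classification: each data point has a label ("snow" / "no snow").
labelled = [(-5, "snow"), (-2, "snow"), (3, "no snow"), (8, "no snow")]
print(nearest(-4, labelled)[1])   # predicted label

# Regression: each data point has a continuous value (temperature).
valued = [(9, 12.0), (12, 15.5), (15, 17.0), (18, 14.5)]
print(nearest(13, valued)[1])     # predicted value
```

The only difference is the output type: a discrete label for classification, a continuous value for regression.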

20
Q

types of learning algorithms

A

Different types of learning styles depending on the feedback
Distinguished by how the data is handled and what the signal for learning is = how is the info you have about the world processed
How can they help us understand learning in the brain = what is the type of feedback
What type of learning happens in different brain areas at different stages

21
Q

name the types of learning algorithms

A

Supervised learning
Unsupervised learning
Reinforcement learning

22
Q

describe learning in artificial networks

A

Control the data, the learning rules
Can observe the activity of all the units in the network
Can define the task
= build everything, complete control of the learning process and the capabilities of the system
= can help us think about the brain

23
Q

describe learning in biological networks

A

Have some control over the data
Don't control the learning rules
Can observe the activity of only a very small fraction of the units in the network = some of the neural activity only
Have some control over the task
Way harder

24
Q

architecture of convolutional neural network

A

Simple feature maps -> object representations
Hierarchical representation
Train on data and learn

25
Q

what is supervised learning

A

Each data point is associated with a label
Goal = learn features of the data that predict the label
26
Q

supervised learning = training

A

Each image in the training set is associated with a category = cat or dog
Algorithm learns to associate each image in the dataset with the correct label
Has to figure out the features
27
Q

supervised learning = test

A

If the algorithm generalizes well = it will also associate new, unseen images of dogs and cats with the correct label
Tell it whether it is right or wrong so it can update the model = build up from the sensory pathway
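The train/test loop above can be sketched with a toy supervised learner, assuming made-up 2-D feature vectors standing in for images: training learns one mean feature vector (centroid) per label, and the test checks generalization on a point not seen in training.

```python
# A toy supervised learner on made-up 2-D "image" features.

def train(examples):
    # examples: list of ((x, y), label); learn one centroid per label.
    sums, counts = {}, {}
    for (x, y), label in examples:
        sx, sy = sums.get(label, (0.0, 0.0))
        sums[label] = (sx + x, sy + y)
        counts[label] = counts.get(label, 0) + 1
    return {label: (sx / counts[label], sy / counts[label])
            for label, (sx, sy) in sums.items()}

def predict(model, point):
    # Test phase: assign the label of the nearest centroid.
    x, y = point
    return min(model,
               key=lambda l: (model[l][0] - x) ** 2 + (model[l][1] - y) ** 2)

training_set = [((1, 1), "cat"), ((2, 1), "cat"),
                ((8, 9), "dog"), ((9, 8), "dog")]
model = train(training_set)
print(predict(model, (1.5, 2)))  # unseen point near the "cat" cluster
```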
28
Q

credit assignment problem

A

How do we assign credit to each neuron for its contribution to the output of the network
How do we change connectivity in the network to improve the accuracy of the output = complicated
How do we assign credit for some change or part of the system to the output
Train = tell it it is wrong = change neural activity to output the correct identity
29
Q

what is the algorithmic rule to update weights in a network

A

Back propagation
Used in machine learning
Governs how weights are changed given the output of the network and the activity of the units
Formula to nudge the weights towards giving the correct output
Compute the update relative to how strong the error is and how influential a given neuron is in the network
Only possible in artificial systems
30
Q

describe back propagation generally

A

Forward pass of activity = see something and do this, leads to output
Backward pass of errors = if there are errors, look back into the system and nudge it towards the correct answer
31
Q

forward pass

A

Compute the output of the network given the input
32
Q

backward pass

A

Update the weights in the network to reduce the error
Structure of the network
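The forward/backward idea above can be sketched on a single linear unit (the delta rule, the one-layer special case of back propagation); the inputs, target, and learning rate are made-up values.

```python
# One linear unit trained by the delta rule: each weight is nudged in
# proportion to the error and to how influential it was (its input).
# Real backprop chains this update through many layers.

w = [0.0, 0.0]                  # weights to learn
inputs, target = [1.0, 2.0], 1.0
lr = 0.1                        # learning rate

for step in range(50):
    # Forward pass: compute the output given the input.
    output = sum(wi * xi for wi, xi in zip(w, inputs))
    # Backward pass: error signal, then nudge each weight.
    error = output - target
    w = [wi - lr * error * xi for wi, xi in zip(w, inputs)]

print(f"final error: {error:.6f}")  # shrinks towards 0 over steps
```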
33
Q

is it easy to measure synaptic changes in biological networks?

A

NO, very hard
Can only measure pairwise changes across a few neurons
Across a few neurons = have potentiation and depression
Weights change in a similar way to an artificial network
More supervised approach
Output = update the weights to reach the output
34
Q

measure connectivity between neurons

A

Measuring the connectivity change between 2 neurons = have to measure the activity of both neurons simultaneously, and usually intracellularly (membrane potential rather than spikes)
35
Q

what can cause synaptic changes

A

Can be due to numerous factors = distribution of receptors, number of synapses
All of these are difficult to measure without access to the membrane potential
36
Q

learning a convolutional neural network

A

Define the architecture of the network = number of layers, feature maps, their sizes, etc.
Use training data and back propagation to learn the weights
Test accuracy on a test set of data not used during training
Thinking about these supervised approaches can help us understand how we could build these representations
In humans = not very similar to how you build your own visual representation of objects
37
Q

does supervised learning apply to classification or regression

A

Applies to both
Independent of the type of task
Apply the same principle in both cases
38
Q

describe ex = dataset of a botanist

A

Contains random samples of 3 species of flowers
For each species = the dataset contains 50 observations of sepal length, sepal width, petal length and petal width
How do you classify things when the data doesn't have a label = you only have where the data lies in space
Once you look in more dimensions = you see a gradient
39
Q

describe unsupervised learning

A

Data points are unlabelled
Goal = find structure in the data
Usually done by finding statistical regularities indicative of an underlying structure
Points that are close together in the space of parameters should belong to the same category
Difficulty = many definitions of "close together"
As the data points are not labelled = this definition of distance is central to the results of the algorithm
Clustering challenge = measure of distance
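The clustering idea above can be sketched with k-means (k = 2) on made-up 1-D data: there are no labels, and the only learning signal is the distance measure itself.

```python
# Minimal k-means sketch on made-up, unlabelled 1-D data.

data = [1.0, 1.2, 0.8, 9.0, 9.5, 8.7]
centres = [data[0], data[3]]          # crude initialisation

for _ in range(10):
    # Assignment step: each point joins its nearest centre
    # (this distance measure is the whole "teaching signal").
    groups = [[], []]
    for x in data:
        closest = min((0, 1), key=lambda i: abs(x - centres[i]))
        groups[closest].append(x)
    # Update step: each centre moves to the mean of its group.
    centres = [sum(g) / len(g) for g in groups]

print(sorted(round(c, 2) for c in centres))
```

With a different definition of distance, the same data can cluster differently, which is exactly the challenge the card describes.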
40
Q

describe unsupervised learning = specifically

A

Not given the identity of the points
Classify and identify the different types in the data
Data unlabelled = algorithm learns without supervision
Goal = find structure in the data
What is structure though = distance within the space of parameters
No teaching signal = only an internal measure of the distance between things
41
Q

what type of learning happens in the brain

A

Unsupervised = experience the world and make associations
Only based on features in the data
Reinforcement learning = in between, partial feedback
42
Q

predict grades and train the system on labelled examples = what type of learning

A

Supervised
43
Q

have everyone in the class and all the courses they have taken = what type of learning

A

Unsupervised
44
Q

learning so far

A

Supervised and unsupervised = presented data and the agent had to learn from either labelled or unlabelled examples
How does an agent learn to act in the world
Some learning = extremely fast, one shot, like fear/threat conditioning
Info = comes from actions, and learn which actions are good
45
Q

what is reinforcement learning

A

Learn a policy to take actions that maximize the rewards given
Rewards are sparse = most actions don't lead directly to a reward
How to assign credit to an action in the past that led to a reward
46
Q

define reinforcement learning

A

An agent receives occasional rewards and must learn how to act to maximize them
47
Q

maze ex of reinforcement learning

A

Only info that the right decisions were made = at the time of exit
Once the agent finds the exit = it can reinforce the actions that led to finding the right solution
After experiencing the maze several times = the agent can learn the best exit strategy
48
Q

animal behaviour experiments that inspired RL

A

a
49
Q

name the 2 different kinds of learning

A

Classical/Pavlovian conditioning
Operant conditioning
50
Q

define classical/Pavlovian conditioning

A

Stimulus given
No action necessary
Value and the associated behavioural response are learned
No action by the animal
51
Q

define operant conditioning

A

Learn to associate a response with a cue/state of the environment to obtain rewards
Have to make a voluntary action = like pressing a lever, need to learn the action
What is being strengthened = slightly different
52
Q

describe Pavlovian conditioning = graphs

A

Acquisition = learn, reaches a maximum
Extinction = decouple the association = have some sort of forgetting
Spontaneous recovery = persistence in memory = LTM happens though
Dynamics in learning
53
Q

describe Pavlovian conditioning = specifically

A

Associate an intrinsically rewarding stimulus like food (US) with a stimulus that otherwise has no intrinsic value like a sound (CS)
Measure the behavioural response to the CS with learning across different conditions
The bell has no value, but when associated with the US (food) = the conditioned stimulus acquires value
54
Q

learning from trial and error = Rescorla-Wagner rule

A

Formalizes the process
Makes predictions in simple classical conditioning scenarios = predictions in terms of quantitative changes = how much a given reward predicts how strongly you associate the CS with value; more food = stronger association
Why it saturates at some point = once the prediction of value is correct, there is no need to increase the value further
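The Rescorla-Wagner update can be sketched directly: the associative strength V of the CS is moved by the prediction error (reward minus V), so learning saturates once V predicts the reward correctly. The learning rate and reward magnitude below are made-up parameters.

```python
# Rescorla-Wagner rule: V grows fast early (big prediction error)
# and saturates as V approaches the reward it predicts.

alpha = 0.3          # learning rate (salience); made-up value
reward = 1.0         # value of the US (food); made-up value
V = 0.0              # association between CS (bell) and value

strengths = []
for trial in range(20):
    error = reward - V          # prediction error
    V = V + alpha * error       # update: ~0 once V ≈ reward
    strengths.append(V)

print(round(strengths[0], 3), round(strengths[-1], 3))
```

The printed pair shows the saturation the card describes: a large first-trial jump, then a final strength essentially equal to the reward.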
55
Q

limits of the R-W rule

A

Based on trial structure and not easily transferred to sequences of actions and states; the world is more continuous
Ex = Thorndike's cat = how do you assign reward credit to any of the specific actions the cat took in the whole sequence
Hard to assign credit because there are many actions = is it pressing the lever, and how do we assign credit to previous actions
56
Q

how to solve a simple navigation task

A

Reinforcement learning
How can a mouse in a maze learn which actions are more likely to lead to reward = it has to work backwards from the reward state
Driving to a restaurant or putting a fork in your mouth = both have value because they lead to food = being in a state that leads to a state that gives rewards
Framework = assign value to a state as a function of the reward obtained in that state but also the value of the states that can be reached from that state
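The "work backwards from the reward state" idea can be sketched as repeated value backups on a made-up 4-state corridor where only the exit gives reward; the discount factor is also a made-up parameter.

```python
# Value backup: each state's value = its reward plus the discounted
# value of the state it leads to, so value propagates backwards
# from the rewarded exit towards the start.

gamma = 0.9                        # made-up discount factor
rewards = [0.0, 0.0, 0.0, 1.0]     # reward only at the exit
values = [0.0, 0.0, 0.0, 0.0]

for _ in range(10):                # repeat backups until values settle
    for s in range(4):
        next_value = values[s + 1] if s < 3 else 0.0
        values[s] = rewards[s] + gamma * next_value

print([round(v, 2) for v in values])
```

States closer to the exit end up with higher value, which is exactly why the earlier actions on the way to food still count as valuable.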
57
Q

agency in the world

A

Reinforcement learning
How do you link everything you did before to the feedback, and how do you maximize reward
58
Q

perception

A

Visual representations in the ventral stream
59
Q

value functions

A

Value representations in PFC
60
Q

reactive policies

A

Biasing action selection in the striatum
From perception to decision in the dorsal stream = MT/LIP
61
Q

transition model

A

Bayes' rule to update the world model
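The Bayes-rule world-model update on the last card can be sketched as follows, assuming a made-up two-state world and made-up likelihoods.

```python
# Bayes' rule as a world-model update: combine a prior belief over
# world states with the likelihood of an observation to get a
# posterior belief. All probabilities are made-up illustrations.

prior = {"block A": 0.5, "block B": 0.5}
# Likelihood of observing a reward under each state of the world.
likelihood = {"block A": 0.8, "block B": 0.2}

# Posterior ∝ likelihood × prior, then normalise to sum to 1.
unnormalised = {s: likelihood[s] * prior[s] for s in prior}
total = sum(unnormalised.values())
posterior = {s: p / total for s, p in unnormalised.items()}

print({s: round(p, 2) for s, p in posterior.items()})
```

After one rewarded observation, belief shifts towards the state that made the reward more likely, which is the update an agent needs when task contingencies change between blocks.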