Distributional Semantics Flashcards

Question 1

Q

How do we learn new words?

Answer

A

Look them up in a dictionary
Infer their meaning from experience of how they are used
Relate them to similar words we have met in the past

Question 2

Q

What is the distributional hypothesis?

Answer

A

Words that appear in similar contexts tend to have similar meanings

Question 3

Q

In distributional semantics, we want to find f. What is f?

Answer

A

A function that takes the contexts a word appears in, then transforms and compresses them into a vector that captures the meaning of the word
meaning(w) = f(c1, c2, c3, c4), where c1 to c4 are contexts in which w occurs

Question 4

Q

How do we find the function f?

Answer

A

Use co-occurrence vectors

Question 5

Q

What is a co-occurrence vector?

Answer

A

Collect a corpus of documents or sentences

Apply basic preprocessing, such as lowercasing

Count how many times word u appears together with word v (for example, in the same sentence or document)

The meaning of u is the vector [count(u, v1), count(u, v2), …]
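
The steps above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function name cooccurrence_vectors, the toy corpus, whitespace tokenisation, and the choice of "same sentence" as the co-occurrence context are all assumptions made for the example.

```python
from collections import Counter, defaultdict


def cooccurrence_vectors(sentences):
    """Return (vocab, vectors) where vectors[u][i] == count(u, vocab[i]).

    Assumption: two words co-occur when they appear in the same sentence.
    """
    counts = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.lower().split()  # basic preprocessing: lowercase + split
        # Count every pair of distinct token positions within the sentence.
        for i, u in enumerate(tokens):
            for j, v in enumerate(tokens):
                if i != j:
                    counts[u][v] += 1
    vocab = sorted(counts)
    # A Counter returns 0 for pairs that were never observed.
    vectors = {u: [counts[u][v] for v in vocab] for u in vocab}
    return vocab, vectors


# Hypothetical toy corpus, made up for illustration.
corpus = ["The cat drinks milk", "The dog drinks water", "The cat chases the dog"]
vocab, vectors = cooccurrence_vectors(corpus)
print(vocab)
print(vectors["cat"])
print(vectors["dog"])
```

Each position in a vector corresponds to one vocabulary word, so vectors for different words can be compared position by position.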

Question 6

Q

What are the benefits of co-occurrence vectors? (3)

Answer

A

The meaning of a word is a vector, so we can compute similarities between words, such as cosine similarity
We can visualise word meanings
We can use these vectors directly as input to machine learning models
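
As a rough sketch of the first benefit, cosine similarity between two count vectors can be computed as below. The vectors for cat and dog are hypothetical toy counts chosen for illustration, and cosine_similarity is a name introduced here, not a library function.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here for all-zero vectors
    return dot / (norm_a * norm_b)


# Hypothetical co-occurrence counts over the same seven context words.
cat = [0, 1, 1, 1, 1, 3, 0]
dog = [1, 1, 0, 1, 0, 3, 1]

similarity = cosine_similarity(cat, dog)
print(round(similarity, 3))
```

A value near 1 means the two words share most of their contexts; a value of 0 means they share none.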

Question 7

Q

What are the disadvantages of co-occurrence vectors?

Answer

A

It is unclear how to extend distributional semantics beyond single words (for example, to phrases or sentences)

They can't capture all aspects of semantics

(7 cards)