8e. Tutorial Flashcards

(6 cards)

1
Q

What was the AIRbnb review topic about?

A

500 reviews, downloaded by lecturer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What happens in the first step?

A

File is opened - all airbnb reviews

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why does this step happen (step 2)?

A

A numeric column like 12345 is treated by KNIME as a number for calculations, not as text to tokenize or parse.

Converting it with Number to String turns it into textual content that text-mining nodes can consume. KNIME’s Number to String node is specifically for converting number columns into strings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does this node do? (+ why?)

A

Strings to Document turns plain text in a table into KNIME’s document type for text processing.

Why?: in KNIME text mining, a document column is the standard input type for many downstream text nodes. For example, Sentence Extractor works on a document and returns its sentences, and KNIME’s stemming/text-preprocessing nodes describe operating on terms contained in an input

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does this node do?

A
  • removes all words that have less than 3 characters (“a”)
    -> because usually not really meaningful
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does this node do?

A

Topic extraction!

-> chooses set amount of topics and number of words
-> calculates a weighing per topic
-> assigns to greatest weighted topic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly