What was the Airbnb review topic about?
500 reviews, downloaded by lecturer
What happens in the first step?
The file with all Airbnb reviews is opened.
Why does this step happen (step 2)?
A numeric column like 12345 is treated by KNIME as a number for calculations, not as text to tokenize or parse.
Converting it with Number to String turns it into textual content that text-mining nodes can consume. KNIME’s Number to String node is specifically for converting number columns into strings.
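As a rough analogy outside KNIME (plain Python, not KNIME's API), the sketch below shows why a number has to become text before it can be tokenized. The values and variable names are made up for illustration.

```python
# Hypothetical values standing in for a numeric column (e.g. an ID column).
listing_ids = [12345, 67890]

# Numbers support arithmetic but have no text methods such as split().
assert not hasattr(listing_ids[0], "split")

# Converting to strings (the analogue of Number to String) makes them text.
listing_ids_as_text = [str(value) for value in listing_ids]

# Text operations such as tokenizing on whitespace now work.
tokens = [text.split() for text in listing_ids_as_text]
print(tokens)  # [['12345'], ['67890']]
```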
What does this node do? (+ why?)
Strings to Document turns plain text in a table into KNIME’s document type for text processing.
Why?: In KNIME text mining, a document column is the standard input type for many downstream text nodes. For example, Sentence Extractor works on a document and returns its sentences, and KNIME's stemming and other text-preprocessing nodes operate on the terms contained in the input documents.
What does this node do?
What does this node do?
Topic extraction!
-> you choose a fixed number of topics and a number of words per topic
-> calculates a weighting per topic for each document
-> assigns each document to the topic with the greatest weight
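The last step (assigning to the topic with the greatest weight) can be sketched in plain Python. This is only an illustration of the idea, not KNIME's implementation; the weights, review names, and the helper `assign_topic` are hypothetical.

```python
# Hypothetical per-document topic weights, as a topic model might output.
# Each list holds one weight per topic (here: 3 topics, indexed 0 to 2).
doc_topic_weights = {
    "review_1": [0.70, 0.20, 0.10],
    "review_2": [0.15, 0.25, 0.60],
    "review_3": [0.30, 0.45, 0.25],
}


def assign_topic(weights):
    """Return the index of the topic with the greatest weight."""
    return max(range(len(weights)), key=lambda i: weights[i])


# Assign every document to its highest-weighted topic.
assignments = {doc: assign_topic(w) for doc, w in doc_topic_weights.items()}
print(assignments)  # {'review_1': 0, 'review_2': 2, 'review_3': 1}
```

Each review ends up with exactly one topic, even if its second-highest weight is close to the highest.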