What is Text Mining?
The semiautomated process of extracting patterns from large amounts of unstructured data sources.
What are the Seven (7) Application Areas of Text Mining?
What are the Fourteen (1-5) Text Mining Terms we need to know?
What are the Fourteen (6-10) Text Mining Terms we need to know?
What are the Fourteen (11-14) Text Mining Terms we need to know?
What does NLP Stand For and How is it Defined?
Natural Language Processing studies the problem of “understanding” the natural human language, with the view of converting depictions of human language into more formal representations that are easier for computer programs to manipulate.
What are some of the Challenges Related to NLP? (6)
Part-Of-Speech Tagging
Text Segmentation
Word Sense Disambiguation
Syntactic Ambiguity
Imperfect or Irregular Imput
Speech Acts
What is Deception Detection as it Relates to Text Mining?
It is used in prediction models to differentiate deceptive statements from truthful ones
What is Part-Of-Speech Tagging?
Tokenized terms (words) are matched and interpreted against the text based on the term’s definition and the context that it is being used.
What are the Three (3) Steps/Tasks for Text Mining?
What is a TDM?
A Term-Document Matrix that indexes the relationships between terms and documents.
What is SVD?
Singular Value Decomposition reduces the overall dimensionality of the input matrix to a lower-dimensional space where each consecutive dimension represents the largest degree of variability between words and documents.
What is Sentiment Analysis?
Sentiment analysis is trying to answer the question “What do people feel about a certain topic?” by digging into opinions using a variety of automated tools.
What are the Seven (7) Discrete Sentiment Analysis Applications Stated by the Author?
What is the Sentiment Analysis Process?
What are the Three (3) Different Elements of Sentiment Analysis?
Polarity Identification
Identifying Semantic Orientation of Sentences and Phrases
Identifying Semantic Orientation of Documents
What is Polarity Identification?
The process of identifying the sentiments under one of two opposing polarities, or locate the position along the continuum between the polarities.
What are the Two (2) Methods of Polarity Identification?
What is Web Mining?
The process of discovering intrinsic relationships from Web data, which are expressed in the form of textual, linkage, or usage information.
What are Web Crawlers?
AKA Spiders are used to read through the content of a website automatically.
What is an Authoritative Page?
Use of a web page or a relevance index that improves the search results and rankings of relevant pages.
What is a HITS?
A hyperlink-induced topic search. It is a link-analysis algorithm that rates Web pages using the hyperlink information contained within them.
What is Web Structure Mining and Why is it Important?
Web Structure mining is the process of extracting useful information from the links embedded in Web documents. It is used to identify authoritative pages and hubs which are the cornerstones of page-rank algorithms relied upon by Google and other search engines.
What is SEO?
Search Engine Optimization is the intentional activity of affecting the visibility of a website in a search engine’s natural search results.