True or False: Most corporate data is structured in databases.
False
How fast is unstructured corporate data doubling in size?
Every 18 months
According to the slides, tapping into unstructured information sources is not an option but a ______ to stay competitive.
need
What is text mining (exact slide definition)?
A semi-automated process of extracting knowledge from unstructured data sources
Text mining is also called what (exact slide wording)?
Text data mining or knowledge discovery in textual databases
True or False: The benefits of text mining are especially obvious in text-rich data environments.
True
Give one example of a text-rich environment mentioned in the slides.
Law, academic research, finance, medicine, biology, technology, or marketing
In law, what type of text is given as an example for text mining?
Court orders
In academic research, what type of text is given as an example for text mining?
Research articles
In finance, what type of text is given as an example for text mining?
Quarterly reports
In medicine, what type of text is given as an example for text mining?
Discharge summaries
In biology, what type of text is given as an example for text mining?
Molecular interactions
In technology, what type of text is given as an example for text mining?
Patent files
In marketing, what type of text is given as an example for text mining?
Customer comments
Email is an example of what kind of records mentioned in the slides?
Electronic communication records
List one application of text mining for email records.
Spam filtering
List another application of text mining for email records.
Email prioritization and categorization
List another application of text mining for email records.
Automatic response generation
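Study aid: the email applications above (spam filtering, prioritization and categorization) can be pictured with a toy rule-based sketch. This is not from the slides; the term list `SPAM_TERMS`, the function name `categorize`, and the "urgent" rule are all hypothetical, and a real system would learn such rules from labeled mail instead of hard-coding them.

```python
# Hypothetical trigger words; chosen only for illustration.
SPAM_TERMS = {"winner", "free", "prize"}


def categorize(subject):
    """Assign an email subject line to a coarse category."""
    words = set(subject.lower().split())
    if words & SPAM_TERMS:      # spam filtering
        return "spam"
    if "urgent" in words:       # prioritization
        return "high priority"
    return "normal"             # default category


labels = [
    categorize("You are a WINNER of a free prize"),
    categorize("Urgent server outage"),
    categorize("Lunch on Friday"),
]
```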
What is text analytics (exact slide wording)?
A broader concept that includes information retrieval, text mining, data mining, web mining, and NLP
True or False: Text analytics is narrower than text mining.
False
What is information retrieval (exact slide definition)?
Searching and identifying relevant documents for a given set of key terms
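Study aid: the information retrieval definition above can be sketched as simple key-term matching. This is a minimal illustration, not the slides' method; the function name `retrieve`, the sample documents, and the scoring rule (count of distinct key terms found) are assumptions made for the example.

```python
def retrieve(documents, key_terms):
    """Return (doc_id, score) pairs for documents containing any key term.

    Score is the number of distinct key terms found in the document.
    Results are sorted by score (highest first), then by doc_id.
    """
    terms = {t.lower() for t in key_terms}
    scored = []
    for doc_id, text in documents.items():
        words = set(text.lower().split())
        score = len(terms & words)
        if score > 0:
            scored.append((doc_id, score))
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored


# Hypothetical mini-collection, loosely echoing the slide examples.
docs = {
    "d1": "quarterly reports show revenue growth",
    "d2": "court orders and patent files",
    "d3": "customer comments on quarterly revenue",
}
results = retrieve(docs, ["quarterly", "revenue"])
```

Here `d2` is not returned because it contains none of the key terms.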
According to Figure 7.2, text analytics is enabled by which disciplines (list as shown)?
Statistics, machine learning, management science, artificial intelligence, computer science, and other disciplines
In Figure 7.2, name one item shown under Information Retrieval.
Document matching
In Figure 7.2, name another item shown under Information Retrieval.
Link analysis