What is Natural Language Processing?
(NLP)
It is a field of artificial intelligence focused on enabling computers to understand, interpret, and respond to human language.
Examples of NLP applications include virtual assistants, translation services, and sentiment analysis.
True or False:
NLP is concerned with both understanding and generating human language.
True
Natural Language Processing (NLP) involves tasks like interpreting (understanding) and producing (generating) human language.
What is tokenization in NLP?
It is the process of breaking down text into smaller units, such as words or sentences, called tokens.
Tokenization is the first step in text processing for many NLP tasks.
Fill in the blank:
In tokenization, text is divided into smaller ______.
tokens
What are word embeddings?
These are representations of words as vectors in a continuous vector space, capturing semantic meanings and relationships.
Popular word embedding models include Word2Vec and GloVe.
True or False:
Word embeddings help capture the context of words in text.
True
Word embeddings represent words as vectors in a way that reflects their meanings and relationships based on context.
What is machine translation in NLP?
It is the automatic conversion of text from one language to another using computer algorithms.
Examples include Google Translate and Microsoft Translator.
Which NLP task involves summarizing large amounts of text into shorter, coherent summaries?
summarization
What does NER stand for in NLP?
Named Entity Recognition
What is the purpose of Named Entity Recognition (NER) in NLP?
It aims to identify and categorize key information (entities) in text, such as names, dates, and locations.
True or False:
Sentiment analysis is used to determine the emotional tone or attitude expressed in a piece of text.
True
Sentiment analysis classifies text as positive, negative, or neutral to gauge emotional tone or opinion.
What is sentiment analysis in NLP?
It is the process of determining whether a piece of text expresses a positive, negative, or neutral sentiment.
Commonly used in social media monitoring and customer feedback analysis.
Fill in the blank:
In machine translation, algorithms automatically convert text from one ______ to another.
language
Which NLP task would be appropriate for extracting people’s names and locations from a news article?
Named Entity Recognition
(NER)
What is a common challenge in machine translation?
Handling idiomatic expressions and nuances of each language effectively.
Idiomatic expressions often do not translate directly across languages, requiring context-aware translation.
Explain the difference between tokenization and summarization.
Scenario: A company wants to automatically categorize customer feedback into ‘positive’, ‘neutral’, or ‘negative’.
Which NLP technique should they use?
Sentiment analysis
True or False:
Word embeddings can help in understanding the relationships between words based on their context in large datasets.
True
By analyzing word co-occurrence in large datasets, embeddings map semantic relationships—placing similar words closer in vector space.