MS-DS Master of Data science FREE Master of Data Science Natural Language Processing Questions and Answers 3

Question 1

What distinguishes a Transformer architecture from traditional RNN-based models in NLP?

Accepted Answer

It processes all positions in parallel using self-attention instead of sequential recurrence

Answer

Transformers replace recurrence with self-attention, enabling parallel computation across all sequence positions simultaneously.

Question 2

In sentiment analysis, what challenge does sarcasm detection present for standard classifiers?

Accepted Answer

The literal meaning of words contradicts the intended sentiment

Answer

Sarcasm conveys the opposite of the surface-level meaning, causing models that rely on literal word polarity to misclassify sentiment.

Question 3

What is the role of positional encoding in Transformer-based NLP models?

Accepted Answer

To inject information about token order since self-attention has no inherent notion of position

Answer

Since self-attention treats input as an unordered set, positional encodings add sequence order information to the token representations.

Question 4

Which tokenization strategy splits rare words into smaller subword units to handle out-of-vocabulary terms?

Accepted Answer

Byte-Pair Encoding (BPE)

Answer

BPE iteratively merges the most frequent character pairs to build a subword vocabulary that balances coverage and vocabulary size.

Question 5

What does perplexity measure when evaluating a language model?

Accepted Answer

How well the model predicts a held-out test set, with lower values indicating better performance

Answer

Perplexity is the exponentiated average negative log-likelihood per token, where lower values mean the model assigns higher probability to the test data.

MS-DS Master of Data science Practice Test

MS-DS Master of Data science Practice Test

MS-DS Master of Data science FREE Master of Data Science Natural Language Processing Questions and Answers 3