A CNN model achieves 99% accuracy on the training dataset but only 75% on the validation dataset. Which phenomenon is occurring, and what is a common technique to address it?
-
A
Underfitting; increase model complexity by adding more layers.
-
B
Overfitting; apply Dropout to the fully connected layers.
-
C
Data leakage; remove features that are present in both training and validation sets.
-
D
Vanishing gradients; switch to a different activation function like Leaky ReLU.