When visualizing the feature maps of a trained CNN, what type of patterns are typically detected by the filters in the initial (early) layers compared to the final (deeper) layers?