Statistics for Evaluating deep learning for enhanced breast cancer diagnosis: a comparative analysis of CNN architectures