You are working with a team of data scientists on an AI project where multiple machine learning models are being trained to predict customer churn. The models are evaluated based on the Mean Squared Error (MSE) as the loss function. However, one model consistently shows a higher MSE despite having a more complex architecture compared to simpler models. What is the most likely reason for the higher MSE in the more complex model?
Submit