Quantifying Multimodal Imbalance: A GMM-Guided Adaptive Loss for Audio-Visual Learning