...Techniques include deep ensembles, Monte Carlo dropout, temperature scaling, stochastic variational inference, heteroscedastic heads, and out-of-distribution detection workflows. Each baseline emphasizes reproducibility: fixed seeds, standard splits, and strong metrics such as calibration error, AUROC for OOD, and accuracy under shift.