Topic

#model-calibration

1 article exploring model-calibration. Expert insights and analysis from our editorial team.

Showing 1–1 of 1 articles

Articles

Newest first
Models & Research

MM-JudgeBias Exposes Compositional Bias in MLLM-as-a-Judge: What It Means for Teams Running Model-Based Eval Pipelines

MM-JudgeBias shows MLLM judges inherit the compositional biases they evaluate, so teams must audit judge selection rather than assume model-based eval removes labeling work.