MACR Tackles the Problem of LLMs That Trust Wrong Sources

A research team has proposed MACR, a framework designed to make LLMs stop blindly trusting either their training data or whatever context you hand them.

Current systems essentially force a binary choice: either the model's internal knowledge wins, or the external context does. MACR rejects that premise. It first measures how confident the model actually is in its own answer using a modified semantic entropy score, then generates a working context from whichever source — internal or external — looks more reliable. From there, three specialized agents take over: one surfaces explicit reasoning rules, one maps out potential conflicts, and the third resolves inconsistencies. Crucially, the framework handles two distinct conflict types that prior work has largely ignored — cases where the model's parametric knowledge contradicts provided context, and cases where multiple pieces of external context contradict each other.

The second conflict type is where things get practically interesting. Retrieval-augmented systems often pull in several documents that don't agree with each other, and today's LLMs have no principled way to arbitrate between them. MACR is built to surface and resolve those inconsistencies explicitly rather than averaging them away. On the PopQA and ConFiQA benchmarks, the paper reports MACR outperforming state-of-the-art baselines, with the added benefit of producing human-readable explanations of how each conflict was resolved.

The interpretability claim is worth watching. The field is littered with conflict-handling papers that show benchmark gains without explaining the reasoning — MACR's agent-based design at least makes the resolution auditable, which matters more than the leaderboard number if you're deploying this in production.

← Back to the front page