Skip to yearly menu bar Skip to main content


Entailment Closure Failures in Large Language Models: A Benchmark for Cross-Query Logical Consistency

Ben Jenkins

Abstract

Chat is not available.