Skip to yearly menu bar Skip to main content


tinyBenchmarks: evaluating LLMs with fewer examples

Felipe Polo · Lucas Weber · Leshem Choshen · Yuekai Sun · Gongjun Xu · Mikhail Yurochkin

Abstract

Chat is not available.