[2402.15929] Certifying Knowledge Comprehension in LLMs

By Advanced AI Editor, April 23, 2025

[Submitted on 24 Feb 2024 (v1), last revised 21 Apr 2025 (this version, v3)]

Authors: Isha Chaudhary and 2 other authors

Abstract: Large Language Models (LLMs) are increasingly deployed in safety-critical systems where they provide answers based on in-context information derived from knowledge bases. As LLMs are increasingly envisioned as superhuman agents, their proficiency in knowledge comprehension (extracting relevant information and reasoning over it to answer questions, a key facet of human intelligence) becomes crucial. However, existing evaluations of LLMs on knowledge comprehension are typically conducted on small test sets that represent only a tiny fraction of the vast number of possible queries. Simple empirical evaluations on these limited test sets raise concerns about the reliability and generalizability of the results. In this work, we introduce the first specification and certification framework for knowledge comprehension in LLMs, providing formal probabilistic guarantees for reliability. Instead of a fixed dataset, we design novel specifications that mathematically represent prohibitively large probability distributions of knowledge comprehension prompts with natural noise, using knowledge graphs. From these specifications, we generate quantitative certificates that offer high-confidence, tight bounds on the probability that a given LLM correctly answers any question drawn from the specification distribution. We apply our framework to certify SOTA LLMs in two domains: precision medicine and general question-answering. Our results reveal previously unrecognized vulnerabilities in SOTA LLMs due to natural noise in the prompts. Additionally, we establish performance hierarchies with formal guarantees among the SOTA LLMs, particularly in the context of precision medicine question-answering.
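
To make the idea of a "quantitative certificate" concrete, here is a minimal Python sketch of one way to turn sampled pass/fail outcomes into high-confidence bounds on the probability of a correct answer. The abstract does not say which statistical method the authors use; the Clopper-Pearson binomial interval below is an assumption chosen for illustration, and `certify`, `sample_and_check`, and the stand-in model are hypothetical names, not part of the paper.

```python
import math
import random


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p), computed by direct summation."""
    return sum(math.comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def clopper_pearson(k, n, alpha=0.05, tol=1e-9):
    """Exact two-sided (1 - alpha) interval for a binomial proportion.

    ASSUMPTION: this interval is an illustrative choice; the abstract
    does not name the bounding method used in the paper.
    """

    def solve(f, target):
        # f(p) is decreasing in p; bisect for the p where f(p) == target.
        lo, hi = 0.0, 1.0
        while hi - lo > tol:
            mid = (lo + hi) / 2
            if f(mid) > target:
                lo = mid
            else:
                hi = mid
        return (lo + hi) / 2

    # Lower bound: smallest p for which seeing >= k successes is plausible.
    lower = 0.0 if k == 0 else solve(lambda p: binom_cdf(k - 1, n, p), 1 - alpha / 2)
    # Upper bound: largest p for which seeing <= k successes is plausible.
    upper = 1.0 if k == n else solve(lambda p: binom_cdf(k, n, p), alpha / 2)
    return lower, upper


def certify(sample_and_check, n_samples=200, alpha=0.05):
    """Draw n_samples prompts and bound the probability of a correct answer.

    sample_and_check is a hypothetical callable: it samples one prompt from
    the specification distribution, queries the model, and returns True if
    the answer is correct.
    """
    correct = sum(bool(sample_and_check()) for _ in range(n_samples))
    return clopper_pearson(correct, n_samples, alpha)


if __name__ == "__main__":
    rng = random.Random(0)
    # Stand-in for a real model: correct on roughly 80% of sampled prompts.
    low, high = certify(lambda: rng.random() < 0.8)
    print(f"95% certificate: P(correct) in [{low:.3f}, {high:.3f}]")
```

Because the prompts are sampled independently from the distribution rather than taken from a fixed test set, the resulting interval is a statement about the whole distribution, which is the contrast with small-benchmark evaluation that the abstract draws.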

Submission history

From: Isha Chaudhary
[v1] Sat, 24 Feb 2024 23:16:57 UTC (258 KB)
[v2] Mon, 7 Oct 2024 15:01:48 UTC (3,810 KB)
[v3] Mon, 21 Apr 2025 23:10:55 UTC (5,207 KB)
