Factuality Challenges in the Era of Large Language Models
This talk will discuss the risks, the challenges, and the opportunities that large language models (LLMs) bring regarding factuality. We will then present some recent work on using LLMs for fact-checking, on detecting machine-generated text, and on fighting the ongoing misinformation pollution with LLMs. Finally, we will present a number of LLM fact-checking tools recently developed at MBZUAI:
- LM-Polygraph, a tool to predict an LLM’s uncertainty in its output using cheap and fast uncertainty quantification techniques,
- Factcheck-Bench, a fine-grained evaluation benchmark and framework for fact-checking the output of LLMs,
- Loki, an open-source tool for fact-checking the output of LLMs, developed based on Factcheck-Bench and optimized for speed and quality,
- OpenFactCheck, a framework for fact-checking LLM output, for building customized fact-checking systems, and for benchmarking LLMs for factuality,
- LLM-DetectAIve, a tool for machine-generated text detection, and
- FRAPPE, a FRAming, Persuasion, and Propaganda Explorer.
Speaker
Preslav Nakov
Preslav Nakov is professor and department chair for NLP at the Mohamed bin Zayed University of Artificial Intelligence. Previously, he was principal scientist at the Qatar Computing Research Institute, HBKU, where he led the Tanbih mega-project, developed in collaboration with MIT, which aims to limit the impact of “fake news”, propaganda and media bias by making users aware of what they are reading, thus promoting media literacy and critical thinking.
He received his Ph.D. in computer science from UC Berkeley, supported by a Fulbright grant. He is chair of the European Chapter of the Association for Computational Linguistics (EACL), secretary of ACL SIGSLAV, and secretary of the Truth and Trust Online board of trustees. Formerly, he was PC chair of ACL 2022, and president of ACL SIGLEX. He is also member of the editorial board of several journals including Computational Linguistics, TACL, ACM TOIS, IEEE TASL, IEEE TAC, CS&L, NLE, AI Communications, and Frontiers in AI. He was the first to receive the Bulgarian president’s John Atanasoff award, named after the inventor of the first automatic electronic digital computer.
