Misinformation is an important problem but fact checkers are overwhelmed by the amount of false content that is produced online every day. To support fact checkers in their efforts, we are creating data driven verification methods that use structured datasets to assess claims and explain their decisions. For statistical claims, we translate text claims into SQL queries on relational databases. We exploit text classifiers to propose validation queries to the users and rely on tentative execution of query candidates to narrow down the set of alternatives. The verification process is controlled by a cost-based optimizer that considers expected verification overheads and the expected claim utility as training samples. For property claims, we use the rich semantics in knowledge graphs (KGs) to verify claims and produce explanations. As information in a KG is inevitably incomplete, we rely on rule discovery and on text mining to gather the evidence to assess claims. Uncertain rules and facts are turned into logical programs and the checking task is modeled as a probabilistic inference problem. Experiments show that both methods enable the efficient and effective labeling of claims with interpretable explanations, both in simulations and in real world user studies with 50% decrease in verification time. Our algorithms are demonstrated in a fact checking website (https://coronacheck.eurecom.fr), which has been used by more than twelve thousands users to verify claims related to the coronavirus disease (COVID-19) spreads and effects.
Explainable fact checking for statistical and property claims
Invited Talk, 15 October 2020, Inria-Ecole Polytechnique CEDAR Seminar, France (Virtual Seminar)
© EURECOM. Personal use of this material is permitted. The definitive version of this paper was published in Invited Talk, 15 October 2020, Inria-Ecole Polytechnique CEDAR Seminar, France (Virtual Seminar) and is available at :
PERMALINK : https://www.eurecom.fr/publication/6393