Publications

Conference paper

kampik T, Yin X, Potyka N, Toni Fet al., 2025,

Strength change explanations in quantitative argumentation

, Autonomous Agents and Multiagent Systems (AAMAS 2026), Publisher: nternational Foundation for Autonomous Agents and Multiagent Systems

In order to make argumentation-based inference contestable, it is crucial to explain what changes can achieve a desired (instead of the contested) inference result. To this end, we introduce strength change explanations for quantitative (bipolar) argumentation graphs. Strength change explanations describe changes to the initial strengths of a subset of the arguments in a given graph that can achieve a desired ordering based on the final strengths of some (potentially different) subset of arguments. We show that the existing notions of inverse and counterfactual problems can be reduced to strength change explanations. We also prove basic soundnessand completeness properties of our strength change explanations, and demonstrate their existence and non-existence in some special cases. Applying a heuristic search, we demonstrate that we can often successfully find strength change explanations for layered graphs that are common in typical application scenarios, althoughlimitations remain for the general problem.

Abstract
Cite

Conference paper

Gehlot P, Rapberger A, Russo F, Toni Fet al., 2025,

Heterogeneous graph neural networks for assumption-based argumentation

, The 40th Annual AAAI Conference on Artificial Intelligence, Publisher: Association for the Advancement of Artificial Intelligence

Assumption-Based Argumentation (ABA) is a powerfulstructured argumentation formalism, but exact computation of extensions under stable semantics is intractable for large frameworks. We present the first Graph Neural Network (GNN) approach to approximate credulous acceptance in ABA. To leverage GNNs, we model ABA frameworks via a dependency graph representation encoding assumptions, claims and rules as nodes, with heterogeneous edge labels distinguishing support, derive and attack relations. We propose two GNN architectures—ABAGCN and ABAGAT—that stack residual heterogeneous convolution or attention layers, respectively, to learn node embeddings. Our models are trained on the ICCMA 2023 benchmark, augmented with synthetic ABAFs, with hyperparameters optimised via Bayesian search. Empirically, both ABAGCN and ABAGAT outperform a state-of-the-art GNN baseline that we adapt from the abstract argumentation literature, achieving a node-level F1 score of up to 0.71 on the ICCMA instances. Finally, we develop a sound polynomial time extension-reconstruction algorithm driven by our predictor: it reconstructs stable extensions with F1 above 0.85 on small ABAFs and maintains an F1 of about 0.58 on large frameworks. Ourwork opens new avenues for scalable approximate reasoning in structured argumentation.

Abstract
Cite

Conference paper

Ayoobi H, Potyka N, Rapberger A, Toni Fet al., 2025,

Argumentative debates for transparent bias detection

, AAAI 2026, Publisher: Association for the Advancement of Artificial Intelligence (AAAI)

As the use of AI in society grows, addressing emerging biases is essential to prevent systematic disadvantages against specific groups. Several fairness notions/algorithmic methods have been proposed, but, with very few exceptions, these tend to ignore transparency. Instead, interpretability and explainability are core requirements for algorithmic fairness, even more so than for other algorithmic solutions, given the human-oriented nature of fairness. In this paper, we contribute a novel transparent method for bias detection to empower debates about the presence of bias against individuals, based on the values of protected features for the individuals and others in their neighbourhoods. Our method builds upon techniques from (formal and computational) argumentation, whereby debates result from arguing about biases within and across neighbourhoods. We evaluate our approach experimentally and demonstrate its strengths in performance against an argumentative baseline.

Abstract
Cite

Conference paper

Zhou K, Dejl A, Freedman G, Chen L, Rago A, Toni Fet al., 2025,

Evaluating uncertainty quantification methods in argumentative large language models

, 2025 Conference on Empirical Methods in Natural Language Processing, Publisher: Association for Computational Linguistics, Pages: 21700-21711

Research in uncertainty quantification (UQ) for large language models (LLMs) is increasingly important towards guaranteeing the reliability of this groundbreaking technology. We explore the integration of LLM UQ methods in argumentative LLMs (ArgLLMs), an explainable LLM framework for decision-making based on computational argumentation in which UQ plays a critical role. We conduct experiments to evaluate ArgLLMs’ performance on claim verification tasks when using different LLM UQ methods, inherently performing an assessment of the UQ methods’ effectiveness. Moreover, the experimental procedure itself is a novel way of evaluating the effectiveness of UQ methods, especially when intricate and potentially contentious statements are present. Our results demonstrate that, despite its simplicity, direct prompting is an effective UQ strategy in ArgLLMs, outperforming considerably more complex approaches.

Abstract
Cite

Conference paper

Gehlot P, Rapberger A, Russo F, Toni Fet al., 2025,

Heterogeneous graph neural networks for credulous acceptance of assumptions in ABA

, The Second International Workshop on Argumentation and Applications co-located with the 22nd International Conference on Principles of Knowledge Representation and Reasoning, Pages: 14-25

Assumption-Based Argumentation (ABA) is a powerful structured argumentation formalism, but exact com-putation of extensions under stable semantics is intractable for large frameworks. We present the first Graph Neural Network (GNN) approach to approximate credulous acceptance in ABA. To use GNNs, we represent ABA frameworks via a dependency graph representation that encodes atoms and rules as nodes and distinguishessupport, derive and attack relations by heterogeneous edge labels. We propose two GNN variants—ABAGCNand ABAGAT—that stack residual heterogeneous convolution or attention blocks, respectively, to learn nodeembeddings. Our models are trained on the ICCMA2023 benchmark, augmented with synthetic ABAFs, withhyperparameters optimised via Bayesian search. Empirically, both ABAGCN and ABAGAT outperform a state-of-the-art GNN baseline that we adapt from the abstract argumentation literature, achieving node-level an F1 score up to 0.71 on the ICCMA instances. Finally, we develop a poly-time extension-reconstruction algorithm driven by our predictor: it reconstructs stable extensions with F1 above 0.85 on small ABAFs and maintains an F1 ofabout 0.58 on frameworks with 1,000 atoms. Our work opens new avenues for scalable approximate reasoning instructured argumentation.

Book chapter

De Angelis E, Proietti M, Toni F, 2025,

Learning to contest argumentative claims

, Rules and Reasoning, Publisher: Springer Nature Switzerland, Pages: 237-255, ISBN: 9783032088864

Contestability is a highly desirable property for human-centric AI, ensuring that the outcomes of an AI system can be challenged, and possibly changed, when interacting with humans and/or other AI systems. In this paper we study contestability of argumentative claims obtained from Assumption-Based Argumentation (ABA) frameworks, a unifying formalism for various non-monotonic reasoning methods that can be used for explainable AI systems. Specifically, we focus on ABA frameworks that are learnt with ABA Learning, a recent approach to symbolic learning from positive and negative examples, given a background knowledge . We formally define a notion of contestation when desirable claims are rejected or undesirable claims are accepted in learnt ABA frameworks. We also show that ABA Learning can be adapted to redress issues raised by contestation so that the desirable claims are accepted and the undesirable claims are rejected. This is naturally achieved by extending the learnt ABA framework without restarting from scratch, and instead preserving as much as possible thereof by considering some of its rules defeasible. We conduct several experiments with a variety of tabular datasets to demonstrate the computational advantages of our contestable ABA Learning in comparison with re-learning from scratch.

Conference paper

Peacock D - M, Potyka N, Toni F, Yin Xet al., 2025,

On the impact of sparsification on quantitative argumentative explanations in neural networks

, 3rd International Workshop on Argumentation for eXplainable AI (ArgXAI@ECAI), Publisher: CEUR Workshop Proceedings, Pages: 20-35, ISSN: 1613-0073

Neural Networks (NNs) are powerful decision-making tools, but their lack of explainability limits their use inhigh-stakes domains such as healthcare and criminal justice. The recent SpArX framework sparsifies NNs andmaps them to (weighted) Quantitative Bipolar Argumentation Frameworks (QBAFs) to provide an argumentative understanding of their mechanics. QBAFs can be explained by various quantitative argumentative explanation methods such as Argument Attribution Explanations (AAEs), Relation Attribution Explanations (RAEs), and Contestability Explanations (CEs) - which assign numerical scores to arguments or relations to quantify their influence on the dialectical strength of an argument to be explained. However, it remains unexplored how sparsification of NNs impacts the explanations derived from the corresponding (weighted) QBAFs. In this paper we explore two directions for impact. First, we empirically investigate how varying the sparsification levels of NNs affects the preservation of these explanations: using four datasets (Iris, Diabetes, Cancer, and COMPAS), we find that AAEs are generally well preserved, whereas RAEs are not. Then, for CEs, we find that sparsification canimprove computational efficiency in several cases. Overall, this study offers a preliminary investigation into thepotential synergy between sparsification and explanation methods, opening up new avenues for future research.

Abstract
Cite

Conference paper

De Olim Gaul G, Gould A, Kori A, Toni Fet al., 2025,

Object-centric case-based reasoning via argumentation

, 3rd International Workshop on Argumentation for eXplainable AI (ArgXAI 2025), Publisher: CEUR-WS.org, Pages: 36-49, ISSN: 1613-0073

We introduce Slot Attention Argumentation for Case-Based Reasoning (SAA-CBR), a novel neuro-symbolic pipeline for image classification that integrates object-centric learning via a neural Slot Attention (SA) component with symbolic reasoning conducted by Abstract Argumentation for Case-Based Reasoning (AA-CBR). We explore novel integrations of AA-CBR with the neural component, including feature combination strategies, casebase reduction via representative samples, novel count-based partial orders, a One-Vs-Rest strategy for extending AA-CBR to multi-class classification, and an application of Supported AA-CBR, a bipolar variant of AA-CBR. We demonstrate that SAA-CBR is an effective classifier on the CLEVR-Hans datasets, showing competitive performance against baseline models.

Abstract
Cite

Conference paper

Jiang J, Bewley T, Amoukou S, Leofante F, Rago A, Mishra S, Toni Fet al., 2025,

Representation consistency for accurate and coherent LLM answer aggregation

, The Thirty-Ninth Annual Conference on Neural Information Processing Systems, Publisher: Neural Information Processing Systems Foundation, Inc. (NeurIPS)

Test-time scaling improves large language models' (LLMs) performance by allocating more compute budget during inference. To achieve this, existing methods often require intricate modifications to prompting and sampling strategies. In this work, we introduce representation consistency (RC), a test-time scaling method for aggregating answers drawn from multiple candidate responses of an LLM regardless of how they were generated, including variations in prompt phrasing and sampling strategy. RC enhances answer aggregation by not only considering the number of occurrences of each answer in the candidate response set, but also the consistency of the model's internal activations while generating the set of responses leading to each answer. These activations can be either dense (raw model activations) or sparse (encoded via pretrained sparse autoencoders). Our rationale is that if the model's representations of multiple responses converging on the same answer are highly variable, this answer is more likely to be the result of incoherent reasoning and should be down-weighted during aggregation. Importantly, our method only uses cached activations and lightweight similarity computations and requires no additional model queries. Through experiments with four open-source LLMs and four reasoning datasets, we validate the effectiveness of RC for improving task performance during inference, with consistent accuracy improvements (up to 4%) over strong test-time scaling baselines. We also show that consistency in the sparse activation signals aligns well with the common notion of coherent reasoning.

Abstract
Cite

Conference paper

Jacob AR, Kori A, Angelis ED, Glocker B, Proietti M, Toni Fet al., 2025,

Object-centric neuro-argumentative learning

, Conference on Neurosymbolic Learning and Reasoning, Publisher: MLResearchPress, Pages: 1077-1089

Over the last decade, as we rely more on deep learning technologies to make critical decisions, concerns regarding their safety, reliability and interpretability have emerged. We introduce a novel Neural Argumentative Learning (NAL) architecture that integrates Assumption-Based Argumentation (ABA) with Object-Centric (OC) deep learning for im- age analysis. Our OC-NAL architecture consists of neural and symbolic components. The former segments and encodes images into facts, while the latter applies ABA learning to develop ABA frameworks enabling image classification. Experiments on synthetic data show that the OC-NAL architecture can be competitive with a state-of-the-art alternative. The code can be found at https://github.com/AbdulRJacob/Neuro-AL.

Abstract
Cite

Conference paper

Gould A, Toni F, 2025,

Neuro-argumentative learning with case-based reasoning

, 19th International Conference on Neurosymbolic Learning and Reasoning, Publisher: MLResearchPress, Pages: 1090-1106

We introduce Gradual Abstract Argumentation for Case-Based Reasoning (Gradual AA-CBR), a data-driven, neurosymbolic classification model in which the outcome is determined by an argumentation debate structure that is learned simultaneously with neural-based feature extractors. Each argument in the debate is an observed case from the training data, favouring their labelling. Cases attack or support those with opposing or agreeing labellings, with the strength of each argument and relationship learned through gradient-based methods. This argumentation debate structure provides human-aligned reasoning, improving model interpretability compared to traditional neural networks (NNs). Unlike the existing purely symbolic variant, Abstract Argumentation for Case-Based Reasoning (AA-CBR), Gradual AA-CBR is capable of multi-class classification, automatic learning of feature and data point importance, assigning uncertainty values to outcomes, using all available data points, and does not require binary features. We show that Gradual AA-CBR performs comparably to NNs whilst significantly outperforming existing AA-CBR formulations.

Abstract
Cite

Conference paper

Blümel L, Rapberger A, Thimm M, Toni Fet al., 2025,

On independence and SCC-recursiveness in assumption-based argumentation

, Thirty-Fourth International Joint Conference on Artificial Intelligence {IJCAI-25}, Publisher: International Joint Conferences on Artificial Intelligence Organization, Pages: 4382-4390

We introduce a notion of conditional independence in (flat) assumption-based argumentation (ABA), where independence between (sets of) assumptions amounts to the presence of information about one set of assumptions not impacting the acceptability of another. We study general properties, computational complexity, and the relation to independence in abstract argumentation. In light of the high computational complexity of deciding independence, we introduce sound methods for checking independence in polynomial time via two different routes: the first utilizes the strongly connected components (SCCs) of the instantiated abstract argumentation framework; the second exploits the structure of the ABA framework directly. Along the way, we introduce the notion of SCC-recursiveness for ABA.

Journal article

Leofante F, Artelt A, Eliades D, Korre A, Toni F, Miller Tet al., 2025,

Explainable AI, energy and critical infrastructure systems

, AI Magazine, Vol: 46, ISSN: 0738-4602

The AAAI 2025 Bridge on “Explainable AI, Energy and Critical Infrastructure Systems” was held at the Pennsylvania Convention Centre, Philadelphia, Pennsylvania, USA, on February 25, 2025. The bridge gathered researchers and practitioners, bringing together innovation research across explainable AI, energy and critical infrastructure systems so they can enhance each other. The Bridge featured five keynote presentations by experts, one tutorial, poster presentations by authors who contributed their research findings, and three breakout sessions to discuss new challenges arising at the intersection of these exciting disciplines.

Conference paper

Gao L, Muyassar H, Hunanyan Y, Pongpanich S, Rago A, Toni Fet al., 2025,

ADA-X: an online system for fully automated, explainable review aggregation

, Proceedings of the 27th European Conference on Artificial Intelligence (ECAI 2025) - Demo Track, Publisher: IOS Press

Cite

Conference paper

Kori A, Toni F, Glocker B, 2025,

Identifiable object representations under spatial ambiguities

, International Conference on Machine Learning 2025, Publisher: MLResearchPress, Pages: 31486-31518, ISSN: 2640-3498

Modular object-centric representations are essential for human-like reasoning but are challenging to obtain under spatial ambiguities, e.g. due to occlusions and view ambiguities. However, addressing challenges presents both theoretical and practical difficulties. We introduce a novel multi-view probabilistic approach that aggregates view-specific slots to capture invariant content information while simultaneously learning disentangled global viewpoint-level information. Unlike prior single-view methods, our approach resolves spatial ambiguities, provides theoretical guarantees for identifiability, and requires no viewpoint annotations. Extensive experiments on standard benchmarks and novel complex datasets validate our method’s robustness and scalability.

Abstract
Cite

Conference paper

Gorur D, Rago A, Toni F, 2025,

Argumentatively coherent judgmental forecasting

, 28th European Conference on Artificial Intelligence, Publisher: IOS Press

Judgmental forecasting employs human opinions to make predictions about future events, rather than exclusively historical data as in quantitative forecasting. When these opinions form an argumentative structure around forecasts, it is useful to study the properties of the forecasts from an argumentative perspective. In this paper, we advocate and formally define a property of argumentative coherence, which, in essence, requires that a forecaster’s reasoning iscoherent with their forecast. We then conduct three evaluations with our notion of coherence. First, we assess the impact of enforcing coherence on human forecasters as well as on Large Language Model (LLM)-based forecasters, given that they have recently shown to be competitive with human forecasters. In both cases, we show that filtering out incoherent predictions improves forecasting accuracy consistently, supporting the practical value of coherence in both human and LLM-based forecasting. Then, via crowd-sourced user experiments, we show that, despite its apparent intuitiveness and usefulness, users do not generally align with this coherence property. This points to the need to integrate, within argumentation-based judgmental forecasting, mechanisms to filter out incoherent opinions before obtaining group forecasting predictions.

Abstract
Cite

Conference paper

Rago A, Vasileiou SL, Tran S, Toni F, Yeoh Wet al., 2025,

A methodology for incompleteness-tolerant and modular gradual semantics for argumentative statement graphs

, 22nd International Conference on Principles of Knowledge Representation and Reasoning (KR 2025), Publisher: International Joint Conferences on Artificial Intelligence Organization

Gradual semantics (GS) have demonstrated great potential in argumentation, in particular for deploying quantitative bipolar argumentation frameworks (QBAFs) in a number of real-world settings, from judgmental forecasting to explainable AI. In this paper, we provide a novel methodology for obtaining GS for statement graphs, a form of structured argumentation framework, where arguments and relations between them are built from logical statements. Our methodology differs from existing approaches in the literature in two main ways. First, it naturally accommodates incomplete information, so that arguments with partially specified premises can play ameaningful role in the evaluation. Second, it is modularlydefined to leverage on any GS for QBAFs. We also define aset of novel properties for our GS and study their suitabilityalongside a set of existing properties (adapted to our setting) for two instantiations of our GS, demonstrating their advantages over existing approaches.

Abstract
Cite

Conference paper

Rapberger A, Russo F, Rago A, Toni Fet al., 2025,

On gradual semantics for assumption-based argumentation

, 22nd International Conference on Principles of Knowledge Representation and Reasoning (KR 2025)

In computational argumentation, gradual semantics are fine-grained alternatives to extension-based and labelling-basedsemantics. They ascribe a dialectical strength to (components of) arguments sanctioning their degree of acceptability. Several gradual semantics have been studied for abstract, bipolar and quantitative bipolar argumentation frameworks (QBAFs), as well as, to a lesser extent, for some forms of structured argumentation. However, this has not been the case for assumption-based argumentation (ABA), despite it being a popular form of structured argumentation with several applications where gradual semantics could be useful. In this paper, we fill this gap and propose a family of novel gradual semantics for equipping assumptions, which are the core components in ABA frameworks, with dialectical strengths. To do so, we use bipolar set-based argumentation frame-works as an abstraction of (potentially non-flat) ABA frame-works and generalise state-of-the-art modular gradual semantics for QBAFs. We show that our gradual ABA semantics satisfy suitable adaptations of desirable properties of gradual QBAF semantics, such as balance and monotonicity. We also explore an argument-based approach that leverages established QBAF modular semantics directly, and use it as base-line. Finally, we conduct experiments with synthetic ABA frameworks to compare our gradual ABA semantics with its argument-based counterpart and assess convergence.

Abstract
Cite

Conference paper

Rago A, Palfi B, Sukpanichnant P, Nabli H, Vivek K, Kostopoulou O, Kinross J, Toni Fet al., 2025,

Exploring the effect of explanation content and format on user comprehension and trust in healthcare

, The 14th Conference on Prestigious Applications of Intelligent Systems (PAIS 2025), Publisher: IOS Press

AI-driven tools for healthcare are widely acknowledgedas potentially beneficial to health practitioners and patients, e.g. the QCancer regression tool for cancer risk prediction. However, for these tools to be trusted, they need to be supplemented with explanations. We examine how explanations’ content and format affect user comprehension and trust when explaining QCancer’s predictions. Regarding content, we deploy SHAP and Occlusion-1. Regarding format, we present SHAP explanations, conventionally, ascharts (SC) and Occlusion-1 explanations as charts (OC) as well as text (OT), to which their simpler nature lends itself. We conduct experiments with two sets of stakeholders: the general public (representing patients) and medical students (representing healthcare practitioners). Our experiments showed higher subjective comprehension and trust for Occlusion-1 over SHAP explanations based on content.However, when controlling for format, only OT outperformed SC, suggesting this trend is driven by preferences for text. Other findings corroborated that explanation format, rather than content, is often the critical factor.

Abstract
Cite

Conference paper

Raaghav V, Bikos D, Rago A, Toni F, Charalambides Met al., 2025,

Explainable prediction of the mechanical properties of composites with CNNs

, The 14th Conference on Prestigious Applications of Intelligent Systems (PAIS 2025), Publisher: IOS Press

Composites are amongst the most important materialsmanufactured today, as evidenced by their use in countless applications. In order to establish the suitability of composites in specific applications, finite element (FE) modelling, a numerical method based on partial differential equations, is the industry standard for assessing their mechanical properties. However, FE modelling is exceptionally costly from a computational viewpoint, a limitation which has led to efforts towards applying AI models to this task. However, in these approaches: the chosen model architectures were rudimentary, feed-forward neural networks giving limited accuracy; the studies focus on predicting elastic mechanical properties, without considering material strength limits; and the models lacked transparency, hindering trustworthiness by users. In this paper, we show that convolutional neural networks (CNNs) equipped with methods from explainable AI (XAI) can be successfully deployed to solve this problem. Ourapproach uses customised CNNs trained on a dataset we generate using transverse tension tests in FE modelling to predict composites’ mechanical properties, i.e., Young’s modulus and yield strength. We show empirically that our approach achieves high accuracy, outperforming a baseline, ResNet-34, in estimating the mechanical properties. We then use SHAP and Integrated Gradients, two post-hocXAI methods, to explain the predictions, showing that the CNNs use the critical geometrical features that influence the composites’ behaviour, thus allowing engineers to verify that the models are trust-worthy by representing the science of composites.

Abstract
Cite

Journal article

Dickie C, Lauren S, Belardinelli F, Rago A, Toni Fet al., 2025,

Aggregating bipolar opinions through bipolar assumption-based argumentation

, Autonomous Agents and Multi-Agent Systems, Vol: 39, ISSN: 1387-2532

We introduce a novel method to aggregate Bipolar ArgumentationFrameworks expressing opinions of different parties in debates. We use BipolarAssumption-based Argumentation (ABA) as an all-encompassing formalismfor Bipolar Argumentation under different semantics. By leveraging on recentresults on judgement aggregation in Social Choice Theory, we prove severalpreservation results for relevant properties of Bipolar ABA using quota andoligarchic rules. Specifically, we prove (positive and negative) results about thepreservation of conflict-free, closed, admissible, preferred, complete, set-stable,well-founded and ideal extensions in Bipolar ABA, as well as the preservationof acceptability, acyclicity and coherence for individual assumptions. Finally,we illustrate our methodology and results in the context of a case study onopinion aggregation for the treatment of long COVID patients.

Conference paper

Todd J, Jiang J, Russo A, Winkler S, Sale S, McMillan J, Rago Aet al., 2025,

Explainable time series prediction of tyre energy in formula one race strategy

, SAC 2025: The 40th ACM/SIGAPP Symposium On Applied Computing, Publisher: ACM

Formula One (F1) race strategy takes place in a high-pressure and fast-paced environment where split-second decisions can drastically affect race results. Two of the core decisions of race strategy are when to make pit stops (i.e. replace the cars’ tyres) and which tyre compounds (hard, medium or soft, in normal conditions) to select. The optimal pit stop decisions can be determined by estimatingthe tyre degradation of these compounds, which in turn can be computed from the energy applied to each tyre, i.e. the tyre energy. In this work, we trained deep learning models, using an F1 team’s historic race data consisting of telemetry, to forecast tyre energies during races. Additionally, we fitted XGBoost, a decision tree-based machine learning algorithm, to the same dataset and compared the results, with both giving impressive performance. Furthermore, weincorporated two different explainable AI methods, namely feature importance and counterfactual explanations, to gain insights into the reasoning behind the forecasts. Our contributions thus result in an explainable, automated method which could assist F1 teams in optimising their race strategy.

Abstract
Cite

Conference paper

Thomas D, Jiang J, Kori A, Russo A, Winkler S, Sale S, McMillan J, Belardinelli F, Rago Aet al., 2025,

Explainable reinforcement learning for Formula One race strategy

, The 40th ACM/SIGAPP Symposium On Applied Computing, Publisher: ACM

In Formula One, teams compete to develop their cars to achieve the highest possible finishing position in each race. During a race, however, teams are unable to alter the car, so they must improve their cars’ finishing positions via race strategy, i.e. optimising their selection of which tyre compounds to put on the car and when to do so. In this work, we introduce a reinforcement learning model, RSRL(Race Strategy Reinforcement Learning), to control race strategies in simulations, offering a faster alternative to the industry standard of hard-coded and Monte Carlo-based race strategies. Controlling cars with a pace equating to an expected finishing position of P5.5 (where P1 represents first place and P20 is last place), RSRL achieves an average finishing position of P5.33 on our test race, the 2023Bahrain Grand Prix, outperforming the best baseline of P5.63. We then demonstrate, in a generalisability study, how performance for one track or multiple tracks can be prioritised via training. Further, we supplement model predictions with feature importance, decision tree-based surrogate models, and decision tree counterfactualstowards improving user trust in the model. Finally, we provide illustrations which exemplify our approach in real-world situations, drawing parallels between simulations and reality.

Abstract
Cite

Conference paper

Freedman G, Toni F, 2025,

Exploring the potential for large language models to demonstrate rational probabilistic beliefs

, 38th International FLAIRS Conference, Publisher: LibraryPress@UF, ISSN: 2334-0762

Advances in the general capabilities of large language models (LLMs) have led to their use for information retrieval, and as components in automated decision systems. A faithful representation of probabilistic reasoning in these models may be essential to ensure trustworthy, explainable and effective performance in these tasks. Despite previous work suggesting that LLMs can perform complex reasoning and well-calibrated uncertainty quantification, we find that current versions of this class of model lack the ability to provide rational and coherent representations of probabilistic beliefs. To demonstrate this, we introduce a novel dataset of claims with indeterminate truth values and apply a number of well-established techniques for uncertainty quantification to measure the ability of LLM's to adhere to fundamental properties of probabilistic reasoning.

Abstract
Cite

Conference paper

Freedman G, Dejl A, Gorur D, Yin X, Rago A, Toni Fet al., 2025,

Argumentative large language models for explainable and contestable claim verification

, AAAI Conference on Artificial Intelligence, Publisher: Association for the Advancement of Artificial Intelligence, Pages: 14930-14939, ISSN: 2159-5399

The profusion of knowledge encoded in large language models (LLMs) and their ability to apply this knowledge zero-shot in a range of settings makes them promising candidates for use in decision-making. However, they are currently limited by their inability to provide outputs which can be faithfully explained and effectively contested to correct mistakes. In this paper, we attempt to reconcile these strengths and weaknesses by introducing argumentative LLMs (ArgLLMs), a method for augmenting LLMs with argumentative reasoning. Concretely, ArgLLMs construct argumentation frameworks, which then serve as the basis for formal reasoning in support of decision-making. The interpretable nature of these argumentation frameworks and formal reasoning means that any decision made by ArgLLMs may be explained and contested. We evaluate ArgLLMs’ performance experimentally in comparison with state-of-the-art techniques, in the context of the decision-making task of claim verification. We also define novel properties to characterise contestability and assess ArgLLMs formally in terms of these properties.

Abstract
Cite

Conference paper

Chen L, Dejl A, Toni F, 2025,

Identifying Query-Relevant Neurons in Large Language Models for Long-Form Texts

, The 39th Annual AAAI Conference on Artificial Intelligence

Cite

Conference paper

Ayoobi H, Potyka N, Toni F, 2025,

ProtoArgNet: Interpretable Image Classification with Super-Prototypes and Argumentation

, AAAI Conference on Artificial Intelligence

Cite

Conference paper

Russo F, Toni F, 2025,

Shapley-PC: constraint-based causal structure learning with a Shapley inspired framework

, 4th Conference on Causal Learning and Reasoning (CLeaR 2025)

Causal Structure Learning (CSL), also referred to as causal discovery, amounts to extracting causal relations among variables in data. CSL enables the estimation of causal effects from observational data alone, avoiding the need to perform real life experiments. Constraint-based CSL leverages conditional independence tests to perform causal discovery. We propose Shapley-PC, a novel method to improve constraint-based CSL algorithms by using Shapley values over the possible conditioning sets, to decide which variables are responsible for the observed conditional (in)dependences. We prove soundness, completeness and asymptotic consistency of Shapley-PC and run a simulationstudy showing that our proposed algorithm is superior to existing versions of PC.

Abstract
Cite

Conference paper

Rapberger A, Ulbricht M, Toni F, 2024,

On the correspondence of non-flat assumption-based argumentation and logic programming with negation as failure in the head

, 22nd International Workshop on Nonmonotonic Reasoning NMR 24), Publisher: CEUR Workshop Proceedings, Pages: 122-121, ISSN: 1613-0073

The relation between (a fragment of) assumption-based argumentation (ABA) and logic programs (LPs) under stable model semantics is well-studied. However, for obtaining this relation, the ABA framework needs to be restricted to being flat, i.e., a fragment where the (defeasible) assumptions can never be entailed, only assumed to be true or false. Here, we remove this restriction and show a correspondence between non-flat ABA and LPs with negation as failure in their head. We then extend this result to so-called set-stable ABA semantics, originally defined for the fragment of non-flat ABA called bipolar ABA. We showcase how to define set-stable semantics for LPs with negation as failure in their head and show the correspondence to set-stable ABA semantics.

Abstract
Cite

Conference paper

Vasileiou S, Kumar A, Yeoh W, Son TC, Toni Fet al., 2024,

Dialectical reconciliation via structured argumentative dialogues

, KR 2024

We present a novel framework designed to extend model reconciliation approaches, commonly used in human-aware planning, for enhanced human-AI interaction. By adopting a structured argumentation-based dialogue paradigm, our framework enables dialectical reconciliation to address knowledge discrepancies between an explainer (AI agent) and an explainee (human user), where the goal is for the explainee to understand the explainer's decision. We formally describe the operational semantics of our proposed framework, providing theoretical guarantees. We then evaluate the framework's efficacy ``in the wild'' via computational and human-subject experiments. Our findings suggest that our framework offers a promising direction for fostering effective human-AI interactions in domains where explainability is important.

Publications

Search or filter publications

Filter by type:

Filter by year:

Results

Search results

Strength change explanations in quantitative argumentation

Heterogeneous graph neural networks for assumption-based argumentation

Argumentative debates for transparent bias detection

Evaluating uncertainty quantification methods in argumentative large language models

Heterogeneous graph neural networks for credulous acceptance of assumptions in ABA

Learning to contest argumentative claims

On the impact of sparsification on quantitative argumentative explanations in neural networks

Object-centric case-based reasoning via argumentation

Representation consistency for accurate and coherent LLM answer aggregation

Object-centric neuro-argumentative learning

Neuro-argumentative learning with case-based reasoning

On independence and SCC-recursiveness in assumption-based argumentation

Explainable AI, energy and critical infrastructure systems

ADA-X: an online system for fully automated, explainable review aggregation

Identifiable object representations under spatial ambiguities

Argumentatively coherent judgmental forecasting

A methodology for incompleteness-tolerant and modular gradual semantics for argumentative statement graphs

On gradual semantics for assumption-based argumentation

Exploring the effect of explanation content and format on user comprehension and trust in healthcare

Explainable prediction of the mechanical properties of composites with CNNs

Aggregating bipolar opinions through bipolar assumption-based argumentation

Explainable time series prediction of tyre energy in formula one race strategy

Explainable reinforcement learning for Formula One race strategy

Exploring the potential for large language models to demonstrate rational probabilistic beliefs

Argumentative large language models for explainable and contestable claim verification

Identifying Query-Relevant Neurons in Large Language Models for Long-Form Texts

ProtoArgNet: Interpretable Image Classification with Super-Prototypes and Argumentation

Shapley-PC: constraint-based causal structure learning with a Shapley inspired framework

On the correspondence of non-flat assumption-based argumentation and logic programming with negation as failure in the head

Dialectical reconciliation via structured argumentative dialogues