Imperial College London

ProfessorFrancescaToni

Faculty of EngineeringDepartment of Computing

Professor in Computational Logic
 
 
 
//

Contact

 

+44 (0)20 7594 8228f.toni Website

 
 
//

Location

 

430Huxley BuildingSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@inproceedings{Gaskell:2022:ijcai.2022/573,
author = {Gaskell, A and Miao, Y and Toni, F and Specia, L},
doi = {ijcai.2022/573},
pages = {4129--4135},
publisher = {International Joint Conferences on Artificial Intelligence},
title = {Logically consistent adversarial attacks for soft theorem provers},
url = {http://dx.doi.org/10.24963/ijcai.2022/573},
year = {2022}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB - Recent efforts within the AI community haveyielded impressive results towards “soft theoremproving” over natural language sentences using lan-guage models. We propose a novel, generativeadversarial framework for probing and improvingthese models’ reasoning capabilities. Adversarialattacks in this domain suffer from the logical in-consistency problem, whereby perturbations to theinput may alter the label. Our Logically consis-tent AdVersarial Attacker, LAVA, addresses this bycombining a structured generative process with asymbolic solver, guaranteeing logical consistency.Our framework successfully generates adversarialattacks and identifies global weaknesses commonacross multiple target models. Our analyses revealnaive heuristics and vulnerabilities in these mod-els’ reasoning capabilities, exposing an incompletegrasp of logical deduction under logic programs.Finally, in addition to effective probing of thesemodels, we show that training on the generatedsamples improves the target model’s performance.
AU - Gaskell,A
AU - Miao,Y
AU - Toni,F
AU - Specia,L
DO - ijcai.2022/573
EP - 4135
PB - International Joint Conferences on Artificial Intelligence
PY - 2022///
SP - 4129
TI - Logically consistent adversarial attacks for soft theorem provers
UR - http://dx.doi.org/10.24963/ijcai.2022/573
UR - https://www.ijcai.org/proceedings/2022/573
UR - http://hdl.handle.net/10044/1/97096
ER -