Publications

Citation

BibTex format

@inproceedings{Gaskell:2022:ijcai.2022/573,
author = {Gaskell, A and Miao, Y and Toni, F and Specia, L},
doi = {ijcai.2022/573},
pages = {4129--4135},
publisher = {International Joint Conferences on Artificial Intelligence},
title = {Logically consistent adversarial attacks for soft theorem provers},
url = {http://dx.doi.org/10.24963/ijcai.2022/573},
year = {2022}
}

Download

RIS format (EndNote, RefMan)

TY  - CPAPER
AB  - Recent efforts within the AI community haveyielded impressive results towards “soft theoremproving” over natural language sentences using lan-guage models. We propose a novel, generativeadversarial framework for probing and improvingthese models’ reasoning capabilities. Adversarialattacks in this domain suffer from the logical in-consistency problem, whereby perturbations to theinput may alter the label. Our Logically consis-tent AdVersarial Attacker, LAVA, addresses this bycombining a structured generative process with asymbolic solver, guaranteeing logical consistency.Our framework successfully generates adversarialattacks and identifies global weaknesses commonacross multiple target models. Our analyses revealnaive heuristics and vulnerabilities in these mod-els’ reasoning capabilities, exposing an incompletegrasp of logical deduction under logic programs.Finally, in addition to effective probing of thesemodels, we show that training on the generatedsamples improves the target model’s performance.
AU  - Gaskell,A
AU  - Miao,Y
AU  - Toni,F
AU  - Specia,L
DO  - ijcai.2022/573
EP  - 4135
PB  - International Joint Conferences on Artificial Intelligence
PY  - 2022///
SP  - 4129
TI  - Logically consistent adversarial attacks for soft theorem provers
UR  - http://dx.doi.org/10.24963/ijcai.2022/573
UR  - https://www.ijcai.org/proceedings/2022/573
UR  - http://hdl.handle.net/10044/1/97096
ER  -

Download

Citation

BibTex format

RIS format (EndNote, RefMan)

Explainable AI