My research focuses on data-driven approaches to natural language processing, with a particular interest in multimodal and multilingual context models and in work at the intersection of language and vision. Applications of my work include machine translation, image captioning, quality estimation and text adaptation.
I currently hold an ERC (European Research Council) Starting Grant on Multi-modal Context Modelling for Machine Translation.
I am also a part-time Professor of Language Engineering at the University of Sheffield. See my Sheffield page for more.
Madhyastha PS, Wang J, Specia L, 2018, The role of image representations in vision to language tasks, Natural Language Engineering, Vol:24, Pages:415-439
Madhyastha P, Wang J, Specia L, 2019, VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions, Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28 - August 2, 2019, Volume 1: Long Papers, Pages:6539-6550
Ive J, Madhyastha P, Specia L, 2019, Distilling Translations with Visual Awareness, Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28 - August 2, 2019, Volume 1: Long Papers, Pages:6525-6538
et al., 2019, Probing the Need for Visual Context in Multimodal Machine Translation, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2-7, 2019, Volume 1 (Long and Short Papers), Pages:4159-4170