Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering

Man Luo; Kazuma Hashimoto; Semih Yavuz; Zhiwei Liu; Chitta Baral; Yingbo Zhou

Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering

Man Luo, Kazuma Hashimoto, Semih Yavuz, Zhiwei Liu, Chitta Baral, Yingbo Zhou

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

6 Scopus citations

Abstract

While both extractive and generative readers have been successfully applied to the Question Answering (QA) task, little attention has been paid toward the systematic comparison of them. Characterizing the strengths and weaknesses of the two readers is crucial not only for making a more informed reader selection in practice but also for developing a deeper understanding to foster further research on improving readers in a principled manner. Motivated by this goal, we make the first attempt to systematically study the comparison of extractive and generative readers for question answering. To be aligned with the state-of-the-art, we explore nine transformer-based large pre-trained language models (PrLMs) as backbone architectures. Furthermore, we organize our findings under two main categories: (1) keeping the architecture invariant, and (2) varying the underlying PrLMs. Among several interesting findings, it is important to highlight that (1) the generative readers perform better in long context QA, (2) the extractive readers perform better in short context while also showing better out-of-domain generalization, and (3) the encoder of encoder-decoder PrLMs (e.g., T5) turns out to be a strong extractive reader and outperforms the standard choice of encoder-only PrLMs (e.g., RoBERTa). We also study the effect of multi-task learning on the two types of readers varying the underlying PrLMs and perform qualitative and quantitative diagnosis to provide further insights into future directions in modeling better readers.

Original language	English (US)
Title of host publication	Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP
Subtitle of host publication	Decoupling Logic from Knowledge, Proceedings of the Workshop
Editors	Rajarshi Das, Patrick Lewis, Sewon Min, June Thai, Manzil Zaheer
Publisher	Association for Computational Linguistics (ACL)
Pages	7-22
Number of pages	16
ISBN (Electronic)	9781955917506
State	Published - 2022
Event	1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Spa-NLP 2022 - Dublin, Ireland Duration: May 27 2022 → …

Publication series

Name	Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop

Conference

Conference	1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Spa-NLP 2022
Country/Territory	Ireland
City	Dublin
Period	5/27/22 → …

ASJC Scopus subject areas

Computational Theory and Mathematics
Computer Science Applications
Information Systems

Cite this

Luo, M., Hashimoto, K., Yavuz, S., Liu, Z., Baral, C., & Zhou, Y. (2022). Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering. In R. Das, P. Lewis, S. Min, J. Thai, & M. Zaheer (Eds.), Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop (pp. 7-22). (Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop). Association for Computational Linguistics (ACL).

Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering. / Luo, Man; Hashimoto, Kazuma; Yavuz, Semih et al.
Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop. ed. / Rajarshi Das; Patrick Lewis; Sewon Min; June Thai; Manzil Zaheer. Association for Computational Linguistics (ACL), 2022. p. 7-22 (Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Luo, M, Hashimoto, K, Yavuz, S, Liu, Z, Baral, C & Zhou, Y 2022, Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering. in R Das, P Lewis, S Min, J Thai & M Zaheer (eds), Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop. Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop, Association for Computational Linguistics (ACL), pp. 7-22, 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Spa-NLP 2022, Dublin, Ireland, 5/27/22.

Luo M, Hashimoto K, Yavuz S, Liu Z, Baral C, Zhou Y. Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering. In Das R, Lewis P, Min S, Thai J, Zaheer M, editors, Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop. Association for Computational Linguistics (ACL). 2022. p. 7-22. (Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop).

Luo, Man ; Hashimoto, Kazuma ; Yavuz, Semih et al. / Choose Your QA Model Wisely : A Systematic Study of Generative and Extractive Readers for Question Answering. Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop. editor / Rajarshi Das ; Patrick Lewis ; Sewon Min ; June Thai ; Manzil Zaheer. Association for Computational Linguistics (ACL), 2022. pp. 7-22 (Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop).

@inproceedings{453cd16fcd474d25a760800c47b78bb9,

title = "Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering",

abstract = "While both extractive and generative readers have been successfully applied to the Question Answering (QA) task, little attention has been paid toward the systematic comparison of them. Characterizing the strengths and weaknesses of the two readers is crucial not only for making a more informed reader selection in practice but also for developing a deeper understanding to foster further research on improving readers in a principled manner. Motivated by this goal, we make the first attempt to systematically study the comparison of extractive and generative readers for question answering. To be aligned with the state-of-the-art, we explore nine transformer-based large pre-trained language models (PrLMs) as backbone architectures. Furthermore, we organize our findings under two main categories: (1) keeping the architecture invariant, and (2) varying the underlying PrLMs. Among several interesting findings, it is important to highlight that (1) the generative readers perform better in long context QA, (2) the extractive readers perform better in short context while also showing better out-of-domain generalization, and (3) the encoder of encoder-decoder PrLMs (e.g., T5) turns out to be a strong extractive reader and outperforms the standard choice of encoder-only PrLMs (e.g., RoBERTa). We also study the effect of multi-task learning on the two types of readers varying the underlying PrLMs and perform qualitative and quantitative diagnosis to provide further insights into future directions in modeling better readers.",

author = "Man Luo and Kazuma Hashimoto and Semih Yavuz and Zhiwei Liu and Chitta Baral and Yingbo Zhou",

note = "Publisher Copyright: {\textcopyright} 2022 Association for Computational Linguistics.; 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Spa-NLP 2022 ; Conference date: 27-05-2022",

year = "2022",

language = "English (US)",

series = "Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop",

publisher = "Association for Computational Linguistics (ACL)",

pages = "7--22",

editor = "Rajarshi Das and Patrick Lewis and Sewon Min and June Thai and Manzil Zaheer",

booktitle = "Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP",

}

TY - GEN

T1 - Choose Your QA Model Wisely

T2 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Spa-NLP 2022

AU - Luo, Man

AU - Hashimoto, Kazuma

AU - Yavuz, Semih

AU - Liu, Zhiwei

AU - Baral, Chitta

AU - Zhou, Yingbo

PY - 2022

Y1 - 2022

N2 - While both extractive and generative readers have been successfully applied to the Question Answering (QA) task, little attention has been paid toward the systematic comparison of them. Characterizing the strengths and weaknesses of the two readers is crucial not only for making a more informed reader selection in practice but also for developing a deeper understanding to foster further research on improving readers in a principled manner. Motivated by this goal, we make the first attempt to systematically study the comparison of extractive and generative readers for question answering. To be aligned with the state-of-the-art, we explore nine transformer-based large pre-trained language models (PrLMs) as backbone architectures. Furthermore, we organize our findings under two main categories: (1) keeping the architecture invariant, and (2) varying the underlying PrLMs. Among several interesting findings, it is important to highlight that (1) the generative readers perform better in long context QA, (2) the extractive readers perform better in short context while also showing better out-of-domain generalization, and (3) the encoder of encoder-decoder PrLMs (e.g., T5) turns out to be a strong extractive reader and outperforms the standard choice of encoder-only PrLMs (e.g., RoBERTa). We also study the effect of multi-task learning on the two types of readers varying the underlying PrLMs and perform qualitative and quantitative diagnosis to provide further insights into future directions in modeling better readers.

AB - While both extractive and generative readers have been successfully applied to the Question Answering (QA) task, little attention has been paid toward the systematic comparison of them. Characterizing the strengths and weaknesses of the two readers is crucial not only for making a more informed reader selection in practice but also for developing a deeper understanding to foster further research on improving readers in a principled manner. Motivated by this goal, we make the first attempt to systematically study the comparison of extractive and generative readers for question answering. To be aligned with the state-of-the-art, we explore nine transformer-based large pre-trained language models (PrLMs) as backbone architectures. Furthermore, we organize our findings under two main categories: (1) keeping the architecture invariant, and (2) varying the underlying PrLMs. Among several interesting findings, it is important to highlight that (1) the generative readers perform better in long context QA, (2) the extractive readers perform better in short context while also showing better out-of-domain generalization, and (3) the encoder of encoder-decoder PrLMs (e.g., T5) turns out to be a strong extractive reader and outperforms the standard choice of encoder-only PrLMs (e.g., RoBERTa). We also study the effect of multi-task learning on the two types of readers varying the underlying PrLMs and perform qualitative and quantitative diagnosis to provide further insights into future directions in modeling better readers.

UR - http://www.scopus.com/inward/record.url?scp=85134654601&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85134654601&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85134654601

T3 - Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP: Decoupling Logic from Knowledge, Proceedings of the Workshop

SP - 7

EP - 22

BT - Spa-NLP 2022 - 1st Workshop on Semiparametric Methods in NLP

A2 - Das, Rajarshi

A2 - Lewis, Patrick

A2 - Min, Sewon

A2 - Thai, June

A2 - Zaheer, Manzil

PB - Association for Computational Linguistics (ACL)

Y2 - 27 May 2022

ER -

Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering

Abstract

Publication series

Conference

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this