Using Commonsense Knowledge to Answer Why-Questions

Using Commonsense Knowledge to Answer Why-Questions (2022)

Yash Kumar Lal, Niket Tandon, Tanvi Aggarwal, Horace Liu, Nathanael Chambers, Raymond Mooney, Niranjan Balasubramanian

Answering questions in narratives about why events happened often requires commonsense knowledge external to the text. What aspects of this knowledge are available in large language models? What aspects can be made accessible via external commonsense resources? We study these questions in the context of answering questions in the TELLMEWHY dataset using COMET as a source of relevant commonsense relations. We analyze the effects of model size (T5 variants and GPT-3) along with methods of injecting knowledge (COMET) into these models. Results show that the largest models, as expected, yield substantial improvements over base models and injecting external knowledge helps models of all sizes. We also find that the format in which knowledge is provided is critical, and that smaller models benefit more from larger amounts of knowledge. Finally, we develop an ontology of knowledge types and analyze the relative coverage of the models across these categories.

View:

PDF

Citation:

In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, December 2022.

Bibtex:

Presentation:

Video

People

Raymond J. Mooney

Faculty

mooney [at] cs utexas edu

Areas of Interest

Natural Language Processing

Labs

Machine Learning