Conference proceeding
Causal Reasoning About Entities and Events in Procedural Texts
Findings of the Association for Computational Linguistics: EACL 2023, pp 415-431
01 Jan 2023
Abstract
Entities and events are crucial to natural language reasoning and common in procedural texts. Existing work has focused either exclusively on entity state tracking (e.g., whether a pan is hot) or on event reasoning (e.g., whether one would burn themselves by touching the pan), while these two tasks are often causally related. We propose CREPE, the first benchmark on causal reasoning of event plausibility and entity states. We show that most language models, including GPT-3, perform close to chance at .35 F1, lagging far behind human at .87 F1. We boost model performance to .59 F1 by creatively representing events as programming languages while prompting language models pretrained on code. By injecting the causal relations between entities and events as intermediate reasoning steps in our representation, we further boost the performance to .67 F1. Our findings indicate not only the challenge that CREPE brings for language models, but also the efficacy of code-like prompting combined with chain-of-thought prompting for multihop event reasoning.(1)
Metrics
5 Record Views
Details
- Title
- Causal Reasoning About Entities and Events in Procedural Texts
- Creators
- Li Zhang - University of PennsylvaniaHainiu Xu - University of PennsylvaniaYue Yang - University of PennsylvaniaShuyan Zhou - Carnegie Mellon UniversityWeiqiu You - University of PennsylvaniaManni Arora - University of PennsylvaniaChris Callison-Burch - University of Pennsylvania
- Contributors
- Augenstein (Editor)A Vlachos (Editor)
- Publication Details
- Findings of the Association for Computational Linguistics: EACL 2023, pp 415-431
- Conference
- Conference of the European Chapter of the Association for Computational Linguistics, 17 (Dubrovnik, Croatia, 02 May 2023–06 May 2023)
- Publisher
- Association of Computational Linguistics
- Number of pages
- 17
- Grant note
- 2019-19051600004 / IARPA BETTER Program FA8750-19-2-0201 / DARPA LwLL Program 1928631 / NSF; National Science Foundation (NSF) FA8750-19-2-1004 / DARPA KAIROS Program; United States Department of Defense
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Computer Science
- Web of Science ID
- WOS:001181085100030
- Other Identifier
- 991022123456804721
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Collaboration types
- Domestic collaboration
- Web of Science research areas
- Computer Science, Artificial Intelligence
- Computer Science, Theory & Methods