- Title
- GTBENCH: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
- Creators
- Jinhao Duan - Drexel University; Renming Zhang - Boston University; James Diffenderfer - Landesamt für Landwirtschaft und nachhaltige Landentwicklung; Bhavya Kailkhura - Landesamt für Landwirtschaft und nachhaltige Landentwicklung; Lichao Sun - Lehigh University; Elias Stengel-Eskin - University of North Carolina at Chapel Hill; Mohit Bansal - University of North Carolina at Chapel Hill; Tianlong Chen - Harvard University; Kaidi Xu - Drexel University, United States
- Publication Details
- Advances in Neural Information Processing Systems 37 - 38th Conference on Neural Information Processing Systems, NeurIPS 2024, v 37
- Grant note
- FMitF-2319242 / National Science Foundation (http://data.elsevier.com/vocabulary/SciValFunders/100000001); N66001-19-2-4031 / Defense Advanced Research Projects Agency (http://data.elsevier.com/vocabulary/SciValFunders/100000185); AC52-07NA27344 / Lawrence Livermore National Laboratory (http://data.elsevier.com/vocabulary/SciValFunders/100006227); 24-ERD-058 / Lawrence Livermore National Laboratory (http://data.elsevier.com/vocabulary/SciValFunders/100006227); 24-ERD-058; 23-ERD-030 / Laboratory Directed Research and Development (http://data.elsevier.com/vocabulary/SciValFunders/100007000); U.S. Department of Energy (http://data.elsevier.com/vocabulary/SciValFunders/100000015)
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Computer Science (Computing)
- Scopus ID
- 2-s2.0-105000548036
- Other Identifier
- 991022133530804721
2024