Skip to main navigation Skip to search Skip to main content

Self-Para-Consistency: Improving Reasoning Tasks at Low Cost for Large Language Models

  • Wenqing Chen
  • , Weicheng Wang
  • , Zhixuan Chu*
  • , Kui Ren
  • , Zibin Zheng
  • , Zhichao Lu
  • *Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

300 Downloads (CityUHK Scholars)

Abstract

Recently, the self-consistency decoding strategy has shown the ability to improve performance for complex reasoning tasks with large language models (LLMs). However, the costs may be high because the sampling process of the strategy generates some low-probability text, resulting in low-quality reasoning paths. As a consequence, it requires a relatively large sampling number to obtain good aggregation performance. In this paper, we propose an alternative strategy, self-para-consistency. It first generates multiple paraphrases for each test question, then generates reasoning paths for the original and all the paraphrased questions based on greedy decoding, and finally selects the most consistent answer. Since all the candidate paths have relatively high probabilities, the sampling number could be much smaller than the self-consistency strategy. Extensive experiments on complex reasoning datasets demonstrate the effectiveness of our method in reducing the sampling number. © 2024 Association for Computational Linguistics.
Original languageEnglish
Title of host publicationThe 62nd Annual Meeting of the Association for Computational Linguistics
Subtitle of host publicationFindings of the Association for Computational Linguistics: ACL 2024
PublisherAssociation for Computational Linguistics
Pages14162-14167
ISBN (Print)9798891760998
DOIs
Publication statusPublished - Aug 2024
Event62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) - Centara Grand and Bangkok Convention Centre, Bangkok, Thailand
Duration: 11 Aug 202416 Aug 2024
https://aclanthology.org/2024.acl-long
https://2024.aclweb.org/
https://aclanthology.org/
https://aclanthology.org/2024.acl-tutorials
https://aclanthology.org/2024.findings-acl

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Conference

Conference62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
Abbreviated titleACL2024
PlaceThailand
CityBangkok
Period11/08/2416/08/24
Internet address

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'Self-Para-Consistency: Improving Reasoning Tasks at Low Cost for Large Language Models'. Together they form a unique fingerprint.

Cite this