Research on Development of LLMs and Manual Comparison of Applications

Xiaoya Liu (Co-first Author), Jiayi Li (Co-first Author), Tianhao Bai (Co-first Author), Jingtong Gao*, Pengle Zhang, Xiangyu Zhao

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

This paper provides a systematic analysis of the real-world applications of large language models (LLMs) in human-computer interaction, emphasizing their performance and effectiveness. In recent years, the advanced capabilities of LLMs have revolutionized this field, leading to widespread adoption across both academic and practical domains. However, the lack of comprehensive assessments of the practical performance of these models has hindered researchers and practitioners from distinguishing between their capabilities and performance differences. This study examines how LLMs are applied in reasoning tasks within natural language processing, offering a detailed perspective that enhances the understanding and application of these models for both researchers and practitioners. It assesses the performance of leading open-source LLMs using the Moss dataset, focusing on their effectiveness, reliability, and applicability in real-world scenarios. Through meticulous manual comparison and evaluation across eleven key performance metrics, this research reveals performance disparities among these models in practical tasks. By shedding light on these comparative analyses, this study aims to guide future investigations toward a nuanced comprehension of LLM capabilities and limitations, addressing the evolving needs of academia and industry. Future endeavors will expand this analysis to encompass a broader spectrum of models and tasks, providing deeper insights and actionable recommendations for both the research and practical communities. © 2024 IEEE.
Original languageEnglish
Title of host publicationThe 10th International Conference on Big Data and Information Analytics (BigDIA 2024) - Proceedings
PublisherIEEE
Pages23-30
ISBN (Electronic)979-8-3503-5462-1
ISBN (Print)979-8-3503-5463-8
DOIs
Publication statusPublished - 2024
Event10th International Conference on Big Data and Information Analytics (BigDIA 2024) - Chiang Mai, Thailand
Duration: 25 Oct 202428 Oct 2024

Publication series

NameInternational Conference on Big Data and Information Analytics, BigDIA
ISSN (Print)2771-6910
ISSN (Electronic)2771-6902

Conference

Conference10th International Conference on Big Data and Information Analytics (BigDIA 2024)
Country/TerritoryThailand
CityChiang Mai
Period25/10/2428/10/24

Bibliographical note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Research Keywords

  • Application
  • LLMs
  • Manual Metrics

Fingerprint

Dive into the research topics of 'Research on Development of LLMs and Manual Comparison of Applications'. Together they form a unique fingerprint.

Cite this