Skip to main navigation Skip to search Skip to main content

Empirical Analysis of Beam Search Curse and Search Errors with Model Errors in Neural Machine Translation

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

72 Downloads (CityUHK Scholars)

Abstract

Beam search is the most popular decoding method for Neural Machine Translation (NMT) and is still a strong baseline compared with the newly proposed sampling-based methods. To better understand the beam search, we investigate its two well-recognized issues, beam search curse and search error, not only on the test data as a whole but also at the sentence level. We find that only less than 30% of sentences in the WMT17 En–De and De–En test set experience these issues. Meanwhile, there is a related phenomenon. For the majority of sentences, their gold references get lower probabilities than the predictions from the beam search. We also test with different levels of model errors including a special test using training samples and models without regularization. In this test, the model has an accuracy of 95% in predicting the tokens on the training data. We find that these phenomena still exist even for such a model with very high accuracy. These findings show that it is not promising to improve the beam search by seeking higher probabilities and further reducing the search errors in decoding. The relationship between the quality and the probability at the sentence level in our results provides useful information to find new ways to improve NMT. © 2023 The authors.
Original languageEnglish
Title of host publicationProceedings of the 24th Annual Conference of the European Association for Machine Translation
PublisherEuropean Association for Machine Translation
Pages91-101
ISBN (Print)9789520329471
Publication statusPublished - Jun 2023
Event24th Annual Conference of the European Association for Machine Translation, EAMT 2023 - Tampere, Finland
Duration: 12 Jun 202315 Jun 2023
https://aclanthology.org/2023.eamt-1

Publication series

NameProceedings of the 24th Annual Conference of the European Association for Machine Translation, EAMT

Conference

Conference24th Annual Conference of the European Association for Machine Translation, EAMT 2023
PlaceFinland
CityTampere
Period12/06/2315/06/23
Internet address

Bibliographical note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Publisher's Copyright Statement

  • This full text is made available under CC-BY-ND 4.0. https://creativecommons.org/licenses/by-nd/4.0/

Fingerprint

Dive into the research topics of 'Empirical Analysis of Beam Search Curse and Search Errors with Model Errors in Neural Machine Translation'. Together they form a unique fingerprint.

Cite this