Abstract
Beam search is the most popular decoding method for Neural Machine Translation (NMT) and is still a strong baseline compared with the newly proposed sampling-based methods. To better understand the beam search, we investigate its two well-recognized issues, beam search curse and search error, not only on the test data as a whole but also at the sentence level. We find that only less than 30% of sentences in the WMT17 En–De and De–En test set experience these issues. Meanwhile, there is a related phenomenon. For the majority of sentences, their gold references get lower probabilities than the predictions from the beam search. We also test with different levels of model errors including a special test using training samples and models without regularization. In this test, the model has an accuracy of 95% in predicting the tokens on the training data. We find that these phenomena still exist even for such a model with very high accuracy. These findings show that it is not promising to improve the beam search by seeking higher probabilities and further reducing the search errors in decoding. The relationship between the quality and the probability at the sentence level in our results provides useful information to find new ways to improve NMT. © 2023 The authors.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 24th Annual Conference of the European Association for Machine Translation |
| Publisher | European Association for Machine Translation |
| Pages | 91-101 |
| ISBN (Print) | 9789520329471 |
| Publication status | Published - Jun 2023 |
| Event | 24th Annual Conference of the European Association for Machine Translation, EAMT 2023 - Tampere, Finland Duration: 12 Jun 2023 → 15 Jun 2023 https://aclanthology.org/2023.eamt-1 |
Publication series
| Name | Proceedings of the 24th Annual Conference of the European Association for Machine Translation, EAMT |
|---|
Conference
| Conference | 24th Annual Conference of the European Association for Machine Translation, EAMT 2023 |
|---|---|
| Place | Finland |
| City | Tampere |
| Period | 12/06/23 → 15/06/23 |
| Internet address |
Bibliographical note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).Publisher's Copyright Statement
- This full text is made available under CC-BY-ND 4.0. https://creativecommons.org/licenses/by-nd/4.0/
Fingerprint
Dive into the research topics of 'Empirical Analysis of Beam Search Curse and Search Errors with Model Errors in Neural Machine Translation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver