METHOD OF SYSTEM ENGINEERING OF NEURAL MACHINE TRANSLATION SYSTEMS

Authors

DOI:

https://doi.org/10.20535/kpisn.2021.2.236939

Keywords:

method, neural machine translation system, system engineering, Erickson-Penker method

Abstract

Background. There are not many machine translation companies on the market whose products are in demand. These are, for example, free and commercial products such as “GoogleTranslate”, “DeepLTranslator”, “ModernMT”, “Apertium”, “Trident”, to name a few. To implement a more efficient and productive process for developing high-quality neural machine translation systems (NMTS), appropriate scientifically based methods of NMTS engineering are needed in order to get a high-quality and competitive product as quickly as possible.

Objective. The purpose of this article is to apply the Eriksson-Penker business profile to the development and formalization of a method for system engineering of NMTS.

Methods. The idea behind the neural machine translation system engineering method is to apply the Eriksson-Penker system engineering methodology and business profile to formalize an ordered way to develop NMT systems.

Results. The method of developing NMT systems based on the use of system engineering techniques consists of three main stages. At the first stage, the structure of the NMT system is modelled in the form of an Eriksson-Penker business profile. At the second stage, a set of processes is determined that is specific to the class of Data Science systems, and the international CRISP-DM standard. At the third stage, verification and validation of the developed NMTS is carried out.

Conclusions. The article proposes a method of system engineering of NMTS based on the modified Erickson-Penker business profile representation of the system at the meta-level, as well as international process standards of Data Science and Data Mining. The effectiveness of using this method was studied on the example of developing a bidirectional English-Ukrainian NMTS EUMT (English-Ukrainian Machine Translator) and it was found that the EUMT system is at least as good as the quality of English-Ukrainian translation of the popular Google Translate translator. The full version code of the EUMT system is published on the GitHub platform and is available at: https://github.com/EugeneSel/EUMT.

References

H.-E. Eriksson and M. Penker, Business modeling with UML. New York: John Wiley & Sons, 2000, 459 p.

A. Kossiakoff et al., Systems Engineering Principles and Practice, V.K. Batovrin, Ed. Moscow, Russia: DMK Press, 2014, 624 p.

D.K. Hitchins, Systems Engineering: A 21st Century Systems Methodology. Wiley, 2007, 528 p.

S. Krymskyi, “Metod,” in Filosofskyi Entsyklopedychnyi Slovnyk, V.I. Shynkaruk, Ed. Kyiv, Ukraine: Abrys, 2002, 742 p.

P.Р. Maslyanko and O.S. Maystrenko, “The system engineering of organizational system informatization projects,” KPI Sci. News, no. 6, pp. 34–42, 2008.

C. O'Neil and R. Schutt, Doing data science: Straight talk from the frontline. O'Reilly Media, Inc., 2013, 406 p.

F. Provost and T. Fawcett, Data science for business: What you need to know about data mining and data-analytic thinking. O'Reilly Media, Inc., 2013.

D. Bahdanau et al., “Neural machine translation by jointly learning to align and translate,” in 3rd International Conference on Learning Representations, San Diego, United States, 2014.

J. Gehring et al. (2016). A convolutional encoder model for neural machine translation [Online]. Available: https://arxiv.org/pdf/1611.02344.pdf

A. Vaswani et al., “Attention is all you need,” in 31st Conference on Neural Information Processing Systems, Long Beach, CA, USA, 2017.

P. Bojanowski et al., “Enriching word vectors with subword information,” Trans. Assoc. Computation. Ling., vol. 5, pp. 135–146, 2017.

D.P. Kingma and J.L. Ba. (2014). Adam: A method for stochastic optimization [Online]. Available: https://arxiv.org/pdf/1412.6980.pdf?source=post_page

C. Szegedy et al. (2016). Rethinking the inception architecture for computer vision [Online]. Available: https://arxiv.org/pdf/1512.00567.pdf

K. Papineni et al. “Bleu: A method for automatic evaluation of machine translation,” in Proc. 40th Annual Meeting of the Association for Computational Linguistics (ACL), Philadelphia, 2002, pp. 311–318.

J. Tiedemann. (2012). Parallel data, tools and interfaces in OPUS [Online]. Available: http://www.lrec-conf.org/proceedings/lrec2012/pdf/463_Paper.pdf

M. Johnson et al. (2017). Google’s multilingual neural machine translation system: Enabling zero-shot translation [Online]. Available: https://arxiv.org/pdf/1611.04558.pdf

S. Holger et al. (2019). WikiMatrix: Mining 135M parallel sentences in 1620 language pairs from Wikipedia [Online]. Available: https://arxiv.org/pdf/1907.05791.pdf

A. El-Kishky et al. (2021). XLEnt: Mining Cross-lingual Entities with Lexical-Semantic-Phonetic Word Alignment [Online]. Available: http://data.statmt.org/xlent/elkishky_XLEnt.pdf

A. Abdelali et al., “The AMARA Corpus: Building parallel language resources for the educational domain,” in Proc. 9th International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, 2014, p. 1856–1862.

Published

2021-08-31

Issue

Section

Статті