25
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      MT Evaluation in the Context of Language Complexity

      1 , 1 , 1 , 2 , 3
      Complexity
      Hindawi Limited

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The paper focuses on investigating the impact of artificial agent (machine translator) on human agent (posteditor) using a proposed methodology, which is based on language complexity measures, POS tags, frequent tagsets, association rules, and their summarization. We examine this impact from the point of view of language complexity in terms of word and sentence structure. By the proposed methodology, we analyzed 24 733 tags of English to Slovak translations of technical texts, corresponding to the output of two MT systems (Google Translate and the European Commission’s MT tool). We used both manual (adequacy and fluency) and semiautomatic (HTER metric) MT evaluation measures as the criteria for validity. We show that the proposed methodology is valid based on the evaluation of frequent tagsets and rules of MT outputs produced by Google Translate or of the European Commission’s MT tool, and both postedited MT (PEMT) outputs using baseline methods. Our results have also shown that PEMT output produced by Google Translate is characterized by more frequent tagsets such as verbs in the infinitive with modal verbs compared to its MT output, which is characterized by masculine, inanimate nouns in locative of singular. In the MT output, produced by the European Commission’s MT tool, the most frequent tagset was verbs in the infinitive compared to its postedited MT output, where verbs in imperative and the second person of plural occurred. These findings are also obtained from the use of the proposed methodology for MT evaluation. The contribution of the proposed methodology is an identification of systematic not random errors. Additionally, the study can also serve as information for optimizing the translation process using postediting.

          Related collections

          Most cited references58

          • Record: found
          • Abstract: not found
          • Article: not found

          Task complexity: A review and conceptualization framework

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            On Using Very Large Target Vocabulary for Neural Machine Translation

              Bookmark
              • Record: found
              • Abstract: not found
              • Book: not found

              The Growth and Maintenance of Linguistic Complexity

                Bookmark

                Author and article information

                Contributors
                Journal
                Complexity
                Complexity
                Hindawi Limited
                1099-0526
                1076-2787
                December 17 2021
                December 17 2021
                : 2021
                : 1-15
                Affiliations
                [1 ]Department of Computer Science, Constantine the Philosopher University, Nitra, SK-949 01, Slovakia
                [2 ]Institute of Automation and Computer Science, Brno University of Technology, Brno, CZ-619 69, Czech Republic
                [3 ]Department of Informatics, Mendel University in Brno, Brno, CZ-613 00, Czech Republic
                Article
                10.1155/2021/2806108
                2ede557e-90cd-4239-a20c-bda59e04fc49
                © 2021

                https://creativecommons.org/licenses/by/4.0/

                History

                Comments

                Comment on this article