You are here: University of Vienna PHAIDRA Detail o:1626053
Title
Neural Machine Translation - How machines learn to translate patent language
Subtitle (en)
An overview, evaluation and tutorial
Alternative Title (de)
Neuronale maschinelle Übersetzung - wie Maschinen lernen Patentsprache zu übersetzen : ein Überblick, eine Evaluation und ein Leitfaden
Language
English
Description (de)
This work strives to be an easy to understand overview of how the current state-of-the-art in machine translation, neural machine translation (NMT), works. Using the example of patent translation, the thesis aims to both demystify the terms “AI” and “deep-learning”, that are often associated with NMT and aims to provide an accessible guide for translators and Translation Studies scholars to work with, create and understand their own NMT models. A theoretical foundation to MT is provided on which the work presents the creation and evaluation of five NMT models to determine the impact of data selection before model training. For this purpose, the five models were trained on five different datasets sorted by domain: A mixed dataset, an optics dataset, a dataset containing all domains but optics and two smaller versions of the mixed and optics-free dataset for parity in data quantity with the optics dataset. It was found that the network’s performance varied noticeably depending on how much and which data was used for training. While the common conception, that more data equals better results, held true in the automatic evaluation, it was shown that the domain specific training can help with improving results in the human evaluation. In fact, a large discrepancy between the automatic evaluation (BLEU score) and the human evaluation (based on SAE J2450) could be ovserved, with the worst performing model in the automatic metric having the best results in the human evaluation. Many of the problems can be attributed to the lack of extra-sentential context consideration in both the translation and evaluation.
Keywords (de)
NMÜ ; neuronale maschinelle Übersetzung ; MÜ ; maschinelle Übersetzung ; TQA ; translation quality assessment ; Patentübersetzung ; Englisch ; Japanisch ; statistische maschinelle Übersetzung ; SMÜ ; regelbasierte maschinelle Übersetzung ; RBMÜ AutorInnen-Schlagwörter (eng.): NMT ; neural machine translation ; MT ; machine translation ; TQA ; translation quality assessment ; patent translation ; English ; Japanese ; statistical machine translation ; SMT ; rule-based machine translation ; RBMT
Coverage (de)
Maschnielle Übersetzung (Stand 2020)
Author of the digital object
Christian  Lang  (Universität Wien)
21.01.2023
Format
application/pdf
Size
4.2 MB
Licence Selected
All rights reserved
Type of publication
Master's Dissertation
Date of approbation period
2020-07-31
Organization Association
Centre for Translation Studies
Study
undefined > 070 > 331 > 342
Content
Details
Object type
PDFDocument
Format
application/pdf
Created
21.01.2023 09:39:11
Metadata