Multiword Units in Machine Translation and Translation Technology

Ruslan Mitkov

University of Wolverhampton

Johanna Monti

“L’Orientale” University of Naples

Gloria Corpas Pastor

University of Málaga

Violeta Seretan

University of Geneva

2019


E-Book: 272 English pages

Price: 5.000 Toman

Download: Multiword Units in Machine Translation and Translation Technology (Mitkov & Monti & Corpas PastorSeretan 2019).


راهنمای سریع دانلود، کلیک کنید .


The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Language Processing but is a challenging and complex task. In recent years, the computational treatment of MWUs has received considerable attention but there is much more to be done before we can claim that NLP and Machine Translation (MT) systems process MWUs successfully.

This volume provides a general overview of the field with particular reference to Machine Translation and Translation Technology and focuses on languages such as English, Basque, French, Romanian, German, Dutch and Croatian, among others. The chapters of the volume illustrate a variety of topics that address this challenge, such as the use of rule-based approaches, compound splitting techniques, MWU identification methodologies in multilingual applications, and MWU alignment issues.


Table of Contents
Multiword units in machine translation and translation technology
Johanna Monti, Violeta Seretan, Gloria Corpas Pastor and Ruslan Mitkov
Part 1. Multiword units in machine translation

Analysing linguistic information about word combinations for a Spanish-Basque rule-based machine translation system

Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka and Kepa Sarasola
How do students cope with machine translation output of multiword units? An exploratory study
Joke Daems, Michael Carl, Sonia Vandepitte, Robert J. Hartsuiker and Lieve Macken
Aligning verb + noun collocations to improve a French-Romanian FSMT system
Amalia Todiraşcu and Mirabela Navlea
Part 2. Multiword units in multilingual NLP applications

Multiword expressions in multilingual information extraction

Gregor Thurmair
A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish
Simon Clematide, Stéphanie Lehner, Johannes Graën and Martin Volk
Dutch compound splitting for bilingual terminology extraction
Lieve Macken and Arda Tezcan
Part 3. Identification and translation of multiword units
A flexible framework for collocation retrieval and translation from parallel and comparable corpora
Oscar Mendoza Rivera, Ruslan Mitkov and Gloria Corpas Pastor
On identification of bilingual lexical bundles for translation purposes: The case of an English-Polish comparable corpus of patient information leaflets
Łukasz Grabowski
The quest for croatian idioms as multiword units
Kristina Kocijan and Sara Librenjak
Corpus analysis of croatian constructions with the verb doći ‘to come’
Goranka Blagus Bartolec and Ivana Matas Ivanković
Anaphora resolution, collocations and translation
Eric Wehrli and Luka Nerima