Using Linguistic Knowledge in Statistical Machine Translation

Using Linguistic Knowledge in Statistical Machine Translation
Author :
Publisher :
Total Pages : 162
Release :
ISBN-10 : OCLC:710154183
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Using Linguistic Knowledge in Statistical Machine Translation by : Rabih Mohamed Zbib

Download or read book Using Linguistic Knowledge in Statistical Machine Translation written by Rabih Mohamed Zbib and published by . This book was released on 2010 with total page 162 pages. Available in PDF, EPUB and Kindle. Book excerpt: In this thesis, we present methods for using linguistically motivated information to enhance the performance of statistical machine translation (SMT). One of the advantages of the statistical approach to machine translation is that it is largely language-agnostic. Machine learning models are used to automatically learn translation patterns from data. SMT can, however, be improved by using linguistic knowledge to address specific areas of the translation process, where translations would be hard to learn fully automatically. We present methods that use linguistic knowledge at various levels to improve statistical machine translation, focusing on Arabic-English translation as a case study. In the first part, morphological information is used to preprocess the Arabic text for Arabic-to-English and English-to-Arabic translation, which reduces the gap in the complexity of the morphology between Arabic and English. The second method addresses the issue of long-distance reordering in translation to account for the difference in the syntax of the two languages. In the third part, we show how additional local context information on the source side is incorporated, which helps reduce lexical ambiguity. Two methods are proposed for using binary decision trees to control the amount of context information introduced. These methods are successfully applied to the use of diacritized Arabic source in Arabic-to-English translation. The final method combines the outputs of an SMT system and a Rule-based MT (RBMT) system, taking advantage of the flexibility of the statistical approach and the rich linguistic knowledge embedded in the rule-based MT system.


Using Linguistic Knowledge in Statistical Machine Translation Related Books

Using Linguistic Knowledge in Statistical Machine Translation
Language: en
Pages: 162
Authors: Rabih Mohamed Zbib
Categories:
Type: BOOK - Published: 2010 - Publisher:

DOWNLOAD EBOOK

In this thesis, we present methods for using linguistically motivated information to enhance the performance of statistical machine translation (SMT). One of th
Linguistically Motivated Statistical Machine Translation
Language: en
Pages: 159
Authors: Deyi Xiong
Categories: Language Arts & Disciplines
Type: BOOK - Published: 2015-02-11 - Publisher: Springer

DOWNLOAD EBOOK

This book provides a wide variety of algorithms and models to integrate linguistic knowledge into Statistical Machine Translation (SMT). It helps advance conven
Introducing linguistic knowledge into statistical machine translation[
Language: en
Pages:
Authors: Adrià Gispert Ramis
Categories:
Type: BOOK - Published: 2007 - Publisher:

DOWNLOAD EBOOK

Machine Translation
Language: en
Pages: 114
Authors: Maja Popović
Categories:
Type: BOOK - Published: 2009 - Publisher:

DOWNLOAD EBOOK

Statistical Language Learning
Language: en
Pages: 196
Authors: Eugene Charniak
Categories: Computers
Type: BOOK - Published: 1996 - Publisher: MIT Press

DOWNLOAD EBOOK

This text introduces statistical language processing techniques--word tagging, parsing with probabilistic context free grammars, grammar induction, syntactic di