Generalized Domain Adaptation for Sequence Labeling in Natural Language Processing

Generalized Domain Adaptation for Sequence Labeling in Natural Language Processing
Author :
Publisher :
Total Pages : 100
Release :
ISBN-10 : OCLC:1280138683
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Generalized Domain Adaptation for Sequence Labeling in Natural Language Processing by : Min Xiao

Download or read book Generalized Domain Adaptation for Sequence Labeling in Natural Language Processing written by Min Xiao and published by . This book was released on 2016 with total page 100 pages. Available in PDF, EPUB and Kindle. Book excerpt: Sequence labeling tasks have been widely studied in the natural language processing area, such as part-of-speech tagging, syntactic chunking, dependency parsing, and etc. Most of those systems are developed on a large amount of labeled training data via supervised learning. However, manually collecting labeled training data is too time-consuming and expensive. As an alternative, to alleviate the issue of label scarcity, domain adaptation has recently been proposed to train a statistical machine learning model in a target domain where there is no enough labeled training data by exploiting existing free labeled training data in a different but related source domain. The natural language processing community has witnessed the success of domain adaptation in a variety of sequence labeling tasks. Though the labeled training data in the source domain are available and free, however, they are not exactly as and can be very different from the test data in the target domain. Thus, simply applying naive supervised machine learning algorithms without considering domain differences may not fulfill the purpose. In this dissertation, we developed several novel representation learning approaches to address domain adaptation for sequence labeling in natural language processing. Those representation learning techniques aim to induce latent generalizable features to bridge domain divergence to enable cross-domain prediction. We first tackle a semi-supervised domain adaptation scenario where the target domain has a small amount of labeled training data and propose a distributed representation learning approach based on a probabilistic neural language model. We then relax the assumption of the availability of labeled training data in the target domain and study an unsupervised domain adaptation scenario where the target domain has only unlabeled training data, and give a task-informative representation learning approach based on dynamic dependency networks. Both works are developed in the setting where different domains contain sentences in different genres. We then extend and generalize domain adaptation into a more challenging scenario where different domains contain sentences in different languages and propose two cross-lingual representation learning approaches, one is based on deep neural networks with auxiliary bilingual word pairs and the other is based on annotation projection with auxiliary parallel sentences. All four specific learning scenarios are extensively evaluated with different sequence labeling tasks. The empirical results demonstrate the effectiveness of those generalized domain adaptation techniques for sequence labeling in natural language processing.


Generalized Domain Adaptation for Sequence Labeling in Natural Language Processing Related Books

Generalized Domain Adaptation for Sequence Labeling in Natural Language Processing
Language: en
Pages: 100
Authors: Min Xiao
Categories:
Type: BOOK - Published: 2016 - Publisher:

DOWNLOAD EBOOK

Sequence labeling tasks have been widely studied in the natural language processing area, such as part-of-speech tagging, syntactic chunking, dependency parsing
Biomedical Natural Language Processing
Language: en
Pages: 174
Authors: Kevin Bretonnel Cohen
Categories: Computers
Type: BOOK - Published: 2014-02-15 - Publisher: John Benjamins Publishing Company

DOWNLOAD EBOOK

Biomedical Natural Language Processing is a comprehensive tour through the classic and current work in the field. It discusses all subjects from both a rule-bas
Spoken Language Understanding
Language: en
Pages: 443
Authors: Gokhan Tur
Categories: Language Arts & Disciplines
Type: BOOK - Published: 2011-05-03 - Publisher: John Wiley & Sons

DOWNLOAD EBOOK

Spoken language understanding (SLU) is an emerging field in between speech and language processing, investigating human/ machine and human/ human communication
Practical Natural Language Processing
Language: en
Pages: 456
Authors: Sowmya Vajjala
Categories: Computers
Type: BOOK - Published: 2020-06-17 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

Many books and courses tackle natural language processing (NLP) problems with toy use cases and well-defined datasets. But if you want to build, iterate, and sc
Natural Language Processing and Chinese Computing
Language: en
Pages: 850
Authors: Jie Tang
Categories: Computers
Type: BOOK - Published: 2019-09-30 - Publisher: Springer Nature

DOWNLOAD EBOOK

This two-volume set of LNAI 11838 and LNAI 11839 constitutes the refereed proceedings of the 8th CCF Conference on Natural Language Processing and Chinese Compu