搜索结果: 211-225 共查到“计算语言学”相关记录1239条 . 查询时间(0.478 秒)
This article proposes ESA, a new unsupervised approach to word segmentation. ESA is an iterative process consisting of 3 phases: Evaluation, Selection, and Adjustment. In Evaluation, both certainty an...
Bilingual Co-Training for Sentiment Classification of Chinese Product Reviews
Bilingual Co-Training Sentiment Classification Chinese Product
2015/9/9
The lack of reliable Chinese sentiment resources limits research progress on Chinese sentiment classification. However, there are many freely available English sentiment resources on the Web. This art...
Towards Automatic Error Analysis of Machine Translation Output
Automatic Error Analysis Machine Translation Output
2015/9/9
Evaluation and error analysis of machine translation output are important but difficult tasks. In this article, we propose a framework for automatic error analysis and classification based on the iden...
Levenshtein Distances Fail to Identify Language Relationships Accurately
Levenshtein Distances Fail Identify Language
2015/9/9
The Levenshtein distance is a simple distance metric derived from the number of edit operations needed to transform one string into another. This metric has received recent attention as a means of aut...
Dependency Parsing Schemata and Mildly Non-Projective Dependency Parsing
Dependency Parsing Schemata Mildly Non-Projective Dependency Parsing
2015/9/9
We introduce dependency parsing schemata, a formal framework based on Sikkel's parsing schemata for constituency parsers, which can be used to describe, analyze, and compare dependency parsing algorit...
What Determines Inter-Coder Agreement in Manual Annotations?A Meta-Analytic Investigation
Inter-Coder Agreement Manual Annotations Meta-Analytic
2015/9/9
Recent discussions of annotator agreement have mostly centered around its calculation and interpretation, and the correct choice of indices. Although these discussions are important, they only conside...
This article presents our work on constructing a corpus of news articles in which events are annotated for estimated bounds on their duration, and automatically learning from this corpus. We describe ...
Noun phrases (nps) are a crucial part of natural language, and can have a very complex structure. However, this np structure is largely ignored by the statistical parsing field, as the most widely use...
A Strategy for Information Presentation in Spoken Dialog Systems
Information Presentation Spoken Dialog Systems
2015/9/9
In spoken dialog systems, information must be presented sequentially, making it difficult to quickly browse through a large number of options. Recent studies have shown that user satisfaction is negat...
Information Status Distinctions and Referring Expressions:An Empirical Study of References to People in News Summaries
Information Status Distinctions Referring Expressions References to People News Summaries
2015/9/9
Although there has been much theoretical work on using various information status distinctions to explain the form of references in written text, there have been few studies that attempt to automatica...
This article investigates the effects of different degrees of contextual granularity on language model performance. It presents a new language model that combines clustering and half-contextualization...
Splittability of Bilexical Context-Free Grammars is Undecidable
Splittability Bilexical Context-Free Grammars Undecidable
2015/9/9
Bilexical context-free grammars (2-LCFGs) have proved to be accurate models for statistical natural language parsing. Existing dynamic programming algorithms used to parse sentences under these models...
Discriminative Word Alignment by Linear Modeling
Discriminative Word Alignment Linear Modeling
2015/9/8
Word alignment plays an important role in many NLP tasks as it indicates the correspondence between words in a parallel text. Although widely used to align large bilingual corpora, generative models a...
Generating Phrasal and Sentential Paraphrases:A Survey of Data-Driven Methods
Generating Phrasal Sentential Paraphrases Data-Driven Methods
2015/9/8
The task of paraphrasing is inherently familiar to speakers of all languages. Moreover, the task of automatically generating or extracting semantic equivalences for the various units of language—words...
Disentangling Chat
Disentangling Chat
2015/9/8
When multiple conversations occur simultaneously, a listener must decide which conversation each utterance is part of in order to interpret and respond to it appropriately. We refer to this task as di...