The Role of Approximate Negators in Modeling the Automatic Detection of Negation in Tweets
by
 
Palomino, Norma, author.

Title
The Role of Approximate Negators in Modeling the Automatic Detection of Negation in Tweets

Author
Palomino, Norma, author.

ISBN
9780438102101

Personal Author
Palomino, Norma, author.

Physical Description
1 electronic resource (203 pages)

General Note
Source: Dissertation Abstracts International, Volume: 79-11(E), Section: B.
 
Advisors: Nancy McCracken Committee members: Jeff Hemsley; Keisuke Inoue; Jaklin Kornfelt; Barbara Kwasnik; Bei Yu.

Abstract
Although improvements have been made in the performance of sentiment analysis tools, the automatic detection of negated text (which affects negative sentiment prediction) still presents challenges. More research is needed on new forms of negation beyond prototypical negation cues such as "not" or "never." The present research reports findings on the role of a set of words called "approximate negators," namely "barely," "hardly," "rarely," "scarcely," and "seldom," which, in specific occasions (such as attached to a word from the non-affirmative adverb "any" family), can operationalize negation styles not yet explored. Using a corpus of 6,500 tweets, human annotation allowed for the identification of 17 recurrent usages of these words as negatives (such as "very seldom") which, along with findings from the literature, helped engineer specific features that guided a machine learning classifier in predicting negated tweets. The machine learning experiments also modeled negation scope (i.e. in which specific words are negated in the text) by employing lexical and dependency graph information. Promising results included F1 values for negation detection ranging from 0.71 to 0.89 and scope detection from 0.79 to 0.88. Future work will be directed to the application of these findings in automatic sentiment classification, further exploration of patterns in data (such as part-of-speech recurrences for these new types of negation), and the investigation of sarcasm, formal language, and exaggeration as themes that emerged from observations during corpus annotation.

Local Note
School code: 0659

Subject Term
Artificial intelligence.
 
Linguistics.
 
Computer science.
 
Web studies.

Added Corporate Author
Syracuse University. Information Studies.

Electronic Access
http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqm&rft_dat=xri:pqdiss:10748085


Shelf NumberItem BarcodeShelf LocationShelf LocationHolding Information
XX(688646.1)688646-1001Proquest E-Thesis CollectionProquest E-Thesis Collection