TY - GEN
T1 - The Case against Sentiment Analysis for Natural Text
AU - Siddiqui, Shamoon
AU - Rasool, Ghulam
AU - Ramachandran, Ravi P.
N1 - Funding Information:
This work was supported by the U.S. Department of Education Graduate Assistance in Areas of National Need (GAANN) Grant Number P200A180055.
Publisher Copyright:
© 2021 IEEE.
PY - 2021/7/18
Y1 - 2021/7/18
N2 - Natural language processing is a broad field that encompasses several sub-tasks. One problem that has gained visibility over the past several years is that of Sentiment Analysis. This is the process of determining the attitude of an author towards some subject across some spectrum, typically 'positive' or 'negative, ' by analyzing the textual information. Whereas the field started with simple counting of words with certain characteristics, it has grown in complexity with the advent of deep learning and neural network based language models. Typically, datasets used to train and evaluate these models consist of text with appropriate labels, such as movie reviews with an accompanied star rating. However, the applicability of those results to other scenarios, such as unstructured or natural text has not been clear. In this paper, we demonstrate a clear and simple case that shows that the problem of sentiment analysis is fundamentally unsuitable for natural text. We consider state-of-the-art black box models developed and hosted by 3 of the largest companies in this field: Amazon, Google and IBM.
AB - Natural language processing is a broad field that encompasses several sub-tasks. One problem that has gained visibility over the past several years is that of Sentiment Analysis. This is the process of determining the attitude of an author towards some subject across some spectrum, typically 'positive' or 'negative, ' by analyzing the textual information. Whereas the field started with simple counting of words with certain characteristics, it has grown in complexity with the advent of deep learning and neural network based language models. Typically, datasets used to train and evaluate these models consist of text with appropriate labels, such as movie reviews with an accompanied star rating. However, the applicability of those results to other scenarios, such as unstructured or natural text has not been clear. In this paper, we demonstrate a clear and simple case that shows that the problem of sentiment analysis is fundamentally unsuitable for natural text. We consider state-of-the-art black box models developed and hosted by 3 of the largest companies in this field: Amazon, Google and IBM.
UR - http://www.scopus.com/inward/record.url?scp=85116420746&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85116420746&partnerID=8YFLogxK
U2 - 10.1109/IJCNN52387.2021.9533870
DO - 10.1109/IJCNN52387.2021.9533870
M3 - Conference contribution
AN - SCOPUS:85116420746
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - IJCNN 2021 - International Joint Conference on Neural Networks, Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 International Joint Conference on Neural Networks, IJCNN 2021
Y2 - 18 July 2021 through 22 July 2021
ER -