Machine Learning Techniques for Sentiment Analysis of Code-Mixed and Switched Indian Social Media Text Corpus: A Comprehensive Review

Gazi Imtiyaz Ahmad; Jimmy Singla; Anis Ali; Aijaz Ahmad Reshi; Anas A. Salameh

doi:10.14569/IJACSA.2022.0130254

Research output: Contribution to journal › Article › peer-review

31 Scopus citations

Abstract

This paper presents a comprehensive review of sentiment analysis for code-mixed and code-switched text corpora from Indian social media using machine learning (ML) approaches, based on recent research studies. Code-mixing and code-switching are linguistic behaviors shown by bilingual and multilingual populations, primarily in spoken but also in written communication, especially on social media. Code-mixing involves inserting lower-level linguistic units, such as words and phrases of one language, into sentences of another language (the base language), whereas code-switching involves switching to another language for the length of one sentence or more. In both cases, a bilingual person takes one or more words or phrases from one language and introduces them into another language while communicating in that language, in spoken or written mode. People nowadays express their views and opinions on many issues on social media, and in multilingual countries they do so in English as well as in their native languages. Several reasons can be attributed to code-mixing, including lack of knowledge of a particular subject in one language, empathy, interjection, and clarification. Sentiment analysis of monolingual social media content has been carried out for the last two decades. In recent years, however, Natural Language Processing (NLP) research has also shifted towards code-mixed data, making code-mixed sentiment analysis an evolving field of research.

Original language	English
Pages (from-to)	455-467
Number of pages	13
Journal	International Journal of Advanced Computer Science and Applications
Volume	13
Issue number	2
DOIs	https://doi.org/10.14569/IJACSA.2022.0130254
State	Published - 2022

Keywords

Code mixing
Corpus
Deep learning
Machine learning
NLP
Sentiment analysis
Social media text

Cite this

@article{8e50ae8188ac40c8b87ccf8917093529,

title = "Machine Learning Techniques for Sentiment Analysis of Code-Mixed and Switched Indian Social Media Text Corpus: A Comprehensive Review",

abstract = "A comprehensive review of sentiment analysis for code-mixed and switched text corpus of Indian social media using machine learning (ML) approaches, based on recent research studies has been presented in this paper. Code-mixing and switching are linguistic behavior shown by the bilingual/multilingual population, primarily in spoken but also in written communication, especially on social media. Code-mixing involves combining lower linguistic units like words and phrases of a language into the sentences of other language (the base language) and code-switching involves switching to another language, for the length of one sentence or more. In code-mixing and switching, a bilingual person takes one or more words or phrases from one language and introduces them into another language while communicating in that language in spoken or written mode. People nowadays express their views and opinions on several issues on social media. In multilingual countries, people express their views using English as well as their native languages. Several reasons can be attributed to code-mixing. Lack of knowledge in one language on a particular subject, being empathetic, interjection and clarification are some to name. Sentiment analysis of monolingual social media content has been carried out for the last two decades. However, during recent years, Natural Language Processing (NLP) research focus has also shifted towards the exploration of code-mixed data, thereby, making code mixed sentiment analysis an evolving field of research.",

keywords = "Code mixing, Corpus, Deep learning, Machine learning, NLP, Sentiment analysis, Social media text",

author = "Ahmad, Gazi Imtiyaz and Singla, Jimmy and Ali, Anis and Reshi, Aijaz Ahmad and Salameh, Anas A.",

year = "2022",

doi = "10.14569/IJACSA.2022.0130254",

language = "English",

volume = "13",

pages = "455--467",

journal = "International Journal of Advanced Computer Science and Applications",

issn = "2158-107X",

publisher = "Science and Information Organization",

number = "2",

}

TY - JOUR

T1 - Machine Learning Techniques for Sentiment Analysis of Code-Mixed and Switched Indian Social Media Text Corpus: A Comprehensive Review

AU - Ahmad, Gazi Imtiyaz

AU - Singla, Jimmy

AU - Ali, Anis

AU - Reshi, Aijaz Ahmad

AU - Salameh, Anas A.

PY - 2022

Y1 - 2022

N2 - A comprehensive review of sentiment analysis for code-mixed and switched text corpus of Indian social media using machine learning (ML) approaches, based on recent research studies has been presented in this paper. Code-mixing and switching are linguistic behavior shown by the bilingual/multilingual population, primarily in spoken but also in written communication, especially on social media. Code-mixing involves combining lower linguistic units like words and phrases of a language into the sentences of other language (the base language) and code-switching involves switching to another language, for the length of one sentence or more. In code-mixing and switching, a bilingual person takes one or more words or phrases from one language and introduces them into another language while communicating in that language in spoken or written mode. People nowadays express their views and opinions on several issues on social media. In multilingual countries, people express their views using English as well as their native languages. Several reasons can be attributed to code-mixing. Lack of knowledge in one language on a particular subject, being empathetic, interjection and clarification are some to name. Sentiment analysis of monolingual social media content has been carried out for the last two decades. However, during recent years, Natural Language Processing (NLP) research focus has also shifted towards the exploration of code-mixed data, thereby, making code mixed sentiment analysis an evolving field of research.

AB - A comprehensive review of sentiment analysis for code-mixed and switched text corpus of Indian social media using machine learning (ML) approaches, based on recent research studies has been presented in this paper. Code-mixing and switching are linguistic behavior shown by the bilingual/multilingual population, primarily in spoken but also in written communication, especially on social media. Code-mixing involves combining lower linguistic units like words and phrases of a language into the sentences of other language (the base language) and code-switching involves switching to another language, for the length of one sentence or more. In code-mixing and switching, a bilingual person takes one or more words or phrases from one language and introduces them into another language while communicating in that language in spoken or written mode. People nowadays express their views and opinions on several issues on social media. In multilingual countries, people express their views using English as well as their native languages. Several reasons can be attributed to code-mixing. Lack of knowledge in one language on a particular subject, being empathetic, interjection and clarification are some to name. Sentiment analysis of monolingual social media content has been carried out for the last two decades. However, during recent years, Natural Language Processing (NLP) research focus has also shifted towards the exploration of code-mixed data, thereby, making code mixed sentiment analysis an evolving field of research.

KW - Code mixing

KW - Corpus

KW - Deep learning

KW - Machine learning

KW - NLP

KW - Sentiment analysis

KW - Social media text

UR - https://www.scopus.com/pages/publications/85126130893

U2 - 10.14569/IJACSA.2022.0130254

DO - 10.14569/IJACSA.2022.0130254

M3 - Article

AN - SCOPUS:85126130893

SN - 2158-107X

VL - 13

SP - 455

EP - 467

JO - International Journal of Advanced Computer Science and Applications

JF - International Journal of Advanced Computer Science and Applications

IS - 2

ER -
