Skip to main content

2023 | OriginalPaper | Buchkapitel

A Survey on Code-Mixed Sentiment Analysis Based on Hinglish Dataset

verfasst von : Rekha Baghel

Erschienen in: Proceedings of Fourth International Conference on Computing, Communications, and Cyber-Security

Verlag: Springer Nature Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In recent years, the use of the Internet has been proliferating. Organizations use social media platforms to promote their products and find people’s opinions about their products. People share their experiences, views, and thoughts on social media platforms such as Twitter, Facebook, LinkedIn. Business organizations analyze the posts created by the users about their products through sentiment analysis. India is multilingual, so people use multiple languages to post their opinion, e.g., English and Hindi. Code-mixed language is a way in which people use words of different languages to express their thoughts. There is limited research done on code-mixed languages such as Hinglish, even though a great deal of effort has been devoted to assessing the sentiments of a single language. A comprehensive review of the Hinglish dataset is offered in this study. In this research, we first discuss several datasets used by the researchers to tackle this problem and then perform some extensive investigation to look more closely at them. Then, we examine several efficient approaches for categorizing sentiment for the Hinglish dataset.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Patro J, Samanta B, Singh S, Basu A, Mukherjee P, Choudhury M, Mukherjee A (2017) All that is English may be Hindi: enhancing language identification through automatic ranking of likeliness of word borrowing in social media. CoRR abs/1707.08446. http://arxiv.org/abs/1707.08446 Patro J, Samanta B, Singh S, Basu A, Mukherjee P, Choudhury M, Mukherjee A (2017) All that is English may be Hindi: enhancing language identification through automatic ranking of likeliness of word borrowing in social media. CoRR abs/1707.08446. http://​arxiv.​org/​abs/​1707.​08446
3.
Zurück zum Zitat Thakur V, Sahu R, Omer S (2020) Current state of Hinglish text sentiment analysis. In: Proceedings of the international conference on innovative computing communications ICICC Thakur V, Sahu R, Omer S (2020) Current state of Hinglish text sentiment analysis. In: Proceedings of the international conference on innovative computing communications ICICC
4.
Zurück zum Zitat Singh P, Lefever E (2020) LT3 at SemEval-2020 task 9: cross-lingual embeddings for sentiment analysis of Hinglish social media text Singh P, Lefever E (2020) LT3 at SemEval-2020 task 9: cross-lingual embeddings for sentiment analysis of Hinglish social media text
5.
Zurück zum Zitat Singh G (2021) Sentiment analysis of code-mixed social media text (Hinglish) Singh G (2021) Sentiment analysis of code-mixed social media text (Hinglish)
6.
Zurück zum Zitat Gopalan V, Hopkins M (2020) Reed at SemEval-2020 task 9: fine-tuning and bag-of-words approaches to code-mixed sentiment analysis Gopalan V, Hopkins M (2020) Reed at SemEval-2020 task 9: fine-tuning and bag-of-words approaches to code-mixed sentiment analysis
7.
Zurück zum Zitat Patwa P, Aguilar G, Kar S, Pandey S, Pykl S, Gambäck B, Chakraborty T, Solorio T, Das A (2020) SemEval-2020 task 9: overview of sentiment analysis of code-mixed tweets Patwa P, Aguilar G, Kar S, Pandey S, Pykl S, Gambäck B, Chakraborty T, Solorio T, Das A (2020) SemEval-2020 task 9: overview of sentiment analysis of code-mixed tweets
8.
Zurück zum Zitat Singh G (2020) Decision tree J48 at SemEval-2020 task 9: sentiment analysis for code-mixed social media text (Hinglish) Singh G (2020) Decision tree J48 at SemEval-2020 task 9: sentiment analysis for code-mixed social media text (Hinglish)
9.
Zurück zum Zitat Baroi SJ, Singh N, Das R, Singh TD (2020) NITS-Hinglish-SentiMix at SemEval-2020 task 9: sentiment analysis for code-mixed social media text using an ensemble model Baroi SJ, Singh N, Das R, Singh TD (2020) NITS-Hinglish-SentiMix at SemEval-2020 task 9: sentiment analysis for code-mixed social media text using an ensemble model
10.
Zurück zum Zitat Sasidhar T (2020) Emotion detection in Hinglish (Hindi + English) code-mixed social media text. Procedia Comput Sci Sasidhar T (2020) Emotion detection in Hinglish (Hindi + English) code-mixed social media text. Procedia Comput Sci
11.
Zurück zum Zitat Vijay D, Bohra A, Singh V, Akhtar SS, Shrivastava M (2018) Corpus creation and emotion prediction for Hindi-English code-mixed social media text. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: student research workshop. Association for Computational Linguistics, New Orleans, pp 128–135 Vijay D, Bohra A, Singh V, Akhtar SS, Shrivastava M (2018) Corpus creation and emotion prediction for Hindi-English code-mixed social media text. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: student research workshop. Association for Computational Linguistics, New Orleans, pp 128–135
12.
Zurück zum Zitat Wadhawan A, Aggarwal A (2021) Towards emotion recognition in Hindi-English code-mixed data: a transformer based approach. In: Proceedings of the eleventh workshop on computational approaches to subjectivity, sentiment and social media analysis. Association for Computational Linguistics, pp 195–202 Wadhawan A, Aggarwal A (2021) Towards emotion recognition in Hindi-English code-mixed data: a transformer based approach. In: Proceedings of the eleventh workshop on computational approaches to subjectivity, sentiment and social media analysis. Association for Computational Linguistics, pp 195–202
13.
Zurück zum Zitat Mahadzir NH, Omar MF, Nawi N, Salameh A, Hussin K (2021) Sentiment analysis of code-mixed text: a review Mahadzir NH, Omar MF, Nawi N, Salameh A, Hussin K (2021) Sentiment analysis of code-mixed text: a review
14.
Zurück zum Zitat Sharma R, Nigam S, Jain R (2014) Opinion mining in Hindi language: a survey. Int J Found Comput Sci Technol Sharma R, Nigam S, Jain R (2014) Opinion mining in Hindi language: a survey. Int J Found Comput Sci Technol
15.
Zurück zum Zitat Shelke R, Thakore D (2020) A novel approach for named entity recognition on Hindi language using residual Bilstm network. Int J Nat Lang Comput Shelke R, Thakore D (2020) A novel approach for named entity recognition on Hindi language using residual Bilstm network. Int J Nat Lang Comput
16.
Zurück zum Zitat Srivastava V, Singh M (2020) IIT Gandhinagar at SemEval-2020 task 9: code-mixed sentiment classification using candidate sentence generation and selection Srivastava V, Singh M (2020) IIT Gandhinagar at SemEval-2020 task 9: code-mixed sentiment classification using candidate sentence generation and selection
17.
Zurück zum Zitat Kumar A, Agarwal H, Bansal K, Modi A (2020) BAKSA at SemEval-2020 task 9: bolstering convolutional neural net (CNN) with self-attention for sentiment analysis of code mixed text Kumar A, Agarwal H, Bansal K, Modi A (2020) BAKSA at SemEval-2020 task 9: bolstering convolutional neural net (CNN) with self-attention for sentiment analysis of code mixed text
18.
Zurück zum Zitat Srinivasan R, Subalalitha C (2021) Sentimental analysis from imbalanced code-mixed data using machine learning approaches. Distrib Parallel Databases Srinivasan R, Subalalitha C (2021) Sentimental analysis from imbalanced code-mixed data using machine learning approaches. Distrib Parallel Databases
19.
Zurück zum Zitat Sazzed S, Jayarathna S (2021) SSentiA: a self-supervised sentiment analyzer for classification from unlabeled data. Mach Learn Appl Sazzed S, Jayarathna S (2021) SSentiA: a self-supervised sentiment analyzer for classification from unlabeled data. Mach Learn Appl
20.
Zurück zum Zitat Jain P, Pamula R, Srivastava G (2021) A systematic literature review on machine learning applications for consumer sentiment analysis using online reviews. Comput Sci Rev Jain P, Pamula R, Srivastava G (2021) A systematic literature review on machine learning applications for consumer sentiment analysis using online reviews. Comput Sci Rev
21.
Zurück zum Zitat Rani S, Kumar P (2018) A journey of Indian languages over sentiment analysis: a systematic review. Artif Intell Rev Rani S, Kumar P (2018) A journey of Indian languages over sentiment analysis: a systematic review. Artif Intell Rev
22.
Zurück zum Zitat Ahmad GI, Singla J, Anis A, Reshi AA, Salameh AA (2022) Machine learning techniques for sentiment analysis of code-mixed and switched Indian social media text corpus: a comprehensive review Ahmad GI, Singla J, Anis A, Reshi AA, Salameh AA (2022) Machine learning techniques for sentiment analysis of code-mixed and switched Indian social media text corpus: a comprehensive review
23.
Zurück zum Zitat Gupta V, Jain N, Shubham S, Madan A, Chaudhary A, Xin Q (2021) Toward integrated CNN-based sentiment analysis of tweets for scarce-resource language—Hindi. Trans Asian Low Resour Lang Inf Process 20:1–23CrossRef Gupta V, Jain N, Shubham S, Madan A, Chaudhary A, Xin Q (2021) Toward integrated CNN-based sentiment analysis of tweets for scarce-resource language—Hindi. Trans Asian Low Resour Lang Inf Process 20:1–23CrossRef
24.
Zurück zum Zitat Gysels M (1992) French in urban Lubumbashi Swahili: codeswitching, borrowing, or both? J Multiling Multicult Dev 13:41–55CrossRef Gysels M (1992) French in urban Lubumbashi Swahili: codeswitching, borrowing, or both? J Multiling Multicult Dev 13:41–55CrossRef
26.
Zurück zum Zitat Kaur G, Kaushik A, Sharma S (2019) Cooking is creating emotion: a study on Hinglish sentiments of YouTube cookery channels using semi-supervised approach. Big Data Cogn Comput 3:37CrossRef Kaur G, Kaushik A, Sharma S (2019) Cooking is creating emotion: a study on Hinglish sentiments of YouTube cookery channels using semi-supervised approach. Big Data Cogn Comput 3:37CrossRef
Metadaten
Titel
A Survey on Code-Mixed Sentiment Analysis Based on Hinglish Dataset
verfasst von
Rekha Baghel
Copyright-Jahr
2023
Verlag
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-99-1479-1_18