TwitterSentimentAnalysis
Sentiment analysis for Twitter's tweet (in Indonesia language) was built with 3 models to get a comparison in determining which model gives the best results for predicting a tweet to have a positive or negative meaning.
Install / Use
/learn @devildances/TwitterSentimentAnalysisREADME
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
</head>
<body>
<h1>TWITTER SENTIMENT ANALYSIS</h1>
<br>
<p>
<h2>Background</h2>
Businesses get customer feedback through multiple channels – both offline and online feedback. In fact, online feedback systems and platforms that, by design aggregate customer sentiment, are becoming dominant these days with social media platforms. Customers share their thoughts through Facebook likes, Twitter tweets, LinkedIn comments, Pinterest pins, and more
</p><hr>
<p>
<h2>Process Workflows</h2>
<img src='Dashboard/static/images/processflow.JPG' alt="processworkflows" style="display: block; margin-left: auto;margin-right: auto; width:40%" />
</p><hr>
<p>
<h2>ETL (Extract, Transform, Load)</h2>
Data source:<ul>
<li>Live stream Twitter API
<li>https://github.com/keyreply/Bahasa-Indo-NLP-Dataset
<li>https://www.kaggle.com/ilhamfp31/indonesian-abusive-and-hate-speech-twitter-text
<li>Indonesia Corpus https://dumps.wikimedia.org/idwiki/latest/ (idwiki-latest-pages-articles.xml.bz2)
</ul>
<img src='Dashboard/static/images/etl.JPG' alt="etl" style="display: block; margin-left: auto;margin-right: auto; width:40%" />
</p><hr>
<p>
<h2>Text Cleaning (DC-DP)</h2>
<img src='Dashboard/static/images/textcleaning.JPG' alt="textclean" style="display: block; margin-left: auto;margin-right: auto; width:40%" />
</p><hr>
<p>
<h2>Modeling</h2>
<img src='Dashboard/static/images/modeling.JPG' alt="modeling" style="display: block; margin-left: auto;margin-right: auto; width:40%" />
</p><hr>
<p>
<h2>Result</h2>
<img src='Dashboard/static/images/result.JPG' alt="result" style="display: block; margin-left: auto;margin-right: auto; width:40%" />
</p><hr>
<p>
<h2>Conclusion</h2>
<ul>
<li>Natural Language Processing (NLP) is a hotbed of research in data science these days and one of the most common applications of NLP is sentiment analysis
<li>From opinion polls to creating entire marketing strategies, this domain has completely reshaped the way businesses work
<li>Thousands of text documents can be processed for sentiment in seconds, compared to the hours it would take a team of people to manually complete the same task
</ul>
</p><hr>
<p>
<h2>Future Works</h2>
<ul>
<li>Indonesia has so many sub-languages and this is a big homework to build a corpus that can provide all of it for mapping every each word that has common similarity with other words
<li>When we talk about N-grams bag of words, sometimes stopwords can be meaningful
<li>Misspelling is the most common issue for Bahasa especially in social media, that's why we need to build a large library that can manage every word so stemming words can be an easy task
<li>Satire, this is so painful work that our model can't distinguish the meaning of sentences
<li>Word2Vec and GloVe as the recommend algorithms to build a robust word embedding model
</ul>
</p><hr>
<p>
<h2>Webapps Display (in mobile device resolution example)</h2>
<img src='Dashboard/static/images/main_page.png' alt="result" style="display: block; margin-left: auto;margin-right: auto; width:40%" />
</p>
</html>
Related Skills
node-connect
342.5kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
85.3kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
342.5kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
342.5kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
