The Application of Machine Learning Algorithms for Text Mining based on Sentiment  Analysis Approach

رضا سمیع زاده؛ الناز محمودی سعید آباد

نرم افزار کتابخانه و مرکز اسناد دانشگاه تهران

منو

رکورد قبلی

رکورد بعدی

" The Application of Machine Learning Algorithms for Text Mining based on Sentiment Analysis Approach "
رضا سمیع زاده؛ الناز محمودی سعید آباد 2021/09/25

Record Number	:	1502297
Language of Document	:	English
Main Entry	:	رضا سمیع زاده
Title & Author	:	The Application of Machine Learning Algorithms for Text Mining based on Sentiment Analysis Approach [electronic resources: essay]/ رضا سمیع زاده؛ الناز محمودی سعید آباد
Piece Level	:	Journal of Information Technology Management
Notes Pertaining to Publication, Distribution, Etc.	:	Research Paper
	:	September 2018
Access Link	:	https://jitm.ut.ac.ir/article_65661.html
	:	https://jitm.ut.ac.ir/article_65661_c8b4e78bb9acd81d32aaae090dd3eee4.pdf
Summary or Abstract	:	Classification of the cyber texts and comments into two categories of positive and negative sentiment among social media users is of high importance in the research are related to text mining. In this research, we applied supervised classification methods to classify Persian texts based on sentiment in cyber space. The result of this research is in a form of a system that can decide whether a comment which is published in cyber space such as social networks is considered positive or negative. The comments that are published in Persian movie and movie review websites from 1392 to 1395 are considered as the data set for this research. A part of these data are considered as training and others are considered as testing data. Prior to implementing the algorithms, pre-processing activities such as tokenizing, removing stop words, and n-germs process were applied on the texts. Naïve Bayes, Neural Networks and support vector machine were used for text classification in this study. Out of sample tests showed that there is no evidence indicating that the accuracy of SVM approach is statistically higher than Naïve Bayes or that the accuracy of Naïve Bayes is not statistically higher than NN approach. However, the researchers can conclude that the accuracy of the classification using SVM approach is statistically higher than the accuracy of NN approach in 5% confidence level.
	:	تخصیص نظرها و متنهای منتشر شدۀ کاربران در فضای مجازی به طبقاتی با احساسات مثبت یا منفی، در تحقیق‌های مربوط به متن‌کاوی اهمیت بسیار زیادی دارد. هدف این مقاله، استفاده و مقایسۀ روش‌های یادگیری ماشین در طبقه‌بندی متنهای فارسی بر اساس احساسات کاربران فعال در فضای مجازی است. داده‌های پژوهش، مجموعه نظرهای منتشرشده دربارۀ فیلم‌های ایرانی و خارجی در بازۀ زمانی 1392 تا 1395 در سایت‌های سینمایی و نقد فیلم فارسی زبان است. پیش از بهکارگیری الگوریتم‌ها، فرایند پیش‌پردازش داده‌ها بر اساس تبدیل آنها به نویسه، حذف ایست‌واژه‌ها و تحلیل چند‌واژه‌ای انجام گرفت. برای طبقه‌بندی داده‌ها، الگوریتم‌های با نظارت نایوبیز، ماشین‌ بردار ‌پشتیبان و شبکۀ عصبی استفاده شد. براساس نتایج بهدست‌آمده، در آزمون خارج از نمونه ‌با وجود دقت عددی بیشتر الگوریتم نایوبیز بر شبکۀ عصبی و ماشین‌ بردار پشتیبان بر نایوبیز، برتری آماری نایوبیز بر شبکه‌های عصبی و ماشین ‌بردار ‌پشتیبان بر نایوبیز اثبات نشد. با وجود این، نتایج تحقیق گویای برتری معنادار الگوریتم ماشین بردار پشتیبان بر شبکه‌های عصبی در دقت طبقه‌بندی در سطح اطمینان 5 درصد است.
Topical Name Used as Subject	:	neural network
	:	Sentiment analysis
	:	Support vector machine
	:	Text mining
	:	Naïve bayes
	:	شبکه‌های عصبی
	:	ماشین ‌بردار ‌پشتیبان
	:	متن‌کاوی
	:	نایو بیز
	:	آنالیز احساس
Personal Name - Alternative Intelectual Responsibility	:	الناز محمودی سعید آباد
Originating Source	:	University of Tehran. Central Library and Documentation Center

کپی لینک

پیشنهاد خرید

پیوستها

Search result is zero

نظرسنجی

نظرسنجی منابع

1 - کیفیت نمایش فایلهای دیجیتال چگونه است؟


X	1. عالی 2. خوب 3. متوسط 4. ضعیف
X یادداشت

2 - کیفیت دانلود فایلهای دیجیتال چگونه است؟


X	1. عالی 2. خوب 3. متوسط 4. ضعیف
X یادداشت


X


	خطا

خطا

1 - کیفیت نمایش فایلهای دیجیتال چگونه است؟

2 - کیفیت دانلود فایلهای دیجیتال چگونه است؟