In this paper, the efficiency of many ML models, such as logarithmic regression, SVMs, Naive Bayesian, random forest, and LightGBM are discussed in light of sentiment analysis of e-commerce customer review samples. Based on these findings, it is clear that SVM is the model of choice when all four performance measures— accuracy, precision, recall, and F1 score—are critically important. Other algorithms, such as logistic regression and LightGBM, are relatively good in accuracy and retain high precision and good recall scores. As visualized from the results, both the Naive Bayes and random forest models were convincingly effective in terms of recall, but performed highly in learning of positive sentiments while discerning more FPs. Comparing the work with other studies show that it is possible to improve the random forest and other ensemble methods depending on the preprocessing of data and selection of features.
This work enriches the theoretical background of sentiment analysis while specifying that the choice of the ML model should be optimized for the existing objectives and datasets characteristics. It furthers existing research in the direction of proving the fact that no model under study outperforms all the other models in measures, hence there is a need for model selection depending on the problem that the model is going to solve.
In practical terms, this work reveals that SVM can be adopted as a robust model by corporate organizations intending to employ sentiment analysis when accuracy of all the parameters is paramount. Nevertheless, for the cases when recall rate is more important, for instance in the customer feedback analysis when the goal is to capture all the positive opinions, the Naive Bayes and random forest methods might be used. Thus, it is crucial to continue to refine the models and calibrate them for every dataset and enhancing the domain of knowledge to change the ML.
Several research findings are presented by this study which include the following: the study focuses on the evaluation of the performance of individual and composite classification models on a real-world dataset that is relevant to the sentiment analysis in e-commerce; reveals the advantages and limitations of each model in terms of data processing of the customer sentiment; and offers a guide to researchers and practitioners on the models to use depending on the task at hand.
Applying the results of the present work, one can help manage large amounts of customer reviews with the help of autonomic ML models. Such models enable businesses to work through a large number of review statements and the output is achieved in less time. A surprisingly simple model, such as SVM or logistic regression, that is recommended for implementation can produce quite stable results, which allows companies with limited computing power to work with them.
The contribution of this research is as follows: first, it involves the innovation of applying ML models in the sentiment analysis of e-commerce. Second, although there have been many studies done on e-commerce sentiment analyses particularly using classifiers and ensembles, this study aims at comparing the ML classifiers and ensembles only for the e-commerce sentiment analysis of women’s dress from the aspect of customer reviews. The study also gives useful tips on how models’ performances can be influenced by characteristics of data and the metric under consideration to enhance the knowledge base of this field of study.
The main constraint relating to this study is the usage of one data setting and therefore the findings may not be generalized. In future research, similar models should be tested on various sets of data from other domains or from other categories of products. A limitation of the present study is the use of only traditional and ensemble models to classify the tweets, without the use of deep learning models, such as CNNs or RNNs, which could contain different findings.
This research should be extended in the future to use architectures, such as CNNs and RNNs, for dealing with the complexities of sentiment analysis in e-commerce since such architecture is capable of handling many features of textual data. As suggested by the results from [35], it could be insightful to analyze deep learning models against classical ML and the torrents in-between to determine which methods are best in serving the e-commerce platforms. However, these studies should also compare the results of using more complex preprocessing features, including more detailed feature engineering and fewer dimensions. The creation of methods, which possess a number of algorithms in their base and apply the data of each algorithm to improve the outcome of the other, might also be researched further especially when working with big datasets with a significant disproportion between the amount of positive and negative instances. Lastly, it would be possible to recommend more accurate and relevant models based on one or several types of products or one or another type of customers, which would help to get deeper understanding of customers’ attitudes towards products and services and provide better recommendation systems to improve the users’ experience on different kinds of e-commerce industries.