In this article, the problem of detecting depression disorder through text mining of users' posts on social networks has been investigated in order to support the mental health of society. The review of the literature shows that so far there have been extensive studies in the field of detecting mental disorders in social networks (Bauer et al., 2018; Hassantabar et al., 2022; Wongkoblap et al., 2022). Also, a glance view at the previous methods indicates there is a variety of effective ideas. Therefore, it seems necessary to get familiar with the problem-solving space and the previous solutions in order to present a more efficient and effective method. Based on the mentioned necessity, this section follows two significant goals. The first, classifying the previous studies from perspective of the scope covered by the method. Second, reviewing instances of previous experimental studies to get familiar with the variety of previous ideas in the field under study.
According to the first goal, we have presented a classification to categorize the previous studies regarding mental disorders detection based on the scope covered by the methods. The proposed classification is one of research achievements in our research work. Our proposed classification is divided mental disorders detection methods into four general classes as shown in Fig. 2.
As seen in Fig. 2, the methods are divided into 4 different classes based on the scope covered by the methods. Methods that are placed in the prevention class can help in early diagnosis of mental disorders. In fact, this class of methods can be useful to prevent the progression of the mental disorder, increase its severity, or even prevent suicide. For example, research has helped experts to prevent people from committing suicide by finding early signs of suicidal tendencies (Homan et al., 2014). In another research, this class of methods is used to find signs of the disorder, months before the diagnosis of the experts, enabling timely action to prevent the progression of the disorder (Reece et al., 2017).
In prediction class, researches predict mental disorders in social networks according to the signs in the data (Huang, 2022; Large, 2022). These signs can be hidden in the texts or photos posted by the user or his interactions with others. Among the signs of mental disorders in the text, we can refer directly to the name of the disorder, referring to the symptoms of the disease such as lack of sleep or anorexia, or the linguistic characteristics of the user such as the frequent use of negative sentences. Regarding the features related to the image, we can mention the tendency to use certain color combinations. Also, the user's interactions with other users can also help to investigate the mental disorders of the users.
In the monitoring class, studies monitor the status of a pre-confirmed disorder. This class of methods can help doctors to check and control the patient's condition as well as treat the patient (Bauer et al., 2018; Fuller-Tyszkiewicz et al., 2018).
Some studies deal with intervention in the process of mental disorders. This intervention is carried out in two forms: warning and treatment. Warning-oriented intervention can be investigated in two ways. Based on this, the warnings that the proposed systems produce for intervention are warnings that either inform the doctor of the patient's condition (Guntuku et al., 2017) or inform the patient condition to those around him (Jia, 2018).
In studies based on treatment-oriented intervention, the intervention takes place in the form of treatment of mental disorders. For example, in (Park et al., 2015), after the diagnosis of the disorder, a free consultation link is sent to the patients for treatment. In another research, users are sent a link to information and services related to mental health (Wongkoblap et al., 2017).
According to second goal, we have reviewed some experimental studies regarding mental disorders detection using machine learning methods.
(Wongkoblap et al., 2022) have analyzed the big data available in social networks in the field of mental health. The authors have stated that although social networks can be a powerful source in mental health-related research, the topic of exploring textual big data in these networks will be a challenging issue.
In (Su et al., 2020), Su et al. presented a review article that focused on the use of deep learning in identifying mental disorders. Authors said that their review study follows three main directions “ investigating techniques”, “ identifying challenges”, and “ presenting several suggestions for better using deep learning techniques in mental health problems”. They have investigated the overall function of most deep learning techniques in the field of mental health and then identified challenges in the problem under study.
In (Yates et al., 2017), authors have proposed methods to identify text posts harmful to the mental health of the society. They believe that identifying depressed people and thinking of supportive measures to prevent them from harming themselves and others can play an important role in ensuring the mental health of society. The authors state that they have presented a neural network architecture for classifying the texts of users' posts in social networks, which leads to the identification of depressed people at risk of self-harm. In the architecture proposed by the authors, each input is processed by a convolution network and after merging these processed inputs, the presentation vector is made of user activities.
(Cong et al., 2018) presented a deep learning based techniques called X-A-BiLSTM for depression detection on imbalanced social networks data. The authors stated that their proposed method includes two main parts called XGBoost and Attention-BiLSTM neural network. They have said that the first part of the method reduces the amount of data imbalance and the second part of the method increases the power of data classification. Like other related studies, the authors have evaluated the efficiency of their proposed method by applying the method on the RSDD (Reddit Self-Reported Depression Diagnosis) dataset. By reporting the performance evaluation results, they have claimed that their proposed method has provided better function compared to other methods under test.
In (Bouarara, 2022), the authors considered the main goal of their research to identify the behavior of people suffering from mental problems among Twitter users in order to support them. For this purpose, they have used text mining strategy relying on machine learning techniques such as naïve Bayes and k-nearest neighbours. By observing the performance evaluation results of their proposed method, they have concluded that machine learning techniques can play an important role in the text processing process and provide acceptable results in mental disorders detection.
(Babu and Kanaga, 2022) provided a review on sentiment analysis using artificial intelligence on social networks. The authors stated that the most of previous studies followed 6 steps gathering requirements, collecting data, cleaning data, analyzing data, representing data, and visualizing data to investigate raw data. They have said that most of the techniques that have been used for text processing and data classification in the field under study are multi-class machine learning techniques instead of binary-class.
(Aguilera et al., 2021) detected depression and anorexia problem using a one-class classification technique in social networks. The authors stated that their proposed method evaluated the relationship of documents based on their strengths and relying on the gravitational attraction. They said that their proposed one-class approach considers the similarity between the input texts and the relationship between them to make a decision in the context of recognizing the desired class. Also, the authors proposed a new criterion to identify the relationship between documents for the task under study.
(Gupta et al., 2021) intend to identify people with adverse mental conditions by examining the posts of users in the Reddit dataset. For this purpose, they have tried to reveal the emotional mentality of people in social networks by processing the text of users' posts. To achieve this goal, the authors have used six different classification techniques. After evaluating the effectiveness of different classifiers and sentiment analysis and studying the evaluation results, they have stated that the Naïve Bayes classifier has provided an acceptable performance.
In (Kim et al., 2020), Kim et al. used a deep learning-based model to identify mental patients according to the analysis of user's posts content. They employed XGBoost and CNN models to classify people in this field. The authors used TF-IDF vector in the XGBoost classifier to convert the words into n-dimensional vectors. Finally, they have stated that intend to present an ensemble approach based on multi-class classification models in the future to solve the problem of identifying mental disorders. On the other hand, the authors stated that exist some weaknesses in their research. For example, they have not considered the impact of some factors such as socio-demographic and regional differences in classification task. Therefore, solving this challenge with the aim of increasing the accuracy of data classification can be considered as a future work.
In (Hassantabar et al., 2022), Hassantabar et al. investigated a mental health system to detect mental disorders. For this purpose, authors presented a hybrid method called Mhdeep, which use commercially available WMSs and effective deep neural networks models. Also, the authors stated that they used a synthetic dataset to pre-train their model weights. The authors have claimed that their proposed model has achieved acceptable results compared to other methods of identifying mental disorders.
By studying the empirical literature, it is concluded that most previous sources have received acceptable results from machine learning methods, especially techniques based on neural networks. But the problem of the sequential and temporal nature of data has been ignored in previous related studies. Therefore, providing an efficient solution based on the effectiveness of the sequential and temporal nature of data in the process of identifying people's mental disorders in social networks can be raised as a research innovation.