A. Dataset Description & Real Anomaly
Stock market data of listed companies was used to train the models. Each company's history of manipulation was researched through search engines and articles from reputable media houses. Stock regulators such as SEBI (the Securities and Exchange Board of India) publicly disclose the details of such cases, and these disclosures were checked for the periods during which the stocks were under investigation. Those periods were marked as anomalous. Note that the data was limited to Indian markets within the scope of this research, but the methodology can easily be extended to stock data from any market.
Our primary goal is to detect contextual anomalies. These observations stand out only in comparison to nearby data points, rather than being considered anomalies when compared to all other observations. It is essential to provide compelling reasons for categorizing them as anomalous.
The data was downloaded from the official site of BSE (Bombay Stock Exchange) by searching for their Security ID/Name.
Sadhna Broadcast Ltd. The Securities and Exchange Board of India (SEBI) has found that the stock prices of Sadhna Broadcast Ltd were manipulated through misleading videos on some YouTube channels [11]. The videos falsely claimed that the company was going to be acquired by the Adani Group and had signed big contracts with Sony Pictures and Zee. This caused retail investors to buy the stock, driving up the price.
Sharpline Broadcast Ltd. Sharpline’s stock was involved in the same scam as Sadhna Broadcast Ltd [11].
The periods in which these stocks were under review by SEBI are provided in Table 1.
Table 1
Periods under investigation by SEBI

| Name | Start | End |
|---|---|---|
| Sadhna Broadcast Ltd. | April 2022 | September 2022 |
| Sharpline Broadcast Ltd. | April 2022 | August 2022 |
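The labelling step described above can be sketched as follows. The column names, single-stock frame, and exact day-level boundaries are illustrative assumptions, not the exact preprocessing used in this study.

```python
import pandas as pd

# Hypothetical daily OHLCV data for one stock (column names are assumptions).
df = pd.DataFrame({
    "Date": pd.date_range("2022-01-01", periods=365, freq="D"),
    "Close": 100.0,
    "Volume": 10_000,
})

# Mark rows inside the SEBI investigation window as anomalous (cf. Table 1).
start, end = pd.Timestamp("2022-04-01"), pd.Timestamp("2022-09-30")
df["anomaly"] = df["Date"].between(start, end).astype(int)

print(df["anomaly"].sum())  # number of days labelled anomalous
```

The resulting binary column serves as the ground-truth label when evaluating the detectors described below.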
a. Statistical Approach – Benford’s Law
Leveraging Benford's Law enables the identification of potential fraud within datasets conforming to this fundamental statistical principle. The first-digit law inherent in Benford's Law posits that, in numerous naturally occurring datasets, the initial digit of a number tends to be smaller rather than larger. It anticipates that the digit '1' should occur approximately 30% of the time, whereas the digit '9' should appear in less than 5% of instances. The distribution of these occurrences is visually depicted in Fig. 1.
An altered or fabricated dataset may deviate significantly from the expected first-digit distribution. For example, an unusually high proportion of numbers starting with 9 in a company's financial statements could signal fraudulent activity. In our study, we examined the total number of shares (Volume) column, which represents the volume of shares traded each day. Our analysis of the dataset using Benford's Law is consistent with manipulation having taken place.
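As an illustration, the expected Benford distribution and the observed first-digit distribution of a Volume-like column can be computed as follows. This is a minimal sketch; the statistic used to compare the two distributions (e.g. a chi-square test) is left out.

```python
import numpy as np

def benford_expected():
    """Expected first-digit probabilities under Benford's Law: P(d) = log10(1 + 1/d)."""
    digits = np.arange(1, 10)
    return np.log10(1 + 1 / digits)

def first_digit_distribution(values):
    """Observed first-digit frequencies of the positive entries in `values`."""
    values = np.asarray(values, dtype=float)
    values = values[values > 0]
    # Shift each value into [1, 10) and truncate to get its leading digit.
    first = (values / 10 ** np.floor(np.log10(values))).astype(int)
    counts = np.bincount(first, minlength=10)[1:10]
    return counts / counts.sum()

expected = benford_expected()
print(expected[0])  # P(first digit = 1) ≈ 0.301
print(expected[8])  # P(first digit = 9) ≈ 0.046
```

A large discrepancy between `first_digit_distribution(volume)` and `benford_expected()` is what flags a series as suspicious.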
Benford's Law is a simple statistical test for data manipulation, but it yields only a binary answer: manipulation is either detected or not. Since we need the precise periods of manipulation, we further investigated LSTM autoencoders and TadGAN in our research, and we found that these algorithms performed better at detecting both the manipulation and its precise period.
b. LSTM Autoencoder
Srivastava et al. [18] made an early observation that LSTMs could be enhanced by learning embeddings from an encoder-decoder model. They introduced neural networks with multiple layers of Long Short-Term Memory (LSTM) cells to learn representations of sequential data. The encoder-decoder LSTM reads, encodes, decodes, and reproduces the input sequences of a given dataset, and model performance is evaluated by how accurately it reproduces those sequences. Once the required performance level is reached, the decoder portion of the model can be discarded, leaving only the encoder. This design encodes input sequences into a fixed-length vector, enabling effective processing of sequential data, capture of temporal patterns, and generation of the desired outputs. Figure 3 illustrates the model architecture generated using the TensorFlow v2 API, and the training process is outlined in Fig. 2.
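A minimal sketch of such an encoder-decoder LSTM in the TensorFlow v2 Keras API follows. The layer sizes, window length, and the use of mean-squared reconstruction error as the anomaly score are illustrative assumptions, not the exact configuration used in this study.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

TIMESTEPS, FEATURES = 30, 1  # e.g. 30-day windows of daily volume

model = keras.Sequential([
    keras.Input(shape=(TIMESTEPS, FEATURES)),
    # Encoder: compress the window into a fixed-length vector.
    layers.LSTM(64),
    # Repeat the encoding so the decoder can unroll it over time.
    layers.RepeatVector(TIMESTEPS),
    # Decoder: reconstruct the original sequence step by step.
    layers.LSTM(64, return_sequences=True),
    layers.TimeDistributed(layers.Dense(FEATURES)),
])
model.compile(optimizer="adam", loss="mse")

# After training, the reconstruction error per window serves as the anomaly score.
windows = np.random.rand(8, TIMESTEPS, FEATURES).astype("float32")
recon = model.predict(windows, verbose=0)
errors = np.mean((windows - recon) ** 2, axis=(1, 2))
print(errors.shape)  # one score per window
```

Windows whose reconstruction error exceeds a chosen threshold are flagged as anomalous.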
c. LSTM with Dynamic Thresholding
The approach is an algorithmic technique for detecting anomalies in temporal data sequences. Leveraging Long Short-Term Memory (LSTM) networks, the model captures associations between preceding and current data points, encoding these connections through numerically optimized weights. After predictive outputs are generated, an unsupervised, dynamic, and nonparametric method is used to evaluate the residual values. This circumvents challenges such as heterogeneity, non-stationarity, and stochastic noise that often confound automated threshold determination in data streams with fluctuating behavioural patterns and value distributions. By adjusting responsively to the variance of the prediction errors, the dynamic threshold remains relatively low when errors show minor deviations and escalates when deviations are more substantial. The LSTM with dynamic thresholding model has been validated empirically across multiple domains, from the identification of anomalies in aerospace systems [14] to predictive analytics in healthcare [15] and transportation planning [16].
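The dynamic threshold takes the form eps = mu(e) + z * sigma(e) over the prediction errors e. The sketch below is a deliberately simplified version of the nonparametric criterion of Hundman et al. [14], with the pruning step omitted and the scoring function reduced to its core idea: pick the z whose threshold most reduces the mean and spread of the remaining errors, penalised by the number of points flagged.

```python
import numpy as np

def dynamic_threshold(errors, z_range=np.arange(2.0, 6.0, 0.5)):
    """Simplified nonparametric dynamic threshold on prediction errors."""
    mu, sigma = errors.mean(), errors.std()
    best_eps, best_score = mu + z_range[0] * sigma, -np.inf
    for z in z_range:
        eps = mu + z * sigma
        below = errors[errors <= eps]
        n_above = int((errors > eps).sum())
        if n_above == 0 or len(below) == 0:
            continue
        # Relative drop in mean and std once flagged errors are removed,
        # penalised by how many points had to be flagged to achieve it.
        delta_mu = (mu - below.mean()) / mu
        delta_sigma = (sigma - below.std()) / sigma
        score = (delta_mu + delta_sigma) / n_above
        if score > best_score:
            best_eps, best_score = eps, score
    return best_eps

# Mostly small errors with a few large deviations: only the deviations exceed eps.
errs = np.array([0.1] * 50 + [0.12] * 45 + [2.0, 2.5, 3.0, 2.2, 2.8])
eps = dynamic_threshold(errs)
print((errs > eps).sum())  # → 5
```

Because the threshold is recomputed from the current error distribution, it adapts as the stream's behaviour changes, which is the property the paragraph above describes.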
d. TadGAN
As described by Liu et al., TadGAN [12] offers a performance-efficient and generalisable approach to anomaly detection. Using an unsupervised adversarial learning approach, it captures the temporal correlations of the time-series distribution, and the cycle-consistency loss described in the original paper enables efficient reconstruction of the time series. Only the Generator and Encoder are used to reconstruct signals, which can be represented as
$$\hat{s} = G\left(E\left(s\right)\right) \approx s$$
Intuitively, the Generator and Encoder should not be able to reconstruct anomalies; anomalous stock data should therefore deviate from the reconstruction \(\hat s\). The critic \({C}_{x}\) is responsible for identifying which windows of \(\hat s\) are anomalous. The architecture of TadGAN is represented in Fig. 3 & Fig. 4, and the model was trained using the pipeline shown in Fig. 5. Specifically, the TadGAN model was configured with an input sequence length of 100, a latent space of 20 dimensions, a batch size of 64, a single-layer bidirectional LSTM with 100 hidden units for the Encoder, a two-layer bidirectional LSTM with 64 hidden units for the Generator, a one-dimensional convolutional layer for the Critics, and 25 training epochs. The stock series was then segmented into sub-sequences using the default window size of 100.
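TadGAN's final anomaly score fuses the reconstruction errors with the critic outputs. A minimal sketch of that fusion step follows, assuming both signals have already been computed per window and that larger critic scores indicate more anomalous windows (the sign convention varies by implementation).

```python
import numpy as np

def tadgan_anomaly_score(rec_errors, critic_scores, alpha=0.5):
    """Combine reconstruction errors and critic outputs into one anomaly score.

    Both signals are standardised (z-scored) first so they sit on a comparable
    scale; `alpha` weights reconstruction error against the critic signal.
    """
    def zscore(x):
        x = np.asarray(x, dtype=float)
        return (x - x.mean()) / x.std()

    return alpha * zscore(rec_errors) + (1 - alpha) * zscore(critic_scores)

rec = [0.1, 0.1, 0.9, 0.1]   # window 2 is poorly reconstructed
crit = [0.2, 0.1, 0.8, 0.2]  # the critic also flags window 2
scores = tadgan_anomaly_score(rec, crit)
print(int(np.argmax(scores)))  # → 2
```

Windows whose combined score exceeds a threshold (for instance, the dynamic threshold above) are reported as anomalous periods.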
e. Auto-Encoder with Regression (AER)
Wong et al. [17] introduced the Auto-Encoder with Regression (AER) model, an unsupervised anomaly detection architecture that combines a vanilla autoencoder with an LSTM regressor, joining reconstruction-based and prediction-based detection in a single jointly trained model. The AER model is further distinguished by how it calculates bidirectional prediction errors and strategically fuses them with reconstruction errors to derive a comprehensive anomaly score. Its architecture is adept at processing sequential data, identifying temporal patterns, and executing precise anomaly detection, and its model architecture diagram was generated using the TensorFlow v2 API, as referenced in Wong et al.'s research.
The architectures of the AER encoder and decoder are represented in Fig. 8 and Fig. 10, respectively. The model was trained using the pipeline shown in Fig. 9.
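AER derives its anomaly score by fusing prediction errors (in both time directions) with reconstruction errors. To illustrate the bidirectional-error idea without the full network, the sketch below stands in a toy moving-average forecaster for the learned regressor; it is purely illustrative and not the actual AER architecture.

```python
import numpy as np

def aer_style_scores(series, window=5):
    """Per-point anomaly scores from bidirectional one-step 'prediction' errors.

    A moving-average forecast in each direction plays the role of the learned
    regressor; the two error signals are then fused by averaging.
    """
    s = np.asarray(series, dtype=float)
    n = len(s)
    fwd = np.full(n, np.nan)  # error of predicting s[t] from the past window
    bwd = np.full(n, np.nan)  # error of predicting s[t] from the future window
    for t in range(window, n):
        fwd[t] = abs(s[t] - s[t - window:t].mean())
    for t in range(n - window):
        bwd[t] = abs(s[t] - s[t + 1:t + 1 + window].mean())
    # Fuse the two error signals; where one direction is unavailable
    # (series edges), fall back to the other.
    return np.nanmean(np.vstack([fwd, bwd]), axis=0)

series = [1.0] * 20
series[10] = 8.0  # injected spike
scores = aer_style_scores(series)
print(int(np.argmax(scores)))  # → 10
```

In the actual AER model the forecasts come from the jointly trained LSTM regressor, and the fused errors are thresholded to mark the anomalous periods.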