3.2 Main techniques/tools and how they are used
Some of the unique data analytics tools include:
3.2.1. Drones
Also known as unmanned aerial vehicle (UAV), a drone is any aerial vehicle that can be flown remotely by a pilot or that uses software to fly autonomously [31]. Numerous unmanned aircraft have cameras for gathering visual data and propellers to steady flight patterns [11]. Drone technology is progressively being incorporated into sectors such as videography, search and rescue, agriculture, and transportation [32], [33]. In the context of fisheries management, drones that are outfitted with cameras or other sensing equipment are being utilized more and more to study the ocean. Some even have the ability to navigate underwater [34]. Drones are more adaptable and less expensive than oceanographic vessels [35]. They can also offer a more thorough sampling of the surroundings when delivered in groups [36]. In fact, despite covering a smaller area than satellites, drones can produce more precise photos that enable the detection of tiny objects or events [17].
In addition, given that various drones can fly at various elevations and distances, small scale fisheries and near shore marine coastal fisheries can utilize very close-range drones, which can often reach up to three miles [4]. Close-range UAVs have a 30 mile maximum range. Short-range drones, which can fly up to 90 miles, are typically utilized for espionage and intelligence gathering [31]. Mid-range unmanned aerial vehicles (UAVs) have a 400-mile range and can be used for meteorological research [32], marine scientific studies on species such as dolphins and sharks [30], and intelligence gathering [37].
3.2.2. Satellites
According to NASA [37], a satellite is a moon, planet, or object in orbit around a planet or star that serves primarily as a communications tool, transmitting things like phone calls and TV broadcasts around the globe (www.nasa.gov). Satellites have potential to see vast areas of Earth at once due to their eagle's eye viewpoint [31]. Satellites can therefore acquire information more rapidly and effectively than devices on the ground [38]. In the sustainable fisheries management domain, we relate that Satellites equipped with optical and radar sensors can provide an unprecedented level of spatial and temporal resolution [17], which makes them particularly useful for monitoring [4], [39]. The electromagnetic spectrum's whole range of light reflected by the earth's surface is measured by optical sensors [32]. These data can be used to derive crucial oceanic metrics [40] like water temperature and turbidity [8]. Microwave radiation is emitted by radar sensors, which measure the amount that is reflected back to the device [32]. They can offer information about the topography of the water [41], winds, target species detection [42], and vessel movement [1]. Radar devices, unlike optical sensors, can gather data even in bad weather and lighting situations, such as when the sky is overcast or dark [18].
3.2.4. Vessel onboard and underwater surveillance devices
These devices are mostly utilized in the detection, control and monitoring of potential risks in marine areas [30]. Unmanned robots, autonomous underwater vehicles, and unmanned underwater vehicles (UUVs) are a few of the equipment that can locate and identify dangerous and unsafe behaviors in the marine environments [4] such as the seabed [30]. According to the US National Oceanic and Atmospheric Administration [32] traditional ocean monitoring mechanisms were mainly laborious involving the use of observers and manual recording onboard vessels and new devices have bridged these gaps.
For instance, onboard sensors have the ability to automate and simplify this time-consuming procedure [43] while also producing more thorough and trustworthy data that may be included into platforms known as electronic monitoring systems (EMSs) [30]. Additionally, tools like Vehicle Monitoring Systems (VMS) [17] and the Automatic Identification System (AIS) can gather data on a vessel's position [5], speed, and direction [38], supplement radar systems [1], and reduce the likelihood of marine collisions [4].
Based on the evidence of the myriad benefits of data analytics tools and techniques shown above, we contend that data analytics has innumerable benefits related to sustainable fisheries management. One pertinent aspect that needs to be focused on; as data analytics is a new paradigm shift [28], is the spatial spread related to where or which countries and regions use some of these technologies in fisheries practices [3], [4]; which trend we explore in the next section.
3.4 Relevance of Data Analytics
To show the relevance of data analytics in sustainable fisheries management, we breakdown each data analytics technique to show which fishing ecosystem/zone or value chain (Table 2) it can be applied to among others coastal coral reef and offshore zones [47], [48], high seas [49], and in marine [50] and freshwater species zones [51], [52]; so as to mitigate the myriad fisheries management challenges [3] such as marine pollution [53], fish supply chains [54], species stock assessments [55], IUU [56], and sea grass assessments [57] among others.
Table 2
Data analytics techniques, their relevance and where they are used
Data Analytic Method | Where is the method used | Relevance of Data Analytics | Author |
Intelligent Analytics | River depth/River flow | Real time catchment regulation and water management | [58] |
Big data, remote sensing, machine learning | Coastal and marine zones | Transform enormous amounts of data for decision making | [47] |
Down scaling method; statistical methods and hybrid dynamical-statistical methods | Estuarine zones | Evaluation of salinity and estuarine surface temperature | [59] |
Multiple simulation models and observational datasets | Red Sea | Ocean winds, weather forecasting, | [60] |
Bayesian hierarchical methods | High sea | Analysis of trans boundary fisheries | [49] |
DBSIRM model – Big data analytics technology | Yellow sea | Fish index assessment and sustainable growth/development | [55] |
LADS surveys - Bathymetry | Marine benthic species | Swath mapping, quantitative framework for spatial data management | [61] |
GPS tracking technology | Coral reef zones | Segmentation of vessel trajectories into fishing and non-fishing activities | [48] |
PesKAAS application | Timor and Banda Sea | Gives near real time information on catch and effort | [62] |
Predictive analysis, information fusion, and visual analytics | Dovar straight | Automatic identification of structural abnormalities, the prediction of vessel itineraries | [56] |
Predictive analytics and time series analysis | North Atlantic and Labrador Sea | Ocean monitoring, fish food supply chain and spot trends/decision making | [54] |
Principal component analysis (PCA), cluster analysis | Deep Sea, cold coral zones, canyons | Assessment of potential habitat extent in the deep sea | [63] |
mKRISHI® Fisheries analytic system | Indian Ocean/ deep Sea-30km | Identification of risk zones in a fishery system | [15] |
Geovisual analytics, Hybrid spatial temporal filtering (HSF) and automated behavioural change point analysis | Pacific and Atlantic ocean | Assess how well interactive and automated mapping solutions enhance fisheries enforcement efforts | [64] |
Triplot of Redundancy Analyses (RDA) and Pearson correlation | Benthic zones | Assessment of morphological changes in benthic species | [65] |
Multivariate statistical analysis, Principal Component Analysis (PCA) and Hierarchical Cluster Analysis (HCA | Fresh water ecosystems | Assessment of marine pollution and eutrophication in aquatic systems. Identification of distinct groupings of variables and understand the correlations between them | [51] |
AI to pattern analysis and information prediction | Fresh water ecosystems | Aquaculture improvement in small scale fisheries | [52] |
Log transformation, R, SPSS | Deep sea | Determining micro plastic pollution in zoo plankton | [53] |
Sensor information systems and data analysis using scatter plots for visualization | Arabian sea | To find enabling trends and insights in fish ecosystems and value chains | [66] |
Google Earth Engine (GEE), Regression plots | Near shore, Sea grass ecosystems and Island zones, and Arabian Sea | Multi-temporal analysis of Sea grass losses | [57] |
Satellite photos | Coastal zones of India | Assessment of ocean acidification/ environmental, social and governance | [39] |
Non-parametric Kruskal-Wallis test, permutational multivariate analysis of variance (PERMANOVA) | Coastal mangrove ecosystems | Spatial temporal data on and quantification of marine litter/pollution in mangroves | [67] |
Remote sensing and video surveillance devices | Indian coast/ Arabian Sea | To detect spatial and temporal changes in beach ecosystems | [68] |
Multivariate techniques, such as PCO analysis | Estuaries | Demonstrates spatial temporal changes in micro plastics pollution in fishes | [69] |
Statistical data analysis, Pearson Product Moment Correlation analysis | Arabian Sea, Indian coast, Marine fish products | Estimate micro plastic presence in fishes | [70] |
SPSS software version 24 and Excel 2019, Shapiro-Wilk test, Pearson correlation tests | Persian Gulf | Understand the concentration and marine plastics in fishes | [71] |
Machine learning methods Linear Regression, Random Forest (RF), and Support Vector Regression (SVR) | Benthic and coastal zones | Estimating water turbidity, chlorophyll (macro and micro algae) | [72] |
Principal Component Analysis (PCA) using R, | Native coastal and fresh water ecosystems/zones | Climate change and species vulnerability assessment | [73] |
Pearson correlation, cluster analysis, and principle component analysis (PCA). For instance, cluster analysis | Ghana/Gulf of Guinea | Radiological evaluation of beach sediments and species risk | [74] |
Multivariate analysis e.g. Pearson correlation, principal component analysis, and cluster analysis. Principal component analysis (PCA) and cluster analysis | Southeast Bay of Bengal | Estimate the health risk posed by consuming tainted seafood by giving useful insights into the distribution of heavy metals in the seafood | [75] |
RD-TFD and wordnet ontology features, tokenization, stopword removal, and stemming algorithms: Statistical metrics like precision, recall, F-measure, and accuracy | Marine ecosystems (Jelly fish clusters) | To analyse and group jelly fish clusters | [50] |
Artificial Intelligence (AI) models e.g. tray based on a deep-learning, simple online and real-time tracking algorithms | Coastal aquaculture zones/China | Making best technology and data driven decisions for sustainable prawn farm management | [76] |
Univariate and multivariate statistics e.g. fixed permutational analysis of variance (PERMANOVA), Canonical Analysis of Principal Coordinates (CAP) | MPAs, Coral zones and non-protected zones | Compare fish conservation and management policies in MPAs and non MPA zones | [77] |
Deep learning models and dimensionality reduction methods | Marine/Deep sea | Precise and effective monitoring cetaceans (marine mammals) | [78] |
Machine learning and data mining method | Deep sea fishing vessels | Creating decision support tools in marine spatial planning / sustainable /harvesting | [79] |
Machine learning and computer vision techniques, Python, and Annotators | Marine ecosystems | Increasing understanding of long term stock fluctuations and environmental changes | [80] |
R; REPHYTOX for data simulation | Marine and coastal areas | Informing management on scallops, effects of pollution, SST, and nutrient concentrations | [81] |
Machine learning and statistical modelling solution - Haul biomass Index Estimator (HBIE) | High Sea | To fill gaps in trawl surveys | [82] |
ArcGIS, Google Earth | Coastal marsh land zones | Analysis of microplastic characteristics and distribution | [83] |
Visualization tools | USA coast | Obtain insights into spatial dynamics of anglers’ behaviour and fish populations | [84] |
Isotope ratios, and Internet of Things (IoT) | Mediterranean coast | Identify patterns and insights of sea food/ fish supply chains | [85] |
Remote sensing, modelling and sampling designs | Fresh water and riverine ecosystems | Shape fish populations and ecosystems dynamics | [86] |
Logistic regression model | Migratory Salmon Species / River estuaries | Sex identification of returning / spawning salmons | [87] |
Clustering analysis using R | East and South China Sea | Understand mixed trawl fisheries characteristics and fishing habits of target species. | [43] |
Bhattacharya technique, FISAT software and Barefoot Ecologist Toolbox | Java Sea | Understand the spawning potential ratio of invasive species | [88] |
Generalised linear mixed model (GLMM) | Fresh water ecosystems | Estimate fish effort from remote traffic encounters | [35] |
3.4.1 Relevance in coastal and near-shore zones fisheries management
Coastal zones are a rich habitat for species and ecosystems [47], [61] such as mangroves [88], estuaries [59] sea grasses [57], and coral reefs [48], [87] among others which not only provide fish resources but are crucial management zones [3]. However, the increase in fish effort coupled with management gaps and IUU fishing [3], [46], [75] are threatening the sustainability of such ecosystems such as migratory species [87]. With this hindsight, there’s growing evidence that data analytics could be applied [5].
Recent studies in the review revealed that data analytics methods and techniques are relevant in coastal zones as they aid benthic species mapping using the Triplot of Redundnacy Analyses (RDA), Pearson correlation and big data, remote sensing, machine learning and Bathymetry LADS surveys in marine benthic zones respectively [61], [65], decision making on species conservation and management using artificial intelligence models such as tray-based on machine learning, simple online and real time tracking algorithms in coastal aquaculture zones of China, univariate and multivariate statistics such as fixed Permutational Analysis of Variance PERMANOVA, Canonical Analysis of Principal Coordinates (CAP) in Marine Protected Areas (MPAs), coral and non-protected zones, and big data, remote sensing, machine learning respectively [47], [76], [77], determination of salinity and surface temperatures in estuaries using the downscaling method, statistical and hybrid dynamical-statistical methods [59], segmentation of fishing vessel trajectories in coral reef zones using Global Positioning Systems (GPS) [48], determination of coastal fish vulnerability zones in the Indian Ocean using the mKRISHI Fisheries Analytic system and App (Singh, 2016), analysis of sea grass losses in near shore ecosystems, Island zones and the Arabian sea using Google Earth Engine (GEE) and regression plots [57], detection and quantification of marine litter in mangrove zones using the Non-Parametric Kruskal-Wallis test and PERMANOVA [67], identification of changes in micro plastics pollution in coastal fish species in estuaries using multivariate techniques such as PCO analysis [69] and microplastic distribution in coastal marshlands using ArcGIS and Google Earth [83], radiological evaluation of the risk of species habituating along and in beaches such as crabs along the Gulf of Guinea using Pearson correlation, cluster analysis and Principle Component Analysis (PCA) [74], long-term understanding of species shocks and fluctuations due to environmental change using machine learning, computer vision techniques, Python, and Annotators [80], and obtaining insights into the spatial dynamics of fish anglers behaviors to fish populations especially along the US coast using visualization tools [84].
Some of the major breakthroughs in managing coastal pollution have been reported by studies conducted in estuaries in Tamil Nadu in India [69], marshland zones [83], and in mangrove ecosystems in the Central West coast of India [67]. In the study, Jeyasenta [69], represented the geographic similarity patterns of micro plastics in the water and sediment of the assessment sites across both seasons using multivariate techniques, such as PCO analysis. The software tool PRIMER ver. 7.0 was used to carry out these investigations. Additionally, the SPSS 20.0 software Programme was used to do ANOVA and correlation statistical tests. As a result, the analysis and interpretation of the data gathered for this study heavily relied on data analytics. In addition, De et al. [67] analysis and interpretation of the data gathered during the fieldwork is important to the use of data analytics. The data were analyzed using a variety of statistical techniques, including the non-parametric Kruskal-Wallis test, paired Mann-Whitney tests, permutational multivariate analysis of variance (PERMANOVA), and homogeneity tests for multivariate dispersion. With the aid of these techniques, it was possible to assess the significance of spatial variation differences in litter abundance among mangrove stands based on habitat characteristics, concentration differences between rural, peri-urban, and megacity regions, and variation in litter abundance between mangrove floors and canopies based on location. The paper also discusses the techniques for data analysis used to evaluate the homogeneity and normality of variances. The findings of these analyses shed light on the effects of anthropogenic litter pollution on India's coastal mangroves and can be used to build efficient management measures to lessen these effects. With this insights, we can argue that a whole spectrum of sustainable fisheries complexities in coastal zones-a beehive to several environmental and anthropogenic shocks in fisheries management [3] could be mitigated via data analytics.
3.4.2 Relevance in deep sea fisheries management
Deep sea fisheries are some of the most complex in management partly due to their presence in the ‘global common’ and the technicalities in monitoring high seas [3]. Fortunately, recent advances in technology could help mitigate this gap [4]. Extracted documents revealed that data analytics is increasingly being used to tackle deep sea fishing challenges [17]. For instance, multiple simulation models and observational datasets have been used in the Red Sea to monitor ocean winds, and weather forecasting [60], Bayesian hierarchical methods are used in the analysis of transboundary fisheries [49], the DBSIRM model-a big data analytics technology is being used in the Yellow sea for fish index assessments, sustainable growth and development [55], the PesKAAS application has been used in the Timor and Banda sea to give near real-time information on fish catch and effort [62], in the Dover straight, Alessandrini [56] used predictive analysis, information fusion and visual analytics for automatic identification of structural abnormalities in fishes and prediction of vessel itineraries well as Coronado [54] used predictive analytics and time series analysis for ocean monitoring, food supply chain and spot trends in fisheries to inform decision making in Canada’s North Atlantic and Labrador sea. Other breakthroughs in the deep sea include the assessment of potential habitat extent in deep sea cold coral zones and submarine canyons using PCA and cluster analysis [63]. In the Pacific and Atlantic ocean, geo-visual analytics, Hybrid Spatial-temporal filtering (HSF) and automated behavioral change point analysis have been used to assess how well interactive and automated mapping solutions enhance fisheries enforcement efforts [64]. In this study, the comparison of various methods for visualizing and examining movement data collected from Vessel Monitoring System (VMS) data was crucially emphasized. Making sense of the complicated and enormous VMS data sets requires the use of geo-visual analytics and other data analysis techniques. This knowledge can help with decision-making and enhance the management of fisheries resources.
In addition, log transformations, R, and SPSS are increasingly being used in deep sea fishing to determine micro plastic pollution in zooplankton due to the increase in ocean gyres especially in the Indian Ocean and Arabian Sea [53]. Related studies on micro plastic presence in fishes and fish products in the Arabian Sea and Indian Ocean have used advanced methods such as statistical data analysis, Pearson Product Correlation Analysis [70] and SPSS software version 24, Excel 2019, Shapiro-Wilk tests and Pearson correlation tests to understand the concentration of micro plastics in fishes; especially in the Persian gulf [71]. In addition, sensor information systems and data analytics using scatter plots for visualization is used in the Arabian sea to find enabling trends and insights in fish ecosystems and value chains [66] and satellite photos are increasingly becoming relevant in the assessment of ocean acidification, environmental and social governance especially in the Indian ocean zones [39] as well as remote sensing and video surveillance devices to detect spatial and temporal changes in ecosystems [68].
In the high sea benthic zones, machine learning methods using linear regression, Random Forest (RF) and Support Vector Regression (SVR) are used to estimate water turbidity, and chlorophyll (mirco and macro algae blooms) [72]. In addition, deep learning models and dimensionality reduction methods are used to precisely and effectively monitor marine mammals-cetaceans especially in Canada [78]. To achieve this, a robust analysis of massive datasets of photographs of cetaceans combined deep learning models and dimensionality reduction methods is used. This is through the creation of a binary land cover map with human verification as well, which is utilized to keep out of further analysis photos completely engulfed in land. Additionally, in this study, there was avoiding of manual examination of redundant photos during a single iteration by using clustering methods to choose a variety of representative images to annotate. Overall, the researchers were able to create a more effective and precise method of monitoring cetaceans thanks to the application of data analytics.
In the management of scallops and informing management on the scallop diversity and shocks due to the effects of marine pollution, Sea Surface Temperature (SST) changes and nutrient concentrations, R and REPHYTOX for data simulation has come in handy [81]. In the study conducted around the French-English Channel, Chenouf [81] a dataset that was produced using an algorithm that translates the effects of harmful algal blooms (HABs) in terms of fishing area closures by fusing information on regulations, in situ data on phycotoxin concentrations in scallops, and data on fisheries management was created. This algorithm is useful for analyzing management approaches to HABs. This dataset can be further examined using data analytics to glean insights that can guide management and decision-making tactics. This new advance is relevant, for instance, to spot patterns and trends in HAB occurrence and their effects over time on scallop production regions. It can also be used to pinpoint elements like water temperature, nutrient concentrations, and environmental circumstances that influence the development of HABs and the degree of toxicity they exhibit. Overall, data analytics can aid in enhancing our comprehension of the intricate relationships that exist between HABs, scallop production, and human activities in marine and coastal areas and can help informing the creation of more efficient management strategies to lessen the effects of HABs on these activities.
In addition, to fill gaps in trawl surveys, machine learning and statistical modeling solution-Haul Biomass Index Estimator (HBIE) is used [82]. For instance, in a research conducted in the Adriatic Sea and around Italy, advanced spatiotemporal and environmental modelling tools were used to fill in the gaps in trawl survey data. These methods entail the evaluation of sizable data sets, including information on fish biomass, environmental factors, and survey hauls [82]. This data and methodology may be applied to data augmentation, re-application to data from other scientific surveys, and haul contribution analysis—all of which could profit from data analytics methods. Related to the above, a study conducted in the East and China Sea zone used clustering analysis using R to understand mixed trawl fisheries characteristics and fishing habits of targeted species [43]. For instance, in the area around Taiwan, The vast amount of data gathered from the mixed trawl fisheries in Taiwan the data was analysed and interpreted using clustering analysis to classify catch métiers and group fishing expeditions with comparable catch compositions. With this method, it is possible to comprehend mixed fisheries' characteristics and the fishing habits of various target species better. Additionally, a two-step methodology is employed to categorize pertinent clusters and pinpoint particular catch trends. The application of data analytics in this study offers insightful information about the fishery's structure and can guide sustainable management strategies.
Other advances in the deep sea domain have involved the use of Logistic regression model to identify the sex of returning and spawning migratory salmons [5], [87]. For instance, in the study conducted in and around the River Yukon estuary in Canada, a morphometric model using a logistic regression model to determine the sex of Chinook Salmon that were returning to spawn was created. The fish's standard length (SL) and maximum eye diameter (MEF), as well as other demographic information, were used to create the model. The model was then put through a cross-validation evaluation and contrasted with other sex identification techniques [87]. The R statistical software suite was used to carry out each of these studies. As a result, data analytics were essential to the creation and assessment of the morphometric model.
In addition, in the Java Sea, the Bhattacharya technique, FISAT software, and Barefoot Ecologist Toolbox have been used to understand the spawning potential of invasive species [88]. In a study off Indonesia, these data analytics tools are utilized to analyze the information gathered about the length, weight, fecundity, and gonadal maturity of invasive crayfish in Java. The length data is specifically analyzed using the Bhattacharya technique, and the length at first maturity, selectivity capture, mortality fishing, natural mortality, and spawning potential ratio (SPR) are specifically analyzed using FISAT software. The SPR is also examined using the programme from the Barefoot Ecologist Toolbox [88]. With the use of these data analytics techniques, it is possible to gain important insights on the biology and reproduction of invasive crayfish, which are crucial for foretelling how well they will integrate into Java's ecosystem and creating long-term management plans.
3.4.3 Relevance in fishing vessels management
In the context of coastal and high sea fishing vessels management, a combination of data analytics methods have proved crucial [4]. For instance, for deep sea fishing vessels, machine learning, and data mining methods have been used to create decision support tools in marine spatial planning and sustainable fish resource harvesting [79]. Several studies have reported that fishing fleet management requires the integration of substantial volumes of data from numerous sources, including vessel monitoring systems, automatic identification systems, and satellite photography [4] and data analytics plays a significant role in this perspective [17]. To find patterns and trends in fishing activity, stock abundance, and environmental conditions, the data can be examined using machine learning and data mining methods [79] including tools such as drones [31]. This data is then employed to create decision support tools that can help with marine spatial planning, sustainable stock harvesting, and extensive fishing activity monitoring. Data analytics can also aid in identifying knowledge gaps and prioritizing research efforts to close those gaps.
In addition, Singh [15] proposed the mKRISHI fish analytic and system App for small fishing vessels around the Indian Ocean zone that can identify vessels risks and vessels under stress to about 30 kilometers offshore. The system streamlines the distribution of information about vessels, and ocean conditions/forecasts on ocean winds and waves to mobile phones in local languages, assisting fishers, their families, and other stakeholders in identifying risk zones, their occurrence date and time, and responding appropriately [15]. Numerous fishermen are saved from death and risk exposure thanks to this data-driven strategy.
3.4.4 Relevance in freshwater ecosystems management
The relevance of data analytics transcends marine zones as research has demonstrated their application into freshwater fisheries and ecosystem zones [5]. For instance, Petri et al [58] recommended the use of intelligent analytics for real-time catchment regulation and water management in river depth and river flow assessments. This can be done through the utilization of various data analytics techniques such as drones, and satellites [32] to accurately predict river depth, river flow and rainfall up to five days in advance, which data is further analyzed to assess the risk and support informed decisions-making related to regulating the catchment ecosystem [58]. Based on this insight, we argue that data analytics plays a crucial role by providing the necessary insights and information to optimize catchment flow and conserve water resources.
In addition, freshwater ecosystems are increasingly being threatened by eutrophication and pollution [3] and in this case data analytics methods such as PCA, multivariate statistical analysis and Hierarchical Cluster Analysis (HCA) could be leveraged to identify distinct groupings of pollution variables and understand correlations among them [51]. For instance, in a study conducted to identify natural and/or anthropogenic sources of heavy metals and to aid in the interpretation of geochemical data in Lake Ahansar in India, multivariate statistical analysis was used. The correlations between several water quality metrics and heavy metal concentrations were specifically examined using Principal Component Analysis (PCA) and Hierarchical Cluster Analysis (HCA) [51]. In order to interpret the data and reach meaningful conclusions about the historical trends of heavy metal contamination and eutrophication in the Ahansar lake system, these data analytics techniques helped to identify distinct groupings of variables and understand the correlations between them.
In freshwater aquaculture zones and farms, Artificial Intelligence using pattern analysis and information prediction has been used to boost the performance of small scale aquaculture fisheries [52]. In a study conducted in backwater and freshwater aquaculture zones, the importance of data analytics in the context of aquaculture is mentioned. For instance, a study by Jaikumar [52] focused on the application of AI to pattern analysis and information prediction for fish growers based on information from surveys on aquaculture farms.
Freshwater ecosystems threatened by climate change and species vulnerability to climate change have also been monitored through data analytics methods such as PCA using R. A study by Lianthuamluaia et al. [73] in India proved that data analytics is quite important. Principal Component Analysis (PCA) was utilized in the study to analyze the data for water quality metrics using R software. Additionally, the analysis made use of information relevant to a certain region on the composition of inter-seasonal rainfall, yearly rainfall, and air temperature changes over the previous 35 years. The information was examined to determine how the study area's rainfall and temperature varied from the Long Period Average (LPA) kept by IMD. The results of this study can be utilized to create a database that will be used to create future conservation and exploitation plans for other small native freshwater fish in South Asia.
Along riverine ecosystem zones, data analytics tools such as Remote sensing, modeling and sample designs is relevant in understanding ecosystems dynamics and shaping of fish populations. For instance, in Riverscape study in USA by Torgersen [86], data analytics was very important because it's essential to the analysis stage. Data analysis is required to connect intricate patterns in heterogeneous data sets to ecological processes and forces that shape population and ecosystem dynamics. Data management (storage and geo-referencing, visualization, and processing (such as filtering, reduction, and manipulation) can be used to categorize the complexity of data analysis. This is relevant due to the fact that practitioners may need to acquire new skills or employ personnel with the necessary experience due to the particular challenges posed by enormous, geographically referenced data sets and sophisticated tools for statistical analysis and visualization. Data analytics is thus a crucial instrument for comprehending patterns and processes in a management environment as well as for making defensible choices regarding the kinds of data gathering that will be aimed at being studied.
Furthermore, in the review, we observed that data analytics methods such as Generalized Linear Mixed Models (GLMM) are relevant in estimating fish effort from remote traffic encounters; especially in over fished freshwater ecosystem zones. A study in Canada by Trudeau et al, [35], heavily relied on data analytics to estimate fishing activity throughout a fisheries landscape. To attain this, data from different sources, such as aerial surveys, creel surveys, and angler diary surveys was combined, using a generalized linear mixed model (GLMM). Trudeau et al [35] also evaluate the precision of their model predictions using cross-validation techniques. The most relevant recommendation from the study is that it is proposed that for estimating fishing effort based on lake features, nonparametric techniques such as random forests could be employed to account for nonlinear responses and interactions of lake characteristics. This overly proves our hypothesis that the effective integration and analysis of various data sources using data analytics is essential for estimating fishing effort in recreational fisheries
3.4.5 Other fisheries related management practices such as supply chain management and business
Since the 1990s global fisheries are increasingly being strained by the skyrocketing consumption and demand of both marine and freshwater species especially in western markets [3], [17]. This has not only led to increased fish effort but also complex fish value chains [4] dotted with IUU [3] and less regulation of fish trade value chains [16]. This requires the monitoring of value chains to trace the types of fish consumed, nature of processing, and general fish standards [3]. With this complex scenario, data analytics has emerged as a bridge. An insightful research by Palocci, [85] to identify patterns and insights of seafood and fish supply chains based on innovative data analytics tools supports this argument. In their study in the Mediterranean Sea zone in Europe, they significantly observed that isotope ratios, and Internet of Things (IoT) is a feasible option to identifying fish supply chains. They argued that, in order to manage, store, and analyze big data in a way that is useful for decision-making, new techniques are required using the ‘search engine concept’ as an alternative method for accessing and discovering data and metadata integration and analysis based on research inquiries about nutritional quality, food safety, authenticity, and transparency. Data analytics could therefore be very useful in this research because it can aid in the identification of patterns and insights from the vast volumes of data gathered from various food supply chains.
In the Bay of Bengal, methods such as multivariate analysis e.g. Pearson correlation, PCA, and cluster analysis have been used to estimate health risks posed by consuming contaminated seafood through giving insights into heavy metal pollution in species and seafood [50]. The authors suggested the use of a number of data analytics tools, including feature extraction using RD-TFD and wordnet ontology features, tokenization, stopword removal, and stemming algorithms. The documents are clustered using the specified clustering method after the retrieved features are used to identify key traits using COA. Additionally, statistical metrics like precision, recall, F-measure, and accuracy could be used to analyze the suggested methodology. In data analytics, these statistical metrics are frequently used to assess how well clustering algorithms function. Overall, data analytics is essential to this research because it offers the tools and methods required to efficiently analyze and group massive amounts of material.