The physical land type on the earth's surface is indicated by the land cover, which includes forest, agriculture, water bodies etc. An alteration to the Earth's surface caused by human activity is called a land use /land cover (LU/LC) change. Human being shifts from rural areas to city for better education, good health facilities, employment and comfortable living. This process of shifting leads to urbanization. This urbanization has resulted negative effects on society, the economy, and the environment [1]. So it is very important for academics and policymakers worldwide to monitor and reduce the negative consequences of LU/LC changes for several management and scheduling tasks [2].
Land cover classification is essential for identifying changes, planning for urbanisation, mapping and observing the distribution of land cover on the surface of the globe [3]. Machine learning classifier that gives highest accuracy is always in demand in order to get accurate land use land cover classification of hysperctral remote sensing data. In recent time, web based powerful user friendly and open source cloud computing tool such as Google earth engine (GEE) is widely used for reliable land cover classification of LANDSAT data. In recent times, GEE and machine learning algorithms have gained appeal in a variety of satellite data applications, including LU/LC categorization, deforestation, drought, agricultural monitoring, hydrology and land cover mapping and monitoring, and environmental protection [4]. Users of GEE can assess all publicly accessible, large volumes of LANDSAT and SENTINEL data without needing to download it to their personal computer [5].
In remote sensing land cover classification, random forest is the most popular algorithm [6]. Random Forest is a non-parametric machine learning classifier that can be used to classify heterogeneous areas. The number of trees (m-tree) and the number of split variables (n-tree) are the only two input parameters that must be chosen by the user, making RF a straightforward classifier to use [7]. Several studies have discovered that the accuracy of conventional land cover classification systems suffers from a number of limitations [8, 9]. Numerous investigations have revealed that among the six different machine learning classifiers, RF and SVM have the highest overall accuracy [7].
Geographic information systems (GIS) and remote sensing (RS) give crucial tools in providing precise and up-to-date LU/LC data as well as analysing patterns in a study area [8]. Nainital, a district of Uttarakhand State, India has experienced considerable built-up growth during the past 20 years after forming a new state, Uttarakhand in 2000. Due to this, urban sprawl has occurred, with detrimental social, economic, and environmental effects [1]. Therefore, it is crucial for policymakers to examine LULC changes in the past and present in order to understand the factors that contribute to environmental changes and to consider potential remedial action [10].
The Cellular automata (CA)-Markov model has been used in a number of studies to anticipate past and present land cover changes, which helps land use planners and policy makers make decisions about potential land use concerns such as changes in ecosystems, urbanization, environmental and risk assessment [10, 11, 12, 13, 14, 15]. Therefore, it is crucial for decision makers to examine land use land cover changes in the present past and future in order to understand the factors that contribute to environmental changes and to corrective action for [10].
A combination of the Cellular Automata and Markov models is known as CA-Markov model. Forecasting of LU/LC trends and side effect of urbanization can be studied using the CA-Markov model in TerrSet software, especially in places that are developing quickly [16]. The CA-Markov model is frequently used to produce the transition probability matrix from two different time periods derived land cover maps. In GIS investigations of changes in land-use and land-cover, transition probability matrices are frequently employed to objectively quantify the pace of change.
This research aims to detect the LU/LC change from 2000 to 2020 with its driving factors and to predict LU/LC pattern in 2030 by using powerful tools Google Earth Engine and IDRISI software in the study area. The Nainital district extends over 4251 sq. km and geographically it is divided into two regions viz. plain and hilly. The Nainital district’s sustainability particularly in south and west direction (plain region of Nainital District such as Haldwani and Ramnagar etc.) may face difficulties due to the increasing small scale industrialisation and urbanisation. Previous research revealed a number of issues with data quality, classifier accuracy and data validation. Additionally, it appears from the literature that no research has been done to look into changes in LU/LC patterns and their prediction in the study area [20].Finally, machine learning Random Forest classifier is used to get high accuracy. The results of the study are used to identify the critical elements that affect LU/LC dynamics and improve land use policy for sustainable land use planning and development.