Acquiring high-resolution and accurate data on forest structure in the pantropics is essential for improving the quantification of terrestrial carbon stocks and change (Pugh et al., 2019; Tagesson et al., 2020). Such information also has practical applications, such as estimating forest timber volume, assessing forest degradation, deforestation and regrowth, and modeling primary production, biodiversity and other key ecosystem variables (Asner et al., 2012; Goetz and Dubayah, 2011). In addition, these data are essential for the production of jurisdictional forest biomass and carbon stock estimates, which can help with the production of countries’ National Determined Contribution reports (Secretariat, 2022) and to help achieve the UN Sustainable Development Goals (Olabi et al., 2022).
Airborne lidar campaigns using small-footprint or medium-footprint waveform sensors deliver accurate canopy structural information and enable the estimation of high-quality reference biomass (Duncanson et al., 2020; Fatoyinbo et al., 2021; Huang et al., 2019; Ojoatre et al., 2019), but are limited in spatial extent particularly over tropical regions. Space-based lidar can measure forest structure and biomass consistently at the global scale. The GLAS (Geoscience Laser Altimeter System) instrument onboard NASA’s ICESat mission and, more recently, the ATLAS instrument onboard the ICESat-2 mission have successfully measured forest structure, but are not optimized for vegetation mapping and have quite limited spatial mapping (Narine et al., 2020; Simard et al., 2011).
NASA’s Global Ecosystem Dynamics Investigation (GEDI) instrument is a vegetation-optimized lidar onboard the International Space Station (ISS). Since April 2019, GEDI has provided billions of lidar waveform observations of ecosystem structure between 51.6° N and 51.6° S latitudes (Dubayah et al., 2022a). The GEDI mission provides a variety of products on ecosystem structure including canopy height, canopy cover and vertical profile, canopy leaf area index and profiles and aboveground biomass (AGB) (Dubayah et al., 2020, 2022a). However, although GEDI provides vastly better spatial coverage than previous spaceborne lidar missions, it is also a sampling mission with 60 m gaps along each of its 8 tracks, 600 m spacing between tracks, and variable spacing between the orbital swaths of these 8 tracks, which varies as a function of latitude. Additionally, persistent clouds create gaps in coverage, especially over the tropics. For these reasons, GEDI’s standard, gridded ecosystem products are produced at 1 km resolution which provides near continuous mapping.
Many applications require higher spatial resolution and this has motivated much research into fusion methods that combine the highly accurate but sparse estimates from GEDI with wall-to-wall high-resolution imagery from other sensors. These methods have used passive optical, stereo, and SAR imagery, often in machine learning or other advanced statistical frameworks (Francini et al., 2022; Shendryk, 2022). For example, the first one is the 30-m Global Forest Canopy Height for 2019 by Potapov et al., (2021) which was produced by integrating GEDI data and Landsat data using a bagged regression tree ensemble model. Another product is the 10-m Global Canopy Top Height map for the year 2020 estimated from fusing GEDI with Sentinel-2 multi-spectral imagery using a probabilistic deep learning (Lang et al., 2022). However, ideal ancillary data to map forest structure should be not merely spatially continuous but also (a) less influenced by clouds and (b) sensitive to vertical vegetation structure. While time series of optical imagery, including spatial texture measures, have proven to be somewhat indirectly responsive to height, they may not capture the full range of height and can be limited in areas of high canopy cover, resulting in saturation of predicted heights (Bullock et al., 2020; Chi et al., 2015; Nguyen et al., 2020).
Interferometric Synthetic Aperture Radar (SAR), known as Interferometric SAR or InSAR, has shown to accurately capture the vertical structure of forest canopies. However, its accuracy is influenced by two factors: the wavelength of the radar waves and temporal decorrelation. The wavelength determines the radar's ability to penetrate through the canopy (Jedlovec, 2009), while temporal decorrelation results in a loss of coherence necessary for modeling canopy structure (Simard et al., 2012). Lower-frequency waves like P- and L-band are more sensitive to the tree trunk, underlying ground, and their interaction, while higher-frequency waves like C- and X-band are more sensitive to the crown composed of leaves and branches. However, at each frequency, all components of the tree and the ground contribute to the scattering of the radar signal received (Treuhaft and Siqueira, 2000). This necessitates algorithms capable of separately estimating different types of canopy height and ground elevation. Multiple studies have developed algorithms that use various sources of information such as multi-polarization InSAR (PolInSAR), multi-baseline InSAR/PolInSAR, multi-baseline tomography SAR (TomoSAR) (Reigber and Moreira, 2000; Neumann et al., 2009; Pardini and Papathanassiou, 2012; Kugler et al., 2014; Soja and Ulander, 2016; Pardini et al., 2017;) as well as lidar (Askne et al., 2013, 2017; Qi and Dubayah, 2016; Pulella et al., 2017; Bispo et al., 2019) to identify and remove the ground. These studies were limited to areas where such data sources were available. Forest height estimation has been achieved without the need to identify and remove the ground by utilizing the TanDEM-X (TDX) X-band InSAR coherence magnitude and a physical InSAR scattering model (Chen et al., 2021, 2016; Olesk et al., 2016; Schlund et al., 2019; Gómez et al., 2021). This is significant because TDX is a spaceborne InSAR mission that avoids issues with temporal decorrelation and has the potential for global-scale application. Studies have shown the potential of combining data from GEDI and TDX to map forest structure (Qi et al., 2019a) and biomass (Qi et al., 2019b), but these studies were limited to specific study sites and a few TDX acquisitions. With the availability of abundant GEDI data, further research is needed to evaluate the effectiveness of GEDI and TDX data fusion in larger areas and countries.
The efficacy of GEDI-TDX fusion can be attributed to several factors: 1) InSAR coherence has relatively higher sensitivity to forest structure and biomass than optical or SAR backscatter signals (Raveendrakumar et al., 2022); 2) Lidar and InSAR are similar in the capability of measuring forest vertical structure, although working at different viewing geometry and wavelength (Pardini et al., 2019); 3) TDX interferometer provides unprecedent high-quality interferometric coherence with no temporal decorrelation and high spatial resolution (Chen et al., 2019); 4) The baseline of TDX Interferometer has large variability due to the tandem satellite configuration (as opposed to the fixed baseline used by the earlier SRTM mission (Kobrick, 2006)), allowing for a wide variability of spatial baselines (Zink and Moreira, 2013) which is essential for accurate height retrievals.
Both missions are still operational and have had coincident acquisitions since 2019, which facilitates fusion. However, our earlier studies were conducted over selected experimental sites using simulated GEDI data derived from airborne laser scanning data within a simulator (Hancock et al., 2019), and was based on Random Volume over Ground (RVoG) model (a simplified physical model) which inverts less-accurate canopy height when the assumptions of the scattering model are violated (Pardini et al., 2019). The potential and performance of this type of fusion has not been evaluated using on-orbit GEDI data over large areas.
This study investigates the performance of GEDI-TDX data fusion over forests in Central Africa, Central and South America, including the Amazon Basin at 25 and 100 m resolution. We use a model-based forest height inversion framework where the GEDI lidar waveforms are used to approximate the X-band vertical reflectivity. This provides a key, missing element when using single single-baseline single-polarized TDX InSAR that allows for the inversion of canopy height from interferometric coherence. The inverted TDX height is then calibrated with actual GEDI canopy heights using adaptive regional models and TDX wavenumber kZ as a calibration factor, resulting in final GEDI-TDX (GTDX, hereafter) fusion height maps at 25 m and 100 m. The study is conducted across four pantropical countries/regions: Gabon, Mexico, French Guiana, and Amazon Basin. Our results of the fusion product are validated against independent airborne lidar canopy height for accuracy assessment. Our height canopy map product is complemented with an uncertainty of prediction map, expressed as the RMSE, that was derived under the same closed-form parametric statistical inference framework developed around the GEDI mission (Dubayah et al., 2022a; Patterson et al., 2019; Saarela et al., 2022).