Study site
The Colne valley river watershed was chosen for this study (Fig. 1) as it contains a mix of landcover representative of UK landscapes, from uplands with grass, heather and bracken dominating to lower altitude agricultural fields and woodland patches. The watershed also contains a significant urban land cover lower in the catchment, reaching the outskirts of the city of Huddersfield. An entire watershed was chosen in order to test targeted riparian woodland creation as a potential strategy, and compare this to creation elsewhere in the watershed.Riparian planting is projectedfor this area under National Trust plans, and the upland-lowland interface represents a prime location targeted for afforestation under current UK woodland creation schemes (NT personal communication; Iversen et al., 2024).
Landcover maps
Existing landcover maps such as the CORINE land classification did not meet our spatial resolution criteria, therefore we created a land classification for the watershed using recent 10 m resolution sentinel 2 satellite images and a neural network classifier. Full details of the stepwise process, including R code for this and subsequent sections of the methodology can be found at https://github.com/andyspeak/landclass. Eight land classes were chosen – bare ground, bracken, upland grass, lowland grass, heather, urban, water and woodland. Sentinel 2 image capture was taken on 25/06/2020, with less than 5% cloud cover and downloaded from Copernicus in a pre-processed form (Copernicus, 2024). The pixel based supervised classification takes a shapefile of polygons of known land types (ten samples per class) for training the model, which is then used to create prediction rasters. Three training models within the R caret package (Kuhn, 2008) were used and compared for accuracy – Random Forest, Support Vector Machines and Neural Network, with the best performer being selected for the final prediction raster output.
To elucidate the current situation of riparian woodlands more generally in the UK, we estimated their area using a buffer analysis (50m) on Ordnance Survey (OS) open rivers data (Ordnance Survey, 2023) and National Forestry Inventory woodland shapefiles for England, Scotland and Wales (Forestry Commission, 2024). Additionally, annual woodland inventory and creation data from 2013 to 2016 (Forestry Commission, 2024) were used to calculate the amount of woodland patches created adjacent to existing woodland or created as solitary patches over 50 m distant from existing woodland, using buffer analysis and selection by location in QGIS (v3.16). The same 50 m river buffer was used in conjunction with CORINE land cover data from 2018 (Cole et al., 2021) to determine proportions of riparian landtypes in the UK.
Polygonisation
To model real-world woodland creation as much as possible, non-woodland areas were divided into patches based on property boundaries, and these patches were converted to woodland in their entirety, according to different scenario rules. Property boundary data were derived from the UK government INSPIRE dataset (INSPIRE, 2023), which contains boundary polygons for freehold residential ownerships and subdivisions of rural land which often correspond with geographical borders such as agricultural field boundaries. This is suitable for the woodland creation scenarios as new woodland is often planted in spatial agglomerations based on existing boundaries, for instance converting a disused plot of agricultural land into woodland. Some polygons were very large, especially in upland environments. For these, a further step was taken to subdivide them into suitably sized polygons, or patches.
Subdivision of the landscape represented an opportunity to model the effect of patch area in woodland creation approaches. Three scenarios of mean average patch area were explored, denoted as PAT1.3 (1.3 ha), PAT2.8 (2.8 ha) and PAT5 (5 ha; Fig. 2). PAT1.3 and PAT5 were determined as, respectively, the median and mean of existing National Forestry Inventory woodland patches (Forestry Commission, 2024). PAT2.8 was taken as the mean field area within the existing subdivision of lowland fields, based on a random sample of 100 polygons visually identified as fields on Googleearth satellite images in QGIS (v3.16).
Polygons in the INSPIRE shapefile larger than their scenario’s mean patch area were subdivided in turn, into Voronoi polygons that conformed to that scenarios mean patch area, using a bespoke R function which utilised k-means clustering (see github page). Each polygon was assigned a modal landclass from the underlying land classification raster.
Woodland creation scenarios
The vector shapefiles, subdivided and classified by land use for the three different mean patch area scenarios, serve as input for woodland creation simulation models, using a hypothetical model similar to Lee and Johnson (2005). Woodland coverage across the study area was increased by 10, 30 and 50% (relative to initial coverage), for each maximum patch area scenario (Fig. 2), representing three scenarios of ambition for woodland expansion, and following four strategic targeting scenarios: random, proximal, riparian random, riparian proximal (Fig. 3). For random, non-woodland polygons were converted to woodland at random; for proximal, polygons were iteratively converted to woodland at random from those that bordered existing woodland, causing woodland to grow by expansion; for riparian planting, an additional constraint required converted polygons to partially overlap a 50 m buffer around watercourses. Watercourses were defined using Ordnance Survey (OS) open rivers data (Ordnance Survey, 2023). While a fixed width buffer approach may only approximate variable riparian zones and their ecosystem services (de Sosa et al., 2017), it was appropriate for our simple model scenarios. For the smaller patch sizes of PAT1.3 scenario, it was occasionally necessary to increase the buffer zone to 100 m to allow enough polygons for the + 30% and + 50% ambition scenarios to be reached. The woodland creation percentage increases encompass the UK reforestation target of 17.5% cover which represents a rise in cover of 20.7% above present levels (14.5% at time of writing).
Non-woodland patches of bare ground, built/urban and water were not available for conversion into woodland. Bare ground was predominantly bare rock and scree in upland environments. Patches above 300 m were also not available for conversion, as the upper limit for woodland creation in the UK is between 200 and 450 m. Finally, patches on land classed as high agricultural grade (top two grades out of five) were excluded (Natural England, 2023). Each scenario was run 100 times and the output vectors were rasterised back to the original 10 m resolution of the land classification raster in readiness for calculation of landscape metrics.
We assume, for the purposes of landscape connectivity metrics, that pre-existing and newly created woodlands are equally able to support woodland biota and facilitate species dispersal, representing a future state of woodland after approximately 20 years.
These scenarios, by generating patches of different quantities and sizes in a highly realistic landscape simulation, allow the investigation of the SLOSS debate. Different combinations of patch size and quantity can be tested in terms of their effects on landscape metrics and consequently provide the basis for postulations about their effect on wildlife ecology.
Landscape metrics
Landscape metrics were calculated at the landscape level for the woodland patches using the “landscapemetrics” package (Hesselbarth et al., 2019). The metrics chosen were aggregation index, mean patch area, core area index, edge density, Euclidean nearest neighbour, largest patch index, number of patches, and perimeter area fractal complexity. Edge depth was 10 m corresponding to one pixel in the raster files. Where statistical comparisons were necessary, these were calculated using pair-wise kruskal-wallis, across the different combinations of percentage woodland increase, average patch size and growth scenarios, and tabulated in Appendix 1.
In order to calculate habitat functional connectivity we used a recently published Edge-weighted Habitat Index (Dennis et al., 2024) which utilises a logistic function to link matrix patch area to distance decay effect of the habitat edge using habitat costs and edge effect data from Eycott et al. (2011).
The aforementioned UK reforestation target rise in cover of 20.7% above present levels can be assessed in terms of its effect on the landscape metrics. This was derived from linear regression equations constructed using the percentage increases in cover and percentage changes in the metrics (n = 3). The middle mean patch size of PAT2.8 was used and non-riparian scenarios (allowing a comparison of random versus proximal growth) considered for simplicity.