In machine learning one often assumes the data are independent when evaluating model performance. However, this rarely holds in practise. Geographic information data sets are an example where the data points have stronger dependencies among each other the closer they are geographically. This phenomenon known as spatial autocorrelation (SAC) causes the standard cross validation (CV) methods to produce optimistically biased prediction performance estimates for spatial models, which can result in increased costs and accidents in practical applications. To overcome this problem we propose a modified version of the CV method called spatial k-fold cross validation (SKCV), which provides a useful estimate for model prediction performance without optimistic bias due to SAC. We test SKCV with three real world cases involving open natural data showing that the estimates produced by the ordinary CV are up to 40% more optimistic than those of SKCV. Both regression and classification cases are considered in our experiments. In addition, we will show how the SKCV method can be applied as a criterion for selecting data sampling density for new research area.
Soil rutting caused by forest operations has negative economic and ecological effects and thus limits for rutting are set by forest laws and sustainability criteria. Extensive data on rut depths are necessary for post-harvest quality control and development of models that link environmental conditions to rut formation. This study explored the use of a Light Detection and Ranging (LiDAR) sensor mounted on a forest harvester and forwarder to measure rut depths in real harvesting conditions in Southern Finland. LiDAR-derived rut depths were compared to manually measured rut depths. The results showed that at 10-20 m spatial resolution, the LiDAR method can provide unbiased estimates of rut depth with root mean square error (RMSE) < 3.5 cm compared to the manual rut depth measurements. The results suggest that a LiDAR sensor mounted on a forest vehicle can in future provide a viable method for the large-scale collection of rut depth data as part of normal forestry operations.
Autonomous harvesting and transportation is a long-term goal of the forest industry. One of the main challenges is the accurate localization of both vehicles and trees in a forest. Forests are unstructured environments where it is difficult to find a group of significant landmarks for current fast feature-based place recognition algorithms. This paper proposes a novel approach where local point clouds are matched to a global tree map using the Delaunay triangularization as the representation format. Instead of point cloud based matching methods, we utilize a topology-based method. First, tree trunk positions are registered at a prior run done by a forest harvester. Second, the resulting map is Delaunay triangularized. Third, a local submap of the autonomous robot is registered, triangularized and matched using triangular similarity maximization to estimate the position of the robot. We test our method on a dataset accumulated from a forestry site at Lieksa, Finland. A total length of 200 m of harvester path was recorded by an industrial harvester with a 3D laser scanner and a geolocation unit fixed to the frame. Our experiments show a 12 cm s.t.d. in the location accuracy and with real-time data processing for speeds not exceeding 0.5 m/s. The accuracy and speed limit are realistic during forest operations.
Forest harvesting operations with heavy machinery can lead to significant soil rutting. Risks of rutting depend on the soil bearing capacity which has considerable spatial and temporal variability. Trafficability prediction is required in the selection of suitable operation sites for a given time window and conditions, and for on-site route optimization during the operation. Integrative tools are necessary to plan and carry out forest operations with minimal negative ecological and economic impacts. This study demonstrates a trafficability prediction framework that utilizes a spatial hydrological model and a wide range of spatial data. Trafficability was approached by producing a rut depth prediction map at a 16 × 16 m grid resolution, based on the outputs of a general linear mixed model developed using field data from Southern Finland, modelled daily soil moisture, spatial forest inventory and topography data, along with field measured rolling resistance and information on the mass transported through the grid cells. Dynamic rut depth prediction maps were produced by accounting for changing weather conditions through hydrological modelling. We also demonstrated a generalization of the rolling resistance coefficient, measured with harvester CAN-bus channel data. Future steps towards a nationwide prediction framework based on continuous data flow, process-based modelling and machine learning are discussed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.