This analysis was performed to support Dr. Ethelyn Echep who is an entomologist and wanted to understand the habitat suitability of two insects (Lipaphis erysimi pseudobrassicae and Myzus persicae) in Ghana as part of her PhD project. She visited 91 cabbage farms in Ghana to collect occurrence data on the two insects, which was used for the analysis.
Presented during the Remote Sensing Day 2021, organized by Where Geospatial
The study area is Ghana. As shown in the map, the red dots are the places where insect occurrence data were collected.
Using the Average Nearest Neighbor analysis in ArcMap, it was found that the spatial distribution was clustered for L.E.P.
Using the Average Nearest Neighbor analysis in ArcMap, it was found that the spatial distribution was clustered for M.Persicae.
To determine the contributing factors, the table shows 19 Bioclimatic variables and 2 Remote Sensing variables that were used as covariates in a supervised machine learning modelling with the species occurrences as the dependent variable.
Visualization of the covariates in R Programming
A Random Forest model was fit for the L.E.P insect data in R Programming using the covariates listed above as predictors and the occurrences as the dependent variable. The accuracy of the model for a 10-fold cross-validation was 66.57%, and the variable importance is shown in the chart.
A Random Forest model was fit for the M.Persicae insect data in R Programming using the covariates listed above as predictors and the occurrences as the dependent variable. The accuracy of the model for a 10-fold cross-validation was 59.79%, and the variable importance is shown in the chart.
The top 6 contributing factors addressing question 2 are:
To address question 3, the modelled random forest was used to make predictions at grid locations created over Ghana for the L.E.P insect.
To address question 3, the modelled random forest was used to make predictions at grid locations created over Ghana for the M.Persicae insect.