Analyzing spatial patterns in biodiversity data
  
São Paulo School of Advanced Science (SPSAS)
Co-designing Biodiversity Assessments
  October 31, 2024
Access to slides
![]() 
Maurício Vancine
- Ecologist and PhD in Ecology
- Post-Doc in Spatial Ecology (Prof. Mathias - Unicamp)
- Spatial Ecology
- Ecological Modeling
- Ecological and Spatial Data Analysis
- Amphibian Ecology and Conservation
- Open source (R, QGIS, GNU/Linux)
 
 
Content
- Biodiversity concepts
- Species occurrences
- Spatial analysis
- Application: Seleção Natural Platform
 
- Install R and RStudio
- Install packages
- Computer testing
- Basic R explanation (maybe)
 
- Basic R explanation
- Species occurrences
- Spatial analysis
 
 
 
Objectives
- Contribution of spatial data and analysis to biodiversity knowledge
- Basic tools for downloading and cleaning species occurrence data
- Basic tools for spatial analysis:
- Spatial units (grids, hexagons, convex hull)
- Point density (kernel maps)
- Species Distribution Modeling (very basic)
 
 
IMPORTANT!!!
We are in a safe and friendly space
Feel free to interrupt me and ask questions
 
IMPORTANT!!!
My English is a work in progress…
 
Biodiversity concepts
What is biodiversity?
Sustainable Ecology and Economic Development (SEED)
 
Biodiversity concepts
How to measure biodiversity?
 
Biodiversity concepts
How to measure biodiversity?
 
Biodiversity concepts
How to measure biodiversity?
 
Why do we want so much to measure biodiversity?
Biodiversity concepts
Carbon credit
- Allows private companies to trade carbon emissions
- Measured in tonnes of carbon dioxide equivalent (tCO2e)
 
Biodiversity concepts
Biodiversity credit
- Allows private companies to finance activities (e.g. forest conservation or restoration) for positive biodiversity gains
 
Biodiversity concepts
Carbon credit vs Biodiversity credit
 
There is a small problem: we do not know all the biodiversity……
Biodiversity concepts
Seven biodiversity shortfalls
- Species taxonomy (Linnean)
- Species distribution (Wallacean)
- Species abundance (Prestonian)
- Species evolutionary patterns (Darwinian)
- Species abiotic tolerances (Hutchinsonian)
- Species traits (Raunkiæran)
- Species biotic interactions (Eltonian)
 
 
 
Biodiversity concepts
Let’s select a biodiversity shortfall to explore
- Species taxonomy (Linnean)
- Species distribution (Wallacean)
- Species abundance (Prestonian)
- Species evolutionary patterns (Darwinian)
- Species abiotic tolerances (Hutchinsonian)
- Species traits (Raunkiæran)
- Species biotic interactions (Eltonian)
 
 
 
Species occurrences
Format
 
Species occurrences
Sources
- Field collections (field sampling)
- Literature (articles, data papers, …)
- Citizen science (e-Bird, iNaturalist, …)
- Scientific collections and museums (National Museum of Brazil, Royal Botanic Gardens - Kew, …)
- Databases (GBIF, speciesLink, VertNet, …)
Species occurrences
Databases
![]()
Species occurrences
R packages
![]()
Species occurrences
spocc
- Global Biodiversity Information Facility (GBIF) (rgbif): Earth’s biodiversity
- iNaturalist (inat): citizen science data on species observations
- VertNet (rvertnet): vertebrate records from institutions and museums
- eBird (rebird): citizen science data on bird abundance and distribution and checklist
- iDigBio (ridigbio): biological and paleobiological specimens
- Ocean Biogeographic Information System (OBIS) (obis): marine species data sets from all of the world’s oceans
- Atlas of Living Australia (ALA)(ala): information about species in Australia
 
![]()
There is another huge problem: sampling bias…
Species occurrences
Sampling bias - Brazil
- 
- 1 million (total) and 900 thousand (valid) occurrences for ~4,500 species 
 
- Groups: vertebrates, arthropods and angiosperms
- All occurrences <1km from access routes (roads and rivers)
 
![]() 
 ![]()
Species occurrences
Sampling bias - World
- 740 million occurrences of 375 thousand species
- Representing only 6.7% of the sampled globe
- At least 80% of records were within 2.5 km of roads
 
![]() 
 ![]()
Species occurrences
CoordinateCleaner
- Automated flagging of common spatial and temporal errors in species occurrences
- Errors include:
- Country and province centroids
- Capital coordinates
- Coordinates of biodiversity institutions
- Duplicated coordinates per species
- Urban areas
- Seas
- Equal longitude and latitude
 
![]()
Species occurrences
sampbias
- A statistical method to evaluate and visualize geographic sampling biases
- Biases include:
- Cities
- Airports
- Roads
- Rivers
 
![]() 
 ![]()
How to analyze this data spatially?
Spatial analysis
Spatial units
 
  
Spatial analysis
Spatial units - Grids
- Geographic area in regular cells, forming a grid
- Each cell is a spatial unit that contains values
![]() 
 ![]()
Spatial analysis
Spatial units - Hexagons
- Geographic area in regular hexagons
- Better for uniformity and connectivity, reduces edge effects
![]() 
 ![]()
Spatial analysis
Convex hull
- Smallest convex region that contains all the occurrences
![]() 
 ![]()
Spatial analysis
Point density (kernel maps)
- Represent the concentration of occurrences or events in a geographic area
![]() 
 ![]()
Spatial analysis
Point density (kernel maps)
- Kernel functions
![]() 
 ![]()
Spatial analysis
Point density (kernel maps)
- Bandwidth
 
Spatial analysis
Point density (kernel maps)
Kernel density estimation (KDE)
 
Spatial analysis
Species Distribution Modeling (SDMs)
 
Spatial analysis
Species Distribution Modeling (SDMs)
- Generalized Linear Models (GLM) - binomial
![]() 
 ![]()
Spatial analysis
Species Distribution Modeling (SDMs)
- Suitability map | Potential distribution
 
Spatial analysis
Species Distribution Modeling (SDMs)
![]()
Application: Seleção Natural Platform
Minimum Viable Product
 
Application: Seleção Natural Platform
Aim
- Prioritizes vegetation fragments that are most relevant for biodiversity conservation
- Conservation actions:
- Searching for new populations
- Monitoring population dynamics
- Creating of protected areas
- Places for reintroduction of species
 
![]()
Application: Seleção Natural Platform
Methods
- SDMs to predict the potential distribution of endangered animal species
- Vegetation fragment prioritization based on species composition and landscape metrics
 
![]() 
 ![]()
Application: Seleção Natural Platform
Workflow
- Client registration (user)
- Upload of one or more property boundaries (.kml) (user)
- Species composition and fragment ranking (automatic)
- Fragment selection (user)
- Conservation action plan (automatic)
 
 
 
Application: Seleção Natural Platform
Platform
 
Application: Seleção Natural Platform
Platform
 
Application: Seleção Natural Platform
Platform
 
Application: Seleção Natural Platform
Platform
 
R Programming Language
![]() 
 ![]()
R Programming Language
Top 10 main concepts in R
- Console
- Script
- Operators
- Objects
- Functions
 
- Packages
- Help
- Environment
- Directory
- Citations
 
 
Chrysocyon brachyurus
Coolest mammal in the world
- English common name: Maned Wolf
- Portuguese common name: Lobo-guará
- The largest canid in South America, with reddish fur
- It lives in open and semi-open habitats (Brazilian Savanna - Cerrado)
 
 
Access to GitHub
![]() 
Articles
- Araújo, M. B. et al. (2019). Standards for distribution models in biodiversity assessments. Science Advances, 5(1), eaat4858. 
- Zurell, D. et al. (2020). A standard protocol for reporting species distribution models. Ecography, 43(9), 1261–1277. 
- Sillero, N., & Barbosa, A. M. (2021). Common mistakes in ecological niche models. International Journal of Geographical Information Science, 35(2), 213–226. 
- Sillero, N. et al. (2021). Want to model a species niche? A step-by-step guideline on correlative ecological niche modelling. Ecological Modelling, 456, 109671. 
- Hesselbarth, M. et al. (2021). Open-source tools in R for landscape ecology. Current Landscape Ecology Reports, 6(3), 97-111 
- Sillero, N. et al. (2023). A curated list of R packages for ecological niche modelling. Ecological Modelling, 476, 110242 
Books
- Franklin, J., & Miller, J. A. (2009). Mapping species distributions: Spatial inference and prediction. Cambridge University Press. 
- Peterson, A. T. (Org.). (2011). Ecological niches and geographic distributions. Princeton University Press. 
- Peterson, A. T. (2014). Mapping disease transmission risk: Enriching models using biogeography and ecology. Johns Hopkins University Press. 
- Guisan, A., Thuiller, W., & Zimmermann, N. E. (2017). Habitat Suitability and Distribution Models: With Applications in R. Cambridge University Press. 
- Fletcher, R., & Fortin, M.-J. (2018). Spatial Ecology and Conservation Modeling: Applications with R. Springer International Publishing. 
- Lovelace, R., Nowosad, J., & Muenchow J. (2025). Geocomputation with R. Springer International Publishing. 
Análises Ecológicas no R (2022)
Muito obrigado!
Acknowledgements:
- Mathias Pires
- Miltinho
- Flavia Pinto
- Matheus Lima-Ribeiro
- João Giovanelli
