Non-metric Multidimensional Scaling (NMDS) - Yiluan Song's teaching website

Guest lecture for Dr. Drew Gronewold’s class EAS 501.077 Multivariate Statistics for Environmental Science in 2023 Fall.

Objectives

Compare parametric and nonparametric methods for ordination
Learn about real-life applications of NMDS
Practice using NMDS in R to analyze community composition
Get familiar with a statistical test, PERMANOVA

Parametric or nonparametric?

In previous classes, we have learned about principal component analysis (PCA).

PCA have certain assumptions, requiring the data to be

continuous,
linear,
normally distributed.

Note that PCA involves calculating the Euclidean distances in the multidimensional space between samples.

What if our data are

discrete (e.g., presence/absence, category),
nonlinear (e.g., day of year, precipitation),
not normally distributed (e.g., count of rare species)?

Does Euclidean distance still make sense?

“Nonparametric statistics is the type of statistics that is not restricted by assumptions concerning the nature of the population from which a sample is drawn.”

Quote from Wikipedia.

PCA is a parametric method for ordination. Today, we are going to add a nonparametric method into our toolbox: non-metric multidimensional scaling (NMDS).

Figure 1: Example NMDS plot.

Source

NMDS places samples that are more “similar” to each other closer in a low-dimensional space.

Advantages of NMDS include

accommodating multiple types of data,
being used with different measures of dissimilarity.

However, be aware of the differences with PCA. Unlike principle components in PCA, the axes in NMDS do not carry a specific meaning.

Applications of NMDS

Case 1: Phytoplankton community composition (Paper)

Tropical urban reservoirs face the problem of phytoplankton bloom, often dominated by toxic cyanobacteria. It has been hypothesized that macrophytes (water plants) can inhibit phytoplankton, presenting an opportunity for reservoir restoration.

Figure 2: Cyanobacterial blooms.

Huisman et al. (2019)

In reservoirs in Singapore, we conducted a series of experiments to test macrophytes’ ability to alter phytoplankton communities and facilitate reservoir restoration.

Figure 3: Set up of a mesocosm experiment.

Sim et al. (2021) Fig. 1

Apart from the effect of macrophytes on phytoplankton biomass, we also care about the effects on phytoplankton community composition.

Figure 4: Relative abundance (biovolume) of phytoplankton taxa.

Sim et al. (2021) Fig. 3

Community composition data is essentially multivariate. The (relative) abundance of each taxa is a variable, and there are usually many taxa. How do we analyze the response of all these variables together?

We used NMDS to visualize the effect of macrophyte treatment on phytoplankton community composition.

Figure 5: NMDS of the relative biovolumes of phytoplankton genera in the control and macrophyte treatments.

Mowe et al. (2019) Fig. 6

Here we see how the introduction of macrophyte caused a shift in phytoplankton community composition, with the shift being greater as density of macrophyte increased.

Case 2: Grassland taxonomic, phylogenitic, and functional trait composition (Paper)

The trajectory of early succession communities is shaped by the plant phylogenetic and trait history. Teasing apart these two processes has important implications for restoration.

Holding starting species richness constant, Karimi et al. planted communities of different phylogenetic diversity (PD) and functional trait diversity (FD).

Figure 6: NMDS of taxonomic, phylogenetic and functional trait composition in two types of restoration treatments.

Karimi et al. (2021) Fig. 4

Similar to case study 1, they examined how different treatments drive differences in community composition. Apart from taxonomic composition, they also analyzed responses in phylogenetic and functional trait composition.

I highlight this study because they used a combination of data types to characterize composition.

“Functional diversity was assessed using 12 continuous leaf traits, 6 categorical traits, 8 binary root traits, seed mass, a categorical habitat moisture trait and genome size.”

They also used NMDS to study changes in the three kinds of community composition.

Figure 7: NMDS of taxonomic, phylogenetic and functional trait composition before and after restoration treatment.

Karimi et al. (2021) Fig. 4 (Note their dissimilarity measures.)

Here, we see that the PD treatment increased the dispersion of taxonomic and phylogenetic composition between communities (increased beta diversity) and caused directional shifts in functional trait composition (convergence).

Hands-on community composition analysis

There are really good NMDS tutorial1 tutorial 2 which I encourage you to try at home. Here I give another example with real-life data and some other visualization options.

Background: The National Ecological Observatory Network (NEON) collects ecological and biogeochemical data with standardized protocols across 81 field sites across the United States. The Woody Plant Vegetation Structure dataset (DP1.10098) describes the structure and composition of woody vegetation through the mapping, identification, and measurement of free-standing woody plants including trees, saplings, shrubs, lianas, etc.

We have downloaded NEON vegetation structure data at one site, Bartlett Experimental Forest (BART), for you to analyze the composition of woody vegetation. Please download them here.

Read in some data frames.

dat <- read_rds("data.rds")

df_tree <- dat$vst_apparentindividual %>%
  arrange(desc(publicationDate)) %>%
  distinct(plotID, individualID, .keep_all = T) %>%
  filter(str_detect(plantStatus %>% tolower(), "live")) %>%
  select(plotID, individualID)

df_tree_sp <- dat$vst_mappingandtagging %>%
  arrange(desc(publicationDate)) %>%
  distinct(individualID, plotID, .keep_all = T) %>%
  select(individualID, plotID, taxonID, scientificName) %>%
  filter(!is.na(taxonID)) %>%
  filter(taxonID != "2PLANT")

df_plot <- dat$vst_perplotperyear %>%
  arrange(desc(publicationDate)) %>%
  distinct(plotID, plotType, .keep_all = T) %>%
  select(plotID, plotType, nlcdClass, lon = decimalLongitude, lat = decimalLatitude)

Join the data frames. Each row is an individual tree (with unique individualID). Feel free to explore this dataset.

df <- df_tree %>%
  inner_join(df_tree_sp,
    by = c("individualID", "plotID")
  ) %>%
  inner_join(df_plot,
    by = c("plotID")
  )
df %>% head(10)

##      plotID            individualID taxonID                 scientificName
## 1  BART_075 NEON.PLA.D01.BART.03230    FAGR        Fagus grandifolia Ehrh.
## 2  BART_075 NEON.PLA.D01.BART.04011    FAGR        Fagus grandifolia Ehrh.
## 3  BART_036 NEON.PLA.D01.BART.04548    FAGR        Fagus grandifolia Ehrh.
## 4  BART_075 NEON.PLA.D01.BART.04702    TSCA Tsuga canadensis (L.) Carrière
## 5  BART_075 NEON.PLA.D01.BART.03257    FAGR        Fagus grandifolia Ehrh.
## 6  BART_075 NEON.PLA.D01.BART.03148    FAGR        Fagus grandifolia Ehrh.
## 7  BART_075 NEON.PLA.D01.BART.03254    FAGR        Fagus grandifolia Ehrh.
## 8  BART_075 NEON.PLA.D01.BART.04703    ACPE          Acer pensylvanicum L.
## 9  BART_036 NEON.PLA.D01.BART.04554    FAGR        Fagus grandifolia Ehrh.
## 10 BART_075 NEON.PLA.D01.BART.03300   BEAL2  Betula alleghaniensis Britton
##    plotType       nlcdClass       lon      lat
## 1     tower     mixedForest -71.28649 44.05914
## 2     tower     mixedForest -71.28649 44.05914
## 3     tower deciduousForest -71.28588 44.06208
## 4     tower     mixedForest -71.28649 44.05914
## 5     tower     mixedForest -71.28649 44.05914
## 6     tower     mixedForest -71.28649 44.05914
## 7     tower     mixedForest -71.28649 44.05914
## 8     tower     mixedForest -71.28649 44.05914
## 9     tower deciduousForest -71.28588 44.06208
## 10    tower     mixedForest -71.28649 44.05914

Process the joined data frame to get a community composition data frame. Each row is a community (a plot). Species names are now columns.

df_comm <- df %>%
  group_by(plotID, plotType, nlcdClass, lat, lon, scientificName) %>%
  summarise(count = n()) %>%
  ungroup() %>%
  spread(key = "scientificName", value = "count", fill = 0)
df_comm %>% head(10)

## # A tibble: 10 × 36
##    plotID   plotType    nlcdClass   lat   lon `Abies sp.` Acer pensylvanicum L…¹
##    <chr>    <chr>       <chr>     <dbl> <dbl>       <dbl>                  <dbl>
##  1 BART_001 distributed mixedFor…  44.0 -71.3           0                      2
##  2 BART_002 distributed deciduou…  44.0 -71.3           0                      6
##  3 BART_003 distributed deciduou…  44.1 -71.3           0                      0
##  4 BART_004 distributed mixedFor…  44.0 -71.3           0                      1
##  5 BART_005 distributed mixedFor…  44.1 -71.3           1                      0
##  6 BART_006 distributed deciduou…  44.1 -71.3           1                      6
##  7 BART_007 distributed mixedFor…  44.0 -71.3           0                      0
##  8 BART_010 distributed deciduou…  44.1 -71.3           0                      0
##  9 BART_011 distributed mixedFor…  44.1 -71.3           0                      0
## 10 BART_012 distributed deciduou…  44.0 -71.3           0                      0
## # ℹ abbreviated name: ¹`Acer pensylvanicum L.`
## # ℹ 29 more variables: `Acer rubrum L.` <dbl>, `Acer saccharinum L.` <dbl>,
## #   `Acer saccharum Marshall` <dbl>,
## #   `Acer saccharum Marshall var. saccharum` <dbl>, `Acer sp.` <dbl>,
## #   `Betula ×caerulea Blanch. var. caerulea` <dbl>,
## #   `Betula alleghaniensis Britton` <dbl>, `Betula lenta L.` <dbl>,
## #   `Betula papyrifera Marshall` <dbl>, …

We need to make this community composition data frame a matrix. Again, each row is a community (a plot) and each species is a column. Note that we leave out the metadata for plots.

mat_comm <- df_comm %>%
  select(-plotID, -plotType, -nlcdClass, -lon, -lat) %>%
  as.matrix()
mat_comm [1:6, 1:6]

##      Abies sp. Acer pensylvanicum L. Acer rubrum L. Acer saccharinum L.
## [1,]         0                     2              1                   0
## [2,]         0                     6              0                   0
## [3,]         0                     0             15                   0
## [4,]         0                     1              5                   0
## [5,]         1                     0              0                   0
## [6,]         1                     6              0                   0
##      Acer saccharum Marshall Acer saccharum Marshall var. saccharum
## [1,]                       0                                      1
## [2,]                       0                                     22
## [3,]                       0                                      7
## [4,]                       0                                      0
## [5,]                       0                                      0
## [6,]                       0                                      2

With this community composition matrix, we can use the metaMDS function in vegan package to perform NMDS.

set.seed(1)
mds_comm <- vegan::metaMDS(mat_comm, distant = "bray", k = 4, try = 100)

## Square root transformation
## Wisconsin double standardization
## Run 0 stress 0.08885421 
## Run 1 stress 0.09016598 
## Run 2 stress 0.08885805 
## ... Procrustes: rmse 0.003813373  max resid 0.01310509 
## Run 3 stress 0.08885508 
## ... Procrustes: rmse 0.0002817591  max resid 0.0009507222 
## ... Similar to previous best
## Run 4 stress 0.08885586 
## ... Procrustes: rmse 0.0004902184  max resid 0.001639388 
## ... Similar to previous best
## Run 5 stress 0.0903252 
## Run 6 stress 0.08885492 
## ... Procrustes: rmse 0.0002619428  max resid 0.0008819165 
## ... Similar to previous best
## Run 7 stress 0.08885847 
## ... Procrustes: rmse 0.003935785  max resid 0.01353974 
## Run 8 stress 0.08885396 
## ... New best solution
## ... Procrustes: rmse 0.002582129  max resid 0.008932619 
## ... Similar to previous best
## Run 9 stress 0.09029896 
## Run 10 stress 0.09034585 
## Run 11 stress 0.08893074 
## ... Procrustes: rmse 0.009504952  max resid 0.03165667 
## Run 12 stress 0.09032054 
## Run 13 stress 0.08885345 
## ... New best solution
## ... Procrustes: rmse 0.002182506  max resid 0.007588686 
## ... Similar to previous best
## Run 14 stress 0.08889528 
## ... Procrustes: rmse 0.005338802  max resid 0.01761426 
## Run 15 stress 0.08886069 
## ... Procrustes: rmse 0.003999398  max resid 0.01384676 
## Run 16 stress 0.08885448 
## ... Procrustes: rmse 0.0005095852  max resid 0.001713473 
## ... Similar to previous best
## Run 17 stress 0.09028926 
## Run 18 stress 0.08886793 
## ... Procrustes: rmse 0.005178257  max resid 0.0181378 
## Run 19 stress 0.08885554 
## ... Procrustes: rmse 0.002771664  max resid 0.009498437 
## ... Similar to previous best
## Run 20 stress 0.09016291 
## *** Best solution repeated 3 times

mds_comm

## 
## Call:
## vegan::metaMDS(comm = mat_comm, k = 4, try = 100, distant = "bray") 
## 
## global Multidimensional Scaling using monoMDS
## 
## Data:     wisconsin(sqrt(mat_comm)) 
## Distance: bray 
## 
## Dimensions: 4 
## Stress:     0.08885345 
## Stress type 1, weak ties
## Best solution was repeated 3 times in 20 tries
## The best solution was from try 13 (random start)
## Scaling: centring, PC rotation, halfchange scaling 
## Species: expanded scores based on 'wisconsin(sqrt(mat_comm))'

We check the stress value to find out if our NMDS has a good fit. We can also make a stressplot.

mds_comm$stress

## [1] 0.08885345

vegan::stressplot(mds_comm)

Figure 8: NMDS stress plot.

Is our stress value considered good? A rule of thumb is that stress < 0.1 is good and stress < 0.05 is excellent. You can increase the number of dimensions (k) to reduce stress. However, large k is usually not useful and can even be harmful.

For more considerations, read this chapter.

Now we can generate a basic NMDS plot. Labels in black show communities, and labels in red show species.

plot(mds_comm, type = "t")

Figure 9: A basic NMDS plot.

You can see that some communities are more similar than others, and some species tend to occur together.

We can try to redraw the NMDS plot using ggplot. This gives you more control on the graph elements.

df_nmds_comm <- mds_comm %>%
  vegan::scores(display = "sites") %>%
  data.frame() %>%
  bind_cols(df_comm %>%
    select(plotID, plotType, nlcdClass))

df_nmds_sp <- mds_comm %>%
  vegan::scores(display = "species") %>%
  data.frame() %>%
  rownames_to_column(var = "species")

ggplot(df_nmds_comm, aes(x = NMDS1, y = NMDS2)) +
  geom_point() +
  ggrepel::geom_text_repel(data = df_nmds_sp, aes(NMDS1, NMDS2, label = species), color = "dark grey") +
  ggthemes::theme_few()

Figure 10: A NMDS plot drawn using ggplot.

We ca draw ellipses based on existing grouping of these communities. In experiment, we can draw ellipses for control and treatment. In observations, we can draw ellipses for different time points. Here, I draw ellipses for communities from different land cover types.

ggplot(df_nmds_comm, aes(x = NMDS1, y = NMDS2, col = nlcdClass)) +
  geom_point() +
  stat_ellipse() +
  ggrepel::geom_text_repel(data = df_nmds_sp, aes(NMDS1, NMDS2, label = species), color = "dark grey") +
  ggthemes::theme_few() +
  coord_equal()

Figure 11: NMDS plot showing composition of communities from different land cover types.

You can see some differences in the composition of communities from different land cover types. Note that we used the first two axes of NMDS. What if we use another two axes?

ggplot(df_nmds_comm, aes(x = NMDS3, y = NMDS4, col = nlcdClass)) +
  geom_point() +
  stat_ellipse() +
  ggrepel::geom_text_repel(data = df_nmds_sp, aes(NMDS1, NMDS2, label = species), color = "dark grey") +
  ggthemes::theme_few() +
  coord_equal()

Figure 12: NMDS plot showing composition of communities from different land cover types, using NMDS3 and NMDS4.

We still see some differences using NMDS3 and NMDS4, but perhaps less distinct compared to when we used NMDS1 and NMDS2.

NEON has two types of plots, distributed and tower. Their sampling methods differ. Let’s see their difference.

ggplot(df_nmds_comm, aes(x = NMDS1, y = NMDS2, col = plotType)) +
  geom_point() +
  stat_ellipse() +
  ggrepel::geom_text_repel(data = df_nmds_sp, aes(NMDS1, NMDS2, label = species), color = "dark grey") +
  ggthemes::theme_few() +
  coord_equal()

Figure 13: NMDS plot showing composition of communities from different plot types.

Communities from distributed plots seem to be more dispersed? The reason might be tower plots have a more constrained sampling area.

PERMANOVA

We have done some visualization that hopefully help us intuitively see the similarity and differences between groups of samples. What if we are asked to statistically quantify the differences between these groups? How can we get a p value?

Permutational multivariate analysis of variance (PERMANOVA) is a nonparametric multivariate statistical permutation test.

A significant p value indicates that the two groups are different in the their centroids OR dispersion in the multidimensional space.
It is similar to ANOVA, but it does not have many assumptions except exchangeability (usually satisfied).

(The two case studies we introduced both used PERMANOVA.)

In practice, we can easily use the adonis2 function from the vegan package.

set.seed(1)
res_permanova <- vegan::adonis2(mat_comm ~ nlcdClass, data = df_comm, permutations = 9999)
res_permanova

## Permutation test for adonis under reduced model
## Terms added sequentially (first to last)
## Permutation: free
## Number of permutations: 9999
## 
## vegan::adonis2(formula = mat_comm ~ nlcdClass, data = df_comm, permutations = 9999)
##           Df SumOfSqs      R2      F Pr(>F)    
## nlcdClass  2   2.1141 0.31126 8.3605  1e-04 ***
## Residual  37   4.6781 0.68874                  
## Total     39   6.7922 1.00000                  
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

set.seed(1)
res_permanova <- vegan::adonis2(mat_comm ~ plotType, data = df_comm, permutations = 9999)
res_permanova

## Permutation test for adonis under reduced model
## Terms added sequentially (first to last)
## Permutation: free
## Number of permutations: 9999
## 
## vegan::adonis2(formula = mat_comm ~ plotType, data = df_comm, permutations = 9999)
##          Df SumOfSqs      R2      F Pr(>F)    
## plotType  1   1.0483 0.15434 6.9352  6e-04 ***
## Residual 38   5.7439 0.84566                  
## Total    39   6.7922 1.00000                  
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

set.seed(1)
res_permanova <- vegan::adonis2(mat_comm ~ nlcdClass * plotType, data = df_comm, permutations = 9999)
res_permanova

## Permutation test for adonis under reduced model
## Terms added sequentially (first to last)
## Permutation: free
## Number of permutations: 9999
## 
## vegan::adonis2(formula = mat_comm ~ nlcdClass * plotType, data = df_comm, permutations = 9999)
##                    Df SumOfSqs      R2       F Pr(>F)    
## nlcdClass           2   2.1141 0.31126 10.0769 0.0001 ***
## plotType            1   0.4133 0.06085  3.9399 0.0113 *  
## nlcdClass:plotType  1   0.5933 0.08735  5.6560 0.0026 ** 
## Residual           35   3.6715 0.54054                   
## Total              39   6.7922 1.00000                   
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Can you try to interpret the results of these three PERMANOVA test that correspond to the three NMDS plots above? Are the differences that we observed from NMDS plots statistically significant?