Differences in sampling effort impact Bray-Curtis distances and rarefaction can minimize it (CC191)
Bray-Curtis distances are impacted by the difference the number of observations for each sample being compared. Here Pat shows that the average distance is constant when using rarefaction whereas it is more variable with not rarefying, using relative abundance, or normalization. He shows how to do this analysis using ggplot2’s cut_width function. He’ll also show how to generate a density plot of the distributions and use geom_smooth to get an overall sense of the variation in the data. He’ll show how to synthesize and visualize the data using tools from dplyr, ggplot2, and other tidyverse packages.
Code
You can browse the state of the repository at the