Succession in practice

This session we'll be putting the quantitative approaches we met in the asynchronous session into practice.

There's a lot to cover this week, so don't hesitate to call for help if it's taking a while to wrap your head around a particular concept.

Getting set up

This session uses R. Before the session, please:

  • Download and install R. Then, download and install RStudio Desktop.
    • Can't download? Run RStudio online with a free posit Cloud account. Launch RStudio via New Project → New RStudio Project.
    • We recommend that you do not use the AppsAnywhere installer, as you must then launch all other software (e.g. File explorer, internet browser) via AppsAnywhere in order for files to be visible to RStudio. [details]
  • In the RStudio console, run the command install.packages("palec", repos = c(getOption("repos"), "https://ms609.github.io/packages")) to install the 'palec' teaching package.
    • There's no need to update packages, or to install packages from source, which is time consuming.

Troubleshooting

  • General principles:
    • If a step doesn't work, read the error message carefully for clues as to why. Copying the error message into a search engine can often lead to a straightforward solution.
    • Try running the command again in case an issue was transitory.
  • If you see "Unable to locate R binaries": Try Downloading R and installing in the default location.

Terebratulid survivorship

[15–25 minutes]

We're going to learn how to interpret life history from assemblage data.

You can choose between using a point-and-click interface within R (straightforward) or constructing the plots yourself, by hand, spreadsheet, or writing your own code (which will help you obtain a deeper understanding of the methods.)

Thayer (1977) report length measurements (in mm) from 42 terebratulid brachiopods that died between censuses conducted in 1974 and 1975.

Getting the data

Save the data file to your computer, and inspect its contents.

  • To save a file from the internet, right-click the link and select "Save file as…" (Windows) or "Download linked file as…" (Mac). Save the file somewhere you'll be able to locate it later.

The format is simply a list of measurements, separated by spaces.

Open the histogram plotter

  • In the RStudio console, type palec::Histogram() to run the histogram 'app'.
    • You can resize the viewer, or click the icon to launch the app in a separate web browser, to gain a little more space.
  • Click 'Browse…' to load the data you saved to disk. The histogram will update automatically, showing the frequency of taxa with different lengths.
  • Set the X Label to "length / mm".

Save your plot in a convenient format using one of the download buttons, then answer the questions below. Write your answers down before clicking on them for my answers.

  • Do the data fit a normal distribution, or are they right-skewed (longer tail on the right) or left-skewed?
  • Use the “Fit Normal” checkbox to overlay the normal distribution that best matches the data.
  • Adjusting the bin size (number of bins) sometimes gives a clearer perspective.
  • What does the skewness suggest about the level of infant mortality?

Open the survivorship plotter

  • In the RStudio console, press Esc to terminate the Histogram app, then launch the survivorship app by running palec::Survivorship().
  • Load the terebratulid data again. The survivorship plot will update automatically, showing the proportion of individuals that survived to each age (i.e. size).
  • Configure the plot as necessary.
    • Which axes should be log-transformed?
  • Is this survivorship curve:
  • Type I (convex)
  • Type II (linear)
  • Type III (concave)

Careful examination of the living terebratulids determined that older individuals grow more slowly. The age of an individual thus scales with log(length).

Log-transform the x-axis, so that it represents a better proxy for age (rather than size).

  • Does this affect your interpretation?

Bringing it together

Now let's combine these insights to consider the likely life history of these terebratulids.

  • Based on the information from the size-frequency analysis and the survivorship plot, are the terebratulids more likely to be:
  • r-selected
  • K-selected
  • Are they likely to be associated with:
  • Early successional communities
  • Climax communities
  • As such, would you expect the taxon’s original community to be:
  • High diversity
  • Low diversity

Extension exercise

If you've got time, use a text editor such as Notepad++ to create your own .txt documents and load them into the plotters. Can you create a dataset that corresponds to each of the three survivorship curve trajectories? Does the skew of these datasets correspond to what you expect?

Manod Bach

Llanvirn assemblages from Wales

45–90 minutes

Now we're going to look at assemblage-level data in order to explore changes in the successional stage of an entire community.

Our data (download) are species counts from ten 5–10 kg samples taken at intervals from an Ordovician sedimentary succession in Wales (from Williams et al. 1981; see graphical log).

Launch the palec::Diversity() app and load the Llanvirn.txt data. Cross-reference the species with those shown below.

Each column in the input dataset represents a different assemblage. Start by selecting assemblage CD6.

  • What are your initial impressions of the diversity and dominance of this assemblage? Which taxon/taxa would you expect to form the trophic nucleus?

Next, we'll compare CD6 with the underlying and overlying assemblages, CD5 and CD7. Note from the log that these represent quite different environments, separated by unconformities. Look at the bar plots to get a feel for the relative species richness and dominance of assemblages CD5–CD7.

  • Note the different x axis scales, denoting different sample sizes. Use the "Axis size" slider to compare the assemblages to the same scale.
  • Looking for now just at the bar plots, mark each of these three communities on the crude spectra below.
  • As an example, I've sketched in assemblage CD1, which has very low species richness and is strongly dominated.
  • Double click the white "×" button to clear the canvas.

Now we're going to explore some of the quantitative indices of diversity, to see how they measure up against the intuitive assessment that you just made.

Look at the diversity indices for each assemblage, displayed beneath the bar plot.

  • Rank assemblages CD5–CD7 from least to most diverse, based on their Species richness (S).
  • Do you think that this metric is a reliable measure of diversity?
  • Hint: look at the number of individuals sampled in each assemblage.
  • Have a go at devising a formula that will allow the species richness of two assemblages, each containing a very different number of individuals, to be compared.
  • Getting the "right answer" is not important here: I'm interested in your thought process.

Richness indices

Menhinick’s and Margalef’s richness 'indices' are two attempts to normalize the number of species observed based on the number of individuals sampled.

The two approaches differ in their concept of how many more 'rare' species we expect to see as we sample more individuals: Menhinick's (S / √n) posits that the number of species increases with the square root of the number of individuals sampled; Margalef's ((S – 1) / ln n) that S should increase with the logarithm of sample size. Empirically, which is a better model seems to vary from assemblage to assemblage, so neither is necessarily “better”.

  • How does each approach order the three assemblages?
  • Does it matter which you choose to calculate?

Dominance indices

Now let’s consider how we might measure dominance.

It's a good idea to measure this using an index: mathematically, an index is a dimensionless value that ranges from zero to one. In this context, a value of one should denote maximum dominance; zero, maximum evenness.

The simplest measure, and the one that you are perhaps most likely to encouter, is the Berger–Parker index. The BPI is the proportion of the n individuals that belong to the most common taxon (nt) – i.e. nt / n.

Worked example:

Species A: 5 individuals
Species B: 15 individuals
Species C: 10 individuals
Species D: 10 individuals

n = 5 + 15 + 10 + 10 = 40 individuals

Most common taxon = Species B (15 individuals)

nt = 15

BPI = nt / t = 15 / 40 = 3 / 8 = 0.375

  • Consider two assemblages, A and B. A has a dominance index of 0.25. B has a dominance index of 0.25. Which is more dominated?
  • This is a somewhat mischevious question. Answer it anyway...
  • Consider an assemblage that contains 48 individuals and four species. What is the range of possible values that the Berger–Parker index can take?
  • Hint: Consider two scenarios: one where an assemblage is maximally dominated, one where it is maximally even.
    • Create a file containing these two assemblages, and load them into the Diversity viewer, to check your answers.
    • Set up a spreadsheet that will calculate the BPI as you edit the number of individuals and number of species, then play with the figures to see how the BPI changes.
  • More generally, what value does the Berger–Parker index take in an assemblage of n individuals that has perfect evenness (i.e. minimum domination)?
  • You may need to express this value in terms of the species richness S.
  • Do the Berger-Parker index values for each assemblage match up with your evaluation of their relative dominances? Do you intuitively agree, based on the graphs, that assemblages CD6 and CD7 have very similar dominances?
  • Now's a good time to revisit those two assemblages with a BPI of 0.25. What else do you need to know about the assemblages to work out which is more dominated? How useful is the BPI as a measure of dominance?
  • To mitigate these effects, I've added an option in the Diversity app to correct visualization of the BPI and related values for sample size effects, such that a zero-width bar corresponds to the lowest possible value, and a full-width bar corresponds to the highest.

    Use the "Correct range for sample size" tickbox to see how much difference this makes in practice – what properties characterize the most-affected assemblages?

Simpson Index

Did you notice in your calculations that the Berger–Parker index only incorporates the dominance of the most abundant taxon?

It doesn't seem right that these two assemblages would be considered equally dominated:

Species:ABCDn
Assemblage X:25251152
Assemblage Y:2599952

The Simpson index of dominance addresses this concern. It is given by Σ pi², where pi is the proportion of individuals belonging to taxon i.

  • What are the Simpson indices of these two assemblages?
  • What range of values can this index take for an assemblage with S species?

Shannon entropy / Equitability

The Shannon entropy, given by –Σpi ln pi, is a related but more pleasing measure of dominance.

Entropy reflects how likely you are to win a bet on what species the next individual to be sampled will belong to – in a dominated community, a bet on the dominant species will usually pay out.

Entropy is a measure of information (the average information content of an outcome), measured in non-arbitrary units (bits – the same unit of data that measures your broadband speed). As such, this is the only measure that has any objective meaning.

With some simple maths, this can be normalized to run from 0 (entirely dominated) to 1 (maximum evenness); this is reported as the Equitibility (J).

  • How do these three metrics of dominance compare with one another, and with your intuition, for the three assemblages?
Manod Bach

Envinronmental change in the Llanvirn

Application

Time to put this theory into practice. Examine each of the ten samples in the succession. View their individual graphs, then compare their dominance and diversity.

  • Can you detect any trends as time goes by (from assemblage CD-1 to assemblage CD-10)?

Whittaker plots

Now we will transform our data to generate rank-abundance plots.

  • In the Diversity app, select "Order by rank abundance". That's the 'rank' bit taken care of.
  • We need to log-transform the abundance – handily, there's a box for that too. Make sure it's checked.
  • To make the graph easier to interpret, we want to rotate it, and present it as points rather than bars. Select "Scatter plot" to accomplish this.
  • You can add a 'line of best fit' with the "Fit (log)linear" option: useful for evaluating possible geometric distributions.
  • Do any assemblages approximate (loosely) the straight line expected of a geometric distribution? Do others approximate the S-shape of the log-normal distribution, with few common and few rare species?
  • Don’t get thrown by “steps” in the data associated with jumps from one integer count to the next. Many species may be represented by a single individual – you can’t count 1.3 brachiopods.
  • Use Octave plots to validate your answers

Synthesis

Now it’s time to bring all these observations together.

  • From what you know about the ecology of the constituent organisms, reconstruct the palaeoenvironment in which they dwelt and provide a geological and ecological interpretation for any variation in diversity and dominance, and the shape of rank abundance plots, through the sedimentary sequence.
  • You may wish to refer to Fossils at a Glance and Scott's reference ternary plots.
  • Are your interpretations consistent with the sedimentary log?

References

  • THAYER, C. W. 1977. Recruitment, growth, and mortality of a living articulate brachiopod, with implications for the interpretation of survivorship curves. Paleobiology, 3, 98–109.
  • WILLIAMS, A., LOCKLEY, M. G. and HURST, J. M. 1981. Benthic palaeocommunities represented in the Ffairfach Group and Coeval Ordovician successions of Wales. Palaeontology, 24, 661–694.