Contributors: Jay Lennon, Ken Locey, Mario Muscarella
Funding: Development of Indiana University's first Quantitative Biodiversity class, including the materials in this repository, were supported by a National Science Foundation Dimensions of Biodiversity grant (1442246) awarded to Drs Jay Lennon and Ken Locey of Indiana University, Bloomington.
R version 3.1.2
RStudio Version 0.98.1062 or higher
Git version 1.9.3 or higher
Windows specific:
LaTeX: MiKTEX, http://miktex.org/download
Git: GitBash, unix-style terminal window, may install with Git
Mac specific:
LaTeX: MacTeX or BasicTeX https://tug.org/mactex/morepackages.html
NOTE: GitHub for Mac and Windows can also be used, but were not used as part of the 2015 course.
The 2015 Quantitative Biodiversity class consisted of eight classes. The contents of the course materials are presented here, in the order the classes were taught.
The objectives in this assignment were actually conducted over the span of two weeks. In the first week, we introduced students to Git and GitHub, and explained the general workflow of the class. We had students fork the class repository to their own IU-Enterprise GitHub account. We then had students clone that repository onto their classroom desktop.
In the second week, we introduced students to programming in R and the R Studio development environment. Students completed basic calculations, data manipulation, and introductory data visualization and statistics, including correlation, regression, and ANOVA.
- Course Goals: Computer Setup; Intro to Git. This can be completed once weeks 1 and 2 are split
The third week focused on aspects of alpha-diversity, i.e., diversity within a site. Students were introduced to the site-by-species matrix as a primary ecological data structure. Students worked with the vegan package in R and the BCI data within vegan, as well as soil bacteria data. They estimated coverage, richness, evenness, and diversity, and conducted rarefaction and species abundance distribution analyses.
-
data: Contains a file of site-by-OTU data for soil bacteria (soilbac.txt)
-
alpha_exercise.Rmd: An R Markdown document distributed to student repositories; includes the in-class assignment and homework. Knitting generates a .pdf of their work.
-
alpha_exercise.pdf: Output of alpha_exercise.Rmd. This is the file that students turned in via submitting a pull request on GitHub.
-
alpha_handout.Rmd: Generates a handout that was given to students to guide them through the in-class assignment, which includes example code.
-
alpha_handout.pdf: pdf version of alpha_handout.Rmd
-
alpha_supplementary.Rmd: Contains code that could be made available to students or included into the exercise
The fourth week focused on aspects of beta-diversity, i.e., heterogeneity or turnover in diversity across sites. Instructors emphasized tools to quantify beta-diversity, including distance metrics. With this information, students learned how to visualize data (heatmaps, clustern analysis, ordination) and test hypotheses (CCA, PERMANOVA).
-
data: folder containing files of data needed for exercise
-
bci.soil.txt : Environmental data for Barro-Colorado Island (BCI), obtained from (anadat-r)
-
soil.factors.txt : KBS Soil Experimental Design File
-
soilbac.RData : R Data object with KBS Soil Bacterial Community Site-by-Species matrix and taxonomy
-
soilbac.list.txt : Soil Bacteria Site-by-Species Matrix and Taxonomy
-
soilbacfull.txt : Soil Bacteria Site-by-Species Matrix and Taxonomy
-
-
beta_exercise.Rmd: An R Markdown document distributed to student repositories; includes the in-class assignment and homework. Knitting generates a .pdf of their work.
-
beta_exercise.pdf: Output of beta_exercise.Rmd. This is the file that students turned in via submitting a pull request on GitHub.
-
beta_handout.Rmd: Generates a handout that was given to students to guide them through the in-class assignment, which includes example code.
-
beta_handout.pdf: pdf version of beta_handout.Rmd
-
beta_homework_list.Rmd: instructor directions for making RData file
-
beta_supplementary.Rmd: Contains code that could be made available to students or included into the exercise
Week 5 focused on geographical ecology, i.e., examining diversity across a landscape in a spatially explicit context, using R Studio as a geographical information systems (GIS) environment. This lesson introduced students to the concepts of spatial aggregation, spatial autocorrelation, and considerations of spatial scale. This lesson also introduced students to the primary biogeographical/macroecological patterns known as the distance-decay relationship, spatial abundance distribution, and the species-area relationship.
-
BrownCoData:
- 20130801_PondDataMod.csv : Geographical and Environmental data for refuge ponds (n = 51) sampled within the Brown County, Yellowwood State Forest, and Hoosier National Forest areas during August, 2013.
- SiteBySpecies.csv : File containing the abundances of bacterial taxa (DNA and cDNA) across the 51 refuge ponds.
- pond.JPG : An image of one of the refuge ponds that can be imported into the exercise handout.
-
LandCover:
- LandCover.tif : This is not an image file. This is actually a file containing geographical data, which is called within the exercise .Rmd file to produce a heatmap of percent landcover across the central Indiana area.
-
water:
- water.shp : Shapefile used to generate a GIS layer of water data. Requires .dbf and .shx files.
- water.dbf : Shapefile attribute format file
- water.shx : Shape index format file
- water.prj : File of geographic projection data
-
GeographicalEcology_Exercise.Rmd: An R Markdown document distributed to student repositories; includes the in-class assignment and homework. Knitting generates a .pdf of their work.
-
GeographicalEcology_Exercise.pdf: Blank Knit of Rmd Exercise
-
GeogrpahicalEcology_Handout.Rmd: Generates a handout that was given to students to guide them through the in-class assignment, which includes example code.
-
GeographicalEcology.pdf: pdf version of GeographicalEcology_Handout.Rmd.
-
Glossary.Rmd : Contains a few basic terms for those with little-to-no experience with spatial or geographical jargon.
Week 6 introduced concepts and methods of phylogenetic diversity. Students learned how to align sequence data, evaluate models of DNA evolution, and make phylogenetic trees. Last, student leanred how to map trait onto trees and test for phylogenetic signal.
-
data:
-
out.aln.muscle.afa: muscle alignment fasta file
-
p.isolates.afa: aligned phosphorus isolate sequences
-
p.isolates.fasta: raw phosphorus isolate sequences
-
p.isolates.kali.fasta: raw phosphorus isolate sequences
-
p.isolates.raw.growth.txt: raw growth rate data
-
p.isolates.std.growth.txt: standardized growth rate data
-
p.isolates.taxonomy.txt: phosphorus isolate taxonomy information
-
-
PhyloTraits_exercise.Rmd: An R Markdown document distributed to student repositories; includes the in-class assignment and homework. Knitting generates a .pdf of their work.
-
PhyloTraits_exercise.pdf: Output of PhyloTraits_exercise.Rmd. This is the file that students turned in via submitting a pull request on GitHub.
-
PhyloTraits_handout.Rmd: Generates handout .pdf that can be given to student and provides example code while guiding them through the exercise.
-
PhyloTraits_handout.pdf: pdf version of PhyloTraits_handout.Rmd.
-
PhyloTraits_supplementary.Rmd: Contains supplemental code that was not included in the week's exercise but can be included in future classes.
Week 7 introduced concepts and tools needed to look at multi-species assembalages in a phylogenetic context. Students were introduced to measures of phylogenetic alpha- and beta-diversity. In addition, students learned how to evaluate the contribution of phylogeny to geographical patterns of biodiversity, including distance-decay relationship and the phylogenetic diversity-area relationship.
-
BrownCoData:
- 20130801_PondDataMod.csv : Geographical and Environmental data for refuge ponds sampled within the Brown County, Yellowwood State Forest, and Hoosier National Forest areas during August, 2013.
- SiteBySpecies.csv : File containing the abundances of bacterial taxa (DNA and cDNA) across the refuge ponds.
- pond.JPG : An image of one of the refuge ponds that can be imported into the exercise handout.
-
bin:
- MothurTools.R: source functions for importing mothur output files into R
-
data:
- 20130801_PondDataMod.csv: Environmental data for IN Ponds
- INPonds.final.1.rep.fasta: Genus level representative sequences
- INPonds.final.FW.1.taxonomy: Genus level taxonomy
- INPonds.final.FW.shared: IN Ponds bacterial site-by-species matrix
- INPonds.final.FW.taxonomy: IN Ponds bacterial taxonomy information
- INPonds.final.rdp.1.cons.taxonomy: IN Ponds RDP genus taxonomy information
- INPonds.final.rdp.1.rep.fasta: IN Ponds bacterial genus representative sequences
- INPonds.final.rdp.shared: IN Ponds phylotypes
- methanosarcina.fasta: Methanosarcina outgroup fasta file
-
PhyloCom_exercise.Rmd: An R Markdown document distributed to student repositories; includes the in-class assignment and homework. Knitting generates a .pdf of their work.
-
PhyloCom_exercise.pdf: Output of PhyloCom_exercise.Rmd This is the file that students turned in via submitting a pull request on GitHub.
-
PhyloCom_handout.Rmd: Generates handout .pdf that can be given to student and provides example code while guiding them through the exercise.
-
PhyloCom_handout.pdf: pdf version of PhyloCom_handout.Rmd.
-
PhyloCom_supplementary.Rmd: Supplemental analyses
-
PhyloCom_supplementary.pdf: Knitted version of supplemental
In the eigth and final week, we had student construct their own file of R source-code for various functions used throughout Quantitative Biodiversity. We discussed the topics of open science and reproducible science more thoroughly, and inroduced students to the idea of licenses. We had students set up public accounts on GitHub.com, showed them how to apply for private repositories, and demonstrated how they could move their class-based code into their own GitHub.com account.
-
images A folder containing images that are used by the GitOn_handout.Rmd document
-
GitOn_handout.Rmd: Generates a handout that guides students through migrating their course materials and assignments to GitHub, and introduces them to licensing, and GitHub.com.
-
GitOn_handout.pdf The handout given to students.
-
README.md: An example README.md file
-
example_license.md: An example GNU GPL version 3 license
A basic set reference materials relating to Git, R, Unix/Linux, and the installation of various software.
- GitFlow-All.pdf
- RMarkdownCheatSheet.pdf
- RMarkdownCheatSheet.png
- R_ReferenceCard.pdf
- SoftwareInstallation.Rmd
- SoftwareInstallation.pdf
- Unix_Reference.pdf
- git-cheat-sheet.pdf
- git-cheat-sheet.png
R Style Guide: https://google-styleguide.googlecode.com/svn/trunk/Rguide.xml