Data Structures

sites <- c("a", "a", "b", "c")

length(sites)
density_ha <- c(2.8, 3.2, 1.5, 3.8)
mean(density_ha)
max(density_ha)
min(density_ha)
sum(density_ha)

Do Bird Banding 1-4.

density_ha <- c(2.8, 3.2, 1.5, NA)
mean(density_ha)

Why did we get NA?
- Hard to say what a calculation including NA should be
- So most calculations return NA when NA is in the data
Can tell many functions to remove the NA before calculating

mean(density_ha, na.rm = TRUE)

density_ha <- c(2.8, 3.2, 1.5, 3.8)
area_ha <- c(3, 5, 1.9, 2.7)
total_number <- density_ha * area_ha

area[sites == 'a']

area[sites != 'a']

sites[area_ha > 3]
sites[area_ha >= 3]
sites[area_ha < 3]

sites[sites != 'a']

Do Shrub Volume Vectors 1-3.

surveys <- data.frame(sites, density_ha, area_ha)

Useful commands:
- str(surveys)
- length(surveys)
- nrow(surveys), ncol(surveys)
Subsetting:
- [row, column]
- surveys[1, 2]
- surveys[1:2, 2:3]
- surveys[, 3]
- surveys[“area_ha”]
- surveys[c(“area_ha”, “sites”)]
- surveys$area_ha
- surveys[[“area_ha”]]

read.csv()
- Main argument is the location of the data - url or path on computer
- Go to Datasets page on site and copy Shrub dimensions url

shrub_data <- read.csv('https://datacarpentry.org/semester-biology/data/shrub-dimensions-labeled.csv')

str(shrub_data)

shrub_data <- read.csv('https://datacarpentry.org/semester-biology/data/shrub-dimensions-labeled.csv', stringsAsFactors = FALSE)
str(shrub_data)

Start Shrub Volume Data Frame, but just use the url instead of downloading the file.

Data Science for Biologists