Task: Import and Export Data

Find the solution here after the session ends.

Get started

If you haven’t already, install the tidyverse (do this in the console, not in your script):

install.packages("tidyverse")

Make sure you have the tidyverse loaded at the top of your script:

library(tidyverse)

Download the file below and save it in the data/ folder of your project.
Use read_csv() to read the file into R and save it in a variable called trees.
Explore the data:
- Use summary(trees) — how many tree species are there? What is the tallest tree?
- Use view(trees) to look at the full table
- Use $ to access the height_m column and calculate the mean height
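The exploration steps above can be sketched on a toy file. This is only an illustration: the file name `trees_example.csv` and its values are invented here, and the real file may contain other columns besides `species` and `height_m`.

```r
library(tidyverse)

# Toy stand-in for the downloaded file; names and values are invented.
path <- file.path(tempdir(), "trees_example.csv")
write_lines(
  c("species,height_m",
    "oak,21.5",
    "beech,30.0",
    "oak,18.5"),
  path
)

# Read the CSV file into a tibble
trees <- read_csv(path, show_col_types = FALSE)

summary(trees)             # per-column summaries
n_distinct(trees$species)  # number of different species
max(trees$height_m)        # height of the tallest tree
mean(trees$height_m)       # mean height, accessing the column with $
```

For your own project, the path would point into the data/ folder instead of a temporary directory.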

Download the Excel file below and save it in the data/ folder of your project.
Load the readxl package with library(readxl).
Use read_excel() to read the file into R and save it in a variable.
Explore the data with summary().
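A minimal sketch of the Excel workflow, using the demo workbook `datasets.xlsx` that ships with readxl rather than the course file. The variable name `demo` is arbitrary.

```r
library(readxl)

# readxl_example() returns the path to a demo workbook bundled with readxl.
path <- readxl_example("datasets.xlsx")

excel_sheets(path)        # list the sheet names in the workbook
demo <- read_excel(path)  # reads the first sheet by default
summary(demo)
```

If the workbook has several sheets, the `sheet` argument of `read_excel()` selects one by name or position.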

Take the trees tibble you just read in and write it to a new file.
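One possible way to do this is with `write_csv()` from readr, sketched here on a small invented tibble. The output file name `trees_copy.csv` is an assumption. In your project you would write to the data/ folder instead.

```r
library(tidyverse)

# Small invented tibble standing in for the real trees data
trees <- tibble(
  species  = c("oak", "beech"),
  height_m = c(21.5, 30)
)

# Write the tibble to a CSV file (here in a temporary directory)
out_path <- file.path(tempdir(), "trees_copy.csv")
write_csv(trees, out_path)

# Read it back to check that the round trip worked
trees_copy <- read_csv(out_path, show_col_types = FALSE)
trees_copy
```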

Download the file below and save it in the data/ folder.
Try reading it with read_csv(). Something is wrong — can you figure out what?
- Hint: open the file in a text editor or in RStudio (File → Open File) to see its structure.
Use the appropriate argument of read_csv() to fix the problem. Check ?read_csv for an argument that lets you skip lines at the top of a file.
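To illustrate the idea, here is a sketch with an invented file that has two metadata lines above the header. The number of lines to skip in the real file may differ, so inspect it first.

```r
library(tidyverse)

# Invented example file: two metadata lines, then the actual table
path <- file.path(tempdir(), "trees_with_metadata.csv")
write_lines(
  c("Forest inventory export",
    "Values are invented for this example",
    "species,height_m",
    "oak,21.5",
    "beech,30.0"),
  path
)

# Look at the raw lines to see where the table starts
read_lines(path, n_max = 3)

# Skip the two metadata lines so the third line becomes the header
trees_fixed <- read_csv(path, skip = 2, show_col_types = FALSE)
trees_fixed
```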

You can do the following exercises in any order, or skip them and just take a break.

Download the file below and try to read it into R. This file has multiple problems — you’ll need more than one argument to fix them.

Hint: the file has metadata lines at the top and uses a delimiter other than a comma.
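A sketch of combining two arguments, assuming a semicolon as the delimiter and two metadata lines. Both are guesses for illustration, and the file contents below are invented. Check the real file in a text editor to find its actual delimiter and number of metadata lines.

```r
library(tidyverse)

# Invented example file: metadata on top, semicolon-separated values
path <- file.path(tempdir(), "soil_example.txt")
write_lines(
  c("Soil survey export",
    "Values are invented for this example",
    "Sample ID;Depth (cm);pH",
    "A1;10;6.5",
    "A2;20;7.1"),
  path
)

# read_delim() takes the delimiter explicitly; skip drops the metadata lines
soil <- read_delim(path, delim = ";", skip = 2, show_col_types = FALSE)
soil
```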

After reading the messy soil data, try using janitor::clean_names() on it. What does it do to the column names?

You may need to install the janitor package first: install.packages("janitor").
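As a small illustration of what to look for, the sketch below applies `clean_names()` to an invented tibble with awkward column names. The column names are made up for this example and are not taken from the soil file.

```r
library(tibble)
library(janitor)

# Invented tibble with spaces, capitals, and brackets in the column names
messy <- tibble(
  `Sample ID`  = c("A1", "A2"),
  `Depth (cm)` = c(10, 20),
  `Soil Type`  = c("clay", "sand")
)

names(messy)  # original names

# clean_names() converts the names to snake_case by default
clean <- clean_names(messy)
names(clean)  # compare with the original names
```

Names without spaces or special characters can be used with `$` without backticks.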

If you have your own research data, try reading it into R:

- Copy a data file into the data/ folder of your project
- Use the appropriate read_*() function to read it in
- Did it work? If not, check ?read_csv or ?read_excel for arguments that might help