Background

In nearly every field of science, our ability to generate data has exceeded our capacity for analysis. To me, this represents a potential loss to science: many important discoveries may go unnoticed because we cannot analyze data efficiently.

The goal of this course is to help you get started managing, transforming, and visualizing data using the R programming language. You will learn to clearly and neatly organize messy data, transform it in ways that address your questions, and communicate results in a variety of formats. The course is designed for people with no prior programming experience. There is a substantial learning curve but, working together, we will make learning R easier, interesting, and fun.

This class is NOT focused on statistical analysis or interpretation. The goal is to provide you with the foundational tools you need to perform these tasks.

First steps

It is important to understand that the first two weeks of the class can seem slow, tedious, and frustrating. These first steps are like learning a new language; you often won’t know what is being said or why. Please be patient and stick with me! I promise that it will get easier.

Resources

Given that R is open source, many resources are available online. We will use a combination of resources in the class, but key items are listed below.

UC Davis

Our campus has a wealth of expertise in data science. There is even a new major in data science. Should your interests progress, here are some links. They offer regular workshops and maintain archives. One goal of this class is to get you set up so that you can attend these workshops and learn more!

Help!

When you need help with homework or a class topic, please post on our class Discord page. We are here to support you, and everyone is encouraged to participate.

Lab 1 Goals

The goal of Lab 1 is to get everyone started using R, RStudio, and GitHub. All of our work will be done in RStudio and uploaded to the class GitHub repository. It is important that everyone is set up correctly before we are done today. In the spirit of the R universe, our class is a community. If you see someone struggling, please give them some help.

Lab Computers

This quarter we are fortunate to be able to use the computers in the SCC. Each of these computers is up to date, and the installed software is exactly the same. This makes following my instructions much easier, but you will likely want to be able to work at home. Please follow the directions below to set up your personal computer.

Set up your computer

Because you will need to work on assignments at home, it is important that you spend time making sure that your computer is set up and ready to go. The first step is basic maintenance, i.e., clean up your desktop and update your OS. Data scientists are neat and tidy! Spend time getting yourself organized; it will pay off.

Please follow these four steps to set up your computer here.

That’s it! Let’s take a break and then move on to part 2!
