Basic Linux command line skills are valuable but not required. Each participant will be required to run a 64 bit virtual machine (provided with the course).
R is an open source project including a language and an environment for statistical computing and graphics. This course will give participants an in depth introduction to R and it use as a data analysis tool. The course examines a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering) and graphical techniques for working with data. Hands on labs will give participants practical experience curating data, working with R commands for statistics and visualization as well as a short introduction to programming in R.
This course is designed for Application developers, analysts and data scientists.
Upon completion of this course, participants will be able to:
- Have a broad but practical understanding of R and its use in data analysis
Introduction to R
Working with data
Populations and distributions
Simulations and confidence intervals
Significance tests and goodness of fit
Regressions and variance analysis
Time series and forecasting