Getting and Cleaning Data


About this Course

Before you can work with data you have to get some. This course will cover the basic ways that data can be obtained. The course will cover obtaining data from the web, from APIs, from databases and from colleagues in various formats. It will also cover the basics of data cleaning and how to make data “tidy”. Tidy data dramatically speed downstream data analysis tasks. The course will also cover the components of a complete data set including raw data, processing instructions, codebooks, and processed data. The course will cover the basics needed for collecting, cleaning, and sharing data.

This course is part of multiple programs
This course can be applied to multiple Specializations or Professional Certificates programs.

Completing this course will count towards your learning in any of the following programs:

  • Data Science: Foundations using R Specialization
  • Data Science Specialization

WHAT YOU WILL LEARN

Understand common data storage systems
Apply data cleaning basics to make data "tidy"
Use R for text and date manipulation
Obtain usable data from the web, APIs, and databases

SKILLS YOU WILL GAIN


  • Data Manipulation
  • Regular Expression (REGEX)
  • R Programming
  • Data Cleansing