Save for later
Using Descriptive Statistics to Analyze Data in R
By the end of this project, you will create a data quality report file (exported to Excel in CSV format) from a dataset loaded in R, a free, open-source program that you can download. You will learn how to use the following descriptive statistical metrics in order to describe a dataset and how to calculate them in basic R with no additional libraries.
- minimum value
- maximum value
- average value
- standard deviation
- total number of values
- missing values
- unique values
- data types
You will then learn how to record the statistical metrics for each column of a dataset using a custom function created by you in R. The output of the function will be a ready-to-use data quality report. Finally, you will learn how to export this report to an external file.
A data quality report can be used to identify outliers, missing values, data types, anomalies, etc. that are present in your dataset. This is the first step to understand your dataset and let you plan what pre-processing steps are required to make your dataset ready for analysis.
Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Get a Reminder
Rating | Not enough ratings |
---|---|
Length | 2 weeks |
Effort | 1.5 hours |
Starts | Jul 10 (38 weeks ago) |
Cost | $9 |
From | Coursera Project Network via Coursera |
Instructor | Dr. Nikunj Maheshwari |
Download Videos | On all desktop and mobile devices |
Language | English |
Subjects | Data Science |
Tags | Data Science Data Analysis |
Get a Reminder
Similar Courses
Careers
An overview of related careers and their average salaries in the US. Bars indicate income percentile.
Data Quality Engineer 2 $47k
Coordinator of Quality Data $58k
Data Quality Analyst 2 $60k
Data and Quality Coordinator $64k
Data Quality Auditor $64k
Data Quality Steward $71k
Data Quality Technician $72k
Data Quality Analyst - Data Research Sourcing $74k
Data Quality Management $81k
Data Quality Assurance $92k
Senior Data Quality Administrator $120k
Data Scientist - Clinical Quality $127k
Write a review
Your opinion matters. Tell us what you think.
Please login to leave a review
Rating | Not enough ratings |
---|---|
Length | 2 weeks |
Effort | 1.5 hours |
Starts | Jul 10 (38 weeks ago) |
Cost | $9 |
From | Coursera Project Network via Coursera |
Instructor | Dr. Nikunj Maheshwari |
Download Videos | On all desktop and mobile devices |
Language | English |
Subjects | Data Science |
Tags | Data Science Data Analysis |
Similar Courses
Sorted by relevance
Like this course?
Here's what to do next:
- Save this course for later
- Get more details from the course provider
- Enroll in this course