課程信息
4.7
3,949 ratings
589 reviews
This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data. We will cover in detail the plotting systems in R as well as some of the basic principles of constructing data graphics. We will also cover some of the common multivariate statistical techniques used to visualize high-dimensional data....
Stacks

Course 4 of 10 in the

Globe

100% 在線課程

立即開始,按照自己的計劃學習。
Calendar

可靈活調整截止日期

根據您的日程表重置截止日期。
Clock

建議:5 hours/week

完成時間大約為15 小時
Comment Dots

English

字幕:English, Chinese (Simplified)

您將學到的內容有

  • Check
    Apply cluster analysis techniques to locate patterns in data
  • Check
    Make graphical displays of very high dimensional data
  • Check
    Understand analytic graphics and the base plotting system in R
  • Check
    Use advanced graphing systems such as the Lattice system

您將獲得的技能

Cluster AnalysisGgplot2R ProgrammingExploratory Data Analysis
Stacks

Course 4 of 10 in the

Globe

100% 在線課程

立即開始,按照自己的計劃學習。
Calendar

可靈活調整截止日期

根據您的日程表重置截止日期。
Clock

建議:5 hours/week

完成時間大約為15 小時
Comment Dots

English

字幕:English, Chinese (Simplified)

教學大綱 - 您將從這門課程中學到什麼

1

章節
Clock
完成時間為 20 小時

Week 1

This week covers the basics of analytic graphics and the base plotting system in R. We've also included some background material to help you install R if you haven't done so already. ...
Reading
15 個視頻(共 109 分鐘), 6 個閱讀材料, 7 個測驗
Video15 個視頻
Installing R on Windows (3.2.1)3分鐘
Installing R on a Mac (3.2.1)1分鐘
Installing R Studio (Mac)3分鐘
Setting Your Working Directory (Windows)7分鐘
Setting Your Working Directory (Mac)7分鐘
Principles of Analytic Graphics12分鐘
Exploratory Graphs (part 1)9分鐘
Exploratory Graphs (part 2) 5分鐘
Plotting Systems in R9分鐘
Base Plotting System (part 1)11分鐘
Base Plotting System (part 2)6分鐘
Base Plotting Demonstration16分鐘
Graphics Devices in R (part 1)5分鐘
Graphics Devices in R (part 2)7分鐘
Reading6 個閱讀材料
Welcome to Exploratory Data Analysis10分鐘
Syllabus10分鐘
Pre-Course Survey10分鐘
Exploratory Data Analysis with R Book10分鐘
The Art of Data Science10分鐘
Practical R Exercises in swirl Part 110分鐘
Quiz1 個練習
Week 1 Quiz20分鐘

2

章節
Clock
完成時間為 17 小時

Week 2

Welcome to Week 2 of Exploratory Data Analysis. This week covers some of the more advanced graphing systems available in R: the Lattice system and the ggplot2 system. While the base graphics system provides many important tools for visualizing data, it was part of the original R system and lacks many features that may be desirable in a plotting system, particularly when visualizing high dimensional data. The Lattice and ggplot2 systems also simplify the laying out of plots making it a much less tedious process....
Reading
7 個視頻(共 61 分鐘), 1 個閱讀材料, 6 個測驗
Video7 個視頻
Lattice Plotting System (part 2)6分鐘
ggplot2 (part 1)6分鐘
ggplot2 (part 2)13分鐘
ggplot2 (part 3)9分鐘
ggplot2 (part 4)10分鐘
ggplot2 (part 5)8分鐘
Reading1 個閱讀材料
Practical R Exercises in swirl Part 210分鐘
Quiz1 個練習
Week 2 Quiz20分鐘

3

章節
Clock
完成時間為 13 小時

Week 3

Welcome to Week 3 of Exploratory Data Analysis. This week covers some of the workhorse statistical methods for exploratory analysis. These methods include clustering and dimension reduction techniques that allow you to make graphical displays of very high dimensional data (many many variables). We also cover novel ways to specify colors in R so that you can use color as an important and useful dimension when making data graphics. All of this material is covered in chapters 9-12 of my book Exploratory Data Analysis with R....
Reading
12 個視頻(共 77 分鐘), 1 個閱讀材料, 4 個測驗
Video12 個視頻
Hierarchical Clustering (part 2)5分鐘
Hierarchical Clustering (part 3)7分鐘
K-Means Clustering (part 1)5分鐘
K-Means Clustering (part 2)4分鐘
Dimension Reduction (part 1)7分鐘
Dimension Reduction (part 2)9分鐘
Dimension Reduction (part 3)6分鐘
Working with Color in R Plots (part 1)4分鐘
Working with Color in R Plots (part 2)7分鐘
Working with Color in R Plots (part 3)6分鐘
Working with Color in R Plots (part 4)3分鐘
Reading1 個閱讀材料
Practical R Exercises in swirl Part 310分鐘

4

章節
Clock
完成時間為 6 小時

Week 4

This week, we'll look at two case studies in exploratory data analysis. The first involves the use of cluster analysis techniques, and the second is a more involved analysis of some air pollution data. How one goes about doing EDA is often personal, but I'm providing these videos to give you a sense of how you might proceed with a specific type of dataset. ...
Reading
2 個視頻(共 55 分鐘), 2 個閱讀材料, 2 個測驗
Video2 個視頻
Air Pollution Case Study40分鐘
Reading2 個閱讀材料
Practical R Exercises in swirl Part 410分鐘
Post-Course Survey10分鐘
4.7
Direction Signs

37%

完成這些課程後已開始新的職業生涯
Briefcase

83%

通過此課程獲得實實在在的工作福利
Money

18%

加薪或升職

熱門審閱

創建者 CCJul 29th 2016

This is the second course I have taken from Roger Peng and both were outstanding. I have a strong math background, but not much of a background in stats, but this course was very approachable for me.

創建者 YSep 24th 2017

Very good course! It provide me the foundation in learning how to plot and interpret data. This will definitely strengthen my "R programming" to generate publication type figure for my genomics data!

講師

Roger D. Peng, PhD

Associate Professor, Biostatistics
Bloomberg School of Public Health

Jeff Leek, PhD

Associate Professor, Biostatistics
Bloomberg School of Public Health

Brian Caffo, PhD

Professor, Biostatistics
Bloomberg School of Public Health

關於 Johns Hopkins University

The mission of The Johns Hopkins University is to educate its students and cultivate their capacity for life-long learning, to foster independent and original research, and to bring the benefits of discovery to the world....

關於 Data Science 專項課程

Ask the right questions, manipulate data sets, and create visualizations to communicate results. This Specialization covers the concepts and tools you'll need throughout the entire data science pipeline, from asking the right kinds of questions to making inferences and publishing results. In the final Capstone Project, you’ll apply the skills learned by building a data product using real-world data. At completion, students will have a portfolio demonstrating their mastery of the material....
Data Science

常見問題

  • Once you enroll for a Certificate, you’ll have access to all videos, quizzes, and programming assignments (if applicable). Peer review assignments can only be submitted and reviewed once your session has begun. If you choose to explore the course without purchasing, you may not be able to access certain assignments.

  • When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

還有其他問題嗎?請訪問 學生幫助中心