This course focuses on the concepts and tools behind reporting modern data analyses in a reproducible manner. Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them. The need for reproducibility is increasing dramatically as data analyses become more complex, involving larger datasets and more sophisticated computations. Reproducibility allows for people to focus on the actual content of a data analysis, rather than on superficial details reported in a written summary. In addition, reproducibility makes an analysis more useful to others because the data and code that actually conducted the analysis are available. This course will focus on literate statistical analysis tools which allow one to publish data analyses in a single document that allows others to easily execute the same analysis to obtain the same results.
提供方
課程信息
您將學到的內容有
Organize data analysis to help make it more reproducible
Write up a reproducible data analysis using knitr
Determine the reproducibility of analysis project
Publish reproducible web documents using Markdown
您將獲得的技能
- Knitr
- Data Analysis
- R Programming
- Markup Language
提供方

约翰霍普金斯大学
The mission of The Johns Hopkins University is to educate its students and cultivate their capacity for life-long learning, to foster independent and original research, and to bring the benefits of discovery to the world.
授課大綱 - 您將從這門課程中學到什麼
Week 1: Concepts, Ideas, & Structure
This week will cover the basic ideas of reproducible research since they may be unfamiliar to some of you. We also cover structuring and organizing a data analysis to help make it more reproducible. I recommend that you watch the videos in the order that they are listed on the web page, but watching the videos out of order isn't going to ruin the story.
Week 2: Markdown & knitr
This week we cover some of the core tools for developing reproducible documents. We cover the literate programming tool knitr and show how to integrate it with Markdown to publish reproducible web documents. We also introduce the first peer assessment which will require you to write up a reproducible data analysis using knitr.
Week 3: Reproducible Research Checklist & Evidence-based Data Analysis
This week covers what one could call a basic check list for ensuring that a data analysis is reproducible. While it's not absolutely sufficient to follow the check list, it provides a necessary minimum standard that would be applicable to almost any area of analysis.
Week 4: Case Studies & Commentaries
This week there are two
審閱
- 5 stars68.61%
- 4 stars23%
- 3 stars5.73%
- 2 stars1.62%
- 1 star1.02%
來自可重复性研究的熱門評論
Without taking this course wouldn't have fully understood the importance of reproducible research in data science. Thank you so much. I recommend this course for all data scientists.
Very helpful and informative information on how to create reproducible research. The project gives you an opportunity to create reproducible research in the format of a report.
I took this course as part of the Data Science specialization without any real expectation and realized that this subject is probably one of the most important in data analysis.
Enjoyed learning about rMarkdown, caching, and RPubs. Was also able to spend time plotting and aggregating data in different ways. Didn't enjoy cleaning data too much :)
常見問題
我什么时候能够访问课程视频和作业?
我订阅此专项课程后会得到什么?
有助学金吗?
還有其他問題嗎?請訪問 學生幫助中心。