關於此 專項課程
298,734

100% 在線課程

立即開始,按照自己的計劃學習。

靈活的計劃

設置並保持靈活的截止日期。

初級

You should have beginner level experience in Python. Familarity with regression is recommended.

完成時間大約為8 個月

建議 5 小時/週

英語(English)

字幕:英語(English), 阿拉伯語(Arabic), 法語(French), 中文(簡體), 希臘語, 意大利語, 巴西葡萄牙語, 越南語, 俄語(Russian), 土耳其語(Turkish), 希伯來語, 日語...

您將學到的內容有

  • Check

    Use R to clean, analyze, and visualize data.

  • Check

    Navigate the entire data science pipeline from data acquisition to publication.

  • Check

    Use GitHub to manage data science projects.

  • Check

    Perform regression analysis, least squares and inference using regression models.

您將獲得的技能

GithubMachine LearningR ProgrammingRegression Analysis

100% 在線課程

立即開始,按照自己的計劃學習。

靈活的計劃

設置並保持靈活的截止日期。

初級

You should have beginner level experience in Python. Familarity with regression is recommended.

完成時間大約為8 個月

建議 5 小時/週

英語(English)

字幕:英語(English), 阿拉伯語(Arabic), 法語(French), 中文(簡體), 希臘語, 意大利語, 巴西葡萄牙語, 越南語, 俄語(Russian), 土耳其語(Turkish), 希伯來語, 日語...

專項課程 的運作方式

加入課程

Coursera 專項課程是幫助您掌握一門技能的一系列課程。若要開始學習,請直接註冊專項課程,或預覽專項課程並選擇您要首先開始學習的課程。當您訂閱專項課程的部分課程時,您將自動訂閱整個專項課程。您可以只完成一門課程,您可以隨時暫停學習或結束訂閱。訪問您的學生面板,跟踪您的課程註冊情況和進度。

實踐項目

每個專項課程都包括實踐項目。您需要成功完成這個(些)項目才能完成專項課程並獲得證書。如果專項課程中包括單獨的實踐項目課程,則需要在開始之前完成其他所有課程。

獲得證書

在結束每門課程並完成實踐項目之後,您會獲得一個證書,您可以向您的潛在雇主展示該證書並在您的職業社交網絡中分享。

how it works

此專項課程包含 10 門課程

課程1

数据科学家的工具箱(中文版)

4.5
(19,054 個評分)
In this course you will get an introduction to the main tools and ideas in the data scientist's toolbox. The course gives an overview of the data, questions, and tools that data analysts and data scientists work with. There are two components to this course. The first is a conceptual introduction to the ideas behind turning data into actionable knowledge. The second is a practical introduction to the tools that will be used in the program like version control, markdown, git, GitHub, R, and RStudio....
課程2

R 语言程序设计(中文版)

4.6
(14,059 個評分)
In this course you will learn how to program in R and how to use R for effective data analysis. You will learn how to install and configure software necessary for a statistical programming environment and describe generic programming language concepts as they are implemented in a high-level statistical language. The course covers practical issues in statistical computing which includes programming in R, reading data into R, accessing R packages, writing R functions, debugging, profiling R code, and organizing and commenting R code. Topics in statistical data analysis will provide working examples....
課程3

获取和整理数据

4.6
(5,988 個評分)
Before you can work with data you have to get some. This course will cover the basic ways that data can be obtained. The course will cover obtaining data from the web, from APIs, from databases and from colleagues in various formats. It will also cover the basics of data cleaning and how to make data “tidy”. Tidy data dramatically speed downstream data analysis tasks. The course will also cover the components of a complete data set including raw data, processing instructions, codebooks, and processed data. The course will cover the basics needed for collecting, cleaning, and sharing data....
課程4

探索性数据分析

4.7
(4,557 個評分)
This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data. We will cover in detail the plotting systems in R as well as some of the basic principles of constructing data graphics. We will also cover some of the common multivariate statistical techniques used to visualize high-dimensional data....
課程5

可重复性研究

4.5
(3,165 個評分)
This course focuses on the concepts and tools behind reporting modern data analyses in a reproducible manner. Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them. The need for reproducibility is increasing dramatically as data analyses become more complex, involving larger datasets and more sophisticated computations. Reproducibility allows for people to focus on the actual content of a data analysis, rather than on superficial details reported in a written summary. In addition, reproducibility makes an analysis more useful to others because the data and code that actually conducted the analysis are available. This course will focus on literate statistical analysis tools which allow one to publish data analyses in a single document that allows others to easily execute the same analysis to obtain the same results....
課程6

统计推断

4.2
(3,198 個評分)
Statistical inference is the process of drawing conclusions about populations or scientific truths from data. There are many modes of performing inference including statistical modeling, data oriented strategies and explicit use of designs and randomization in analyses. Furthermore, there are broad theories (frequentists, Bayesian, likelihood, design based, …) and numerous complexities (missing data, observed and unobserved confounding, biases) for performing inference. A practitioner can often be left in a debilitating maze of techniques, philosophies and nuance. This course presents the fundamentals of inference in a practical approach for getting things done. After taking this course, students will understand the broad directions of statistical inference and use this information for making informed choices in analyzing data....
課程7

回归模型

4.4
(2,554 個評分)
Linear models, as their name implies, relates an outcome to a set of predictors of interest using linear assumptions. Regression models, a subset of linear models, are the most important statistical analysis tool in a data scientist’s toolkit. This course covers regression analysis, least squares and inference using regression models. Special cases of the regression model, ANOVA and ANCOVA will be covered as well. Analysis of residuals and variability will be investigated. The course will cover modern thinking on model selection and novel uses of regression models including scatterplot smoothing....
課程8

实用机器学习

4.5
(2,460 個評分)
One of the most common tasks performed by data scientists and data analysts are prediction and machine learning. This course will cover the basic components of building and applying prediction functions with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and tests sets, overfitting, and error rates. The course will also introduce a range of model based and algorithmic machine learning methods including regression, classification trees, Naive Bayes, and random forests. The course will cover the complete process of building prediction functions including data collection, feature creation, algorithms, and evaluation....
課程9

数据产品开发

4.5
(1,703 個評分)
A data product is the production output from a statistical analysis. Data products automate complex analysis tasks or use technology to expand the utility of a data informed model, algorithm or inference. This course covers the basics of creating data products using Shiny, R packages, and interactive graphics. The course will focus on the statistical fundamentals of creating a data product that can be used to tell a story about data to a mass audience....
課程10

数据课程毕业项目

4.5
(827 個評分)
The capstone project class will allow students to create a usable/public data product that can be used to show your skills to potential employers. Projects will be drawn from real-world problems and will be conducted with industry, government, and academic partners....

講師

Avatar

Jeff Leek, PhD

Associate Professor, Biostatistics
Bloomberg School of Public Health
Avatar

Roger D. Peng, PhD

Associate Professor, Biostatistics
Bloomberg School of Public Health
Avatar

Brian Caffo, PhD

Professor, Biostatistics
Bloomberg School of Public Health

行業合作夥伴

Industry Partner Logo #0
Industry Partner Logo #1

關於 约翰霍普金斯大学

The mission of The Johns Hopkins University is to educate its students and cultivate their capacity for life-long learning, to foster independent and original research, and to bring the benefits of discovery to the world....

常見問題

  • 可以!点击您感兴趣的课程卡开始注册即可。注册并完成课程后,您可以获得可共享的证书,或者您也可以旁听该课程免费查看课程资料。如果您订阅的课程是某专项课程的一部分,系统会自动为您订阅完整的专项课程。访问您的学生面板,跟踪您的进度。

  • 此课程完全在线学习,无需到教室现场上课。您可以通过网络或移动设备随时随地访问课程视频、阅读材料和作业。

  • Time to completion can vary based on your schedule, but most learners are able to complete the Specialization in 3-6 months.

  • Each course in the Specialization is offered monthly.

  • Some programming experience (in any language) is recommended. We also suggest a working knowledge of mathematics up to algebra (neither calculus or linear algebra are required).

  • Begin by taking The Data Scientist's Toolbox and Introduction to R Programming, in order. The other courses may be taken in any order, and in parallel if desired.

  • Coursera courses and certificates don't carry university credit, though some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.

  • You’ll have a foundational understanding of the field and be prepared to continue studying data science.

  • Yes, you can access the course for free via www.coursera.org/jhu. This will allow you to explore the course, watch lectures, and participate in discussions for free. To be eligible to earn a certificate, you must either pay for enrollment or qualify for financial aid.

還有其他問題嗎?請訪問 學生幫助中心