Statistical Data Visualization with Seaborn From UST

4.6
157 個評分
提供方
Coursera Project Network
7,948 人已註冊
在此免費指導項目中,您將:

Produce and customize various chart types with Seaborn

Apply feature selection and feature extraction methods with scikit-learn

Build a boosted decision tree classifier with XGBoost

在面試中展現此實踐經驗

Clock1.5 hours
Intermediate中級
Cloud無需下載
Video分屏視頻
Comment Dots英語(English)
Laptop僅限桌面

Welcome to this Guided Project on Statistical Data Visualization with Seaborn, From UST. For more than 20 years, UST has worked side by side with the world’s best companies to make a real impact through transformation. Powered by technology, inspired by people and led by their purpose, they partner with clients from design to operation. With this Guided Project from UST, you can quickly build in-demand job skills and expand your career opportunities in the Data Science field. Producing visualizations is an important first step in exploring and analyzing real-world data sets. As such, visualization is an indispensable method in any data scientist's toolbox as well as a powerful tool to identify problems in analyses and for illustrating results. In this project, we will employ the statistical data visualization library, Seaborn, to discover and explore the relationships in the Breast Cancer Wisconsin (Diagnostic) data set. Using the exploratory data analysis (EDA) results from the Breast Cancer Diagnosis – Exploratory Data Analysis Guided Project, you will practice dropping correlated features, implement feature selection and utilize several feature extraction methods including; feature selection with correlation, univariate feature selection, recursive feature elimination, principal component analysis (PCA) and tree based feature selection methods. Lastly, we will build a boosted decision tree classifier with XGBoost to classify tumors as either malignant or benign. By the end of this Guided Project, you should feel more confident about working with data, creating visualizations for data analysis, and have practiced several methods which apply to a Data Scientist’s role. Let's get started!

必備條件

Some experience in the basic programming commands of Python and a general understanding of machine learning.

您要培養的技能

  • Data Science
  • Machine Learning
  • Python Programming
  • Seaborn
  • Data Visualization (DataViz)

分步進行學習

在與您的工作區一起在分屏中播放的視頻中,您的授課教師將指導您完成每個步驟:

  1. Project Overview

  2. Importing Libraries and Data

  3. Dropping Correlated Columns from Feature List

  4. Classification using XGBoost (minimal feature selection)

  5. Univariate Feature Selection

  6. Recursive Feature Elimination with Cross-Validation

  7. Plot CV Scores vs Number of Features Selected

  8. Feature Extraction using Principal Component Analysis

指導項目工作原理

您的工作空間就是瀏覽器中的雲桌面,無需下載

在分屏視頻中,您的授課教師會為您提供分步指導

授課教師

審閱

來自STATISTICAL DATA VISUALIZATION WITH SEABORN FROM UST的熱門評論

查看所有評論

常見問題

常見問題

還有其他問題嗎?請訪問 學生幫助中心