Learner Reviews & Feedback for Building Machine Learning Pipelines in PySpark MLlib by Coursera Project Network

4.3
49 ratings
8 reviews

About the Course

By the end of this project, you will learn how to create machine learning pipelines using Python and Spark, free, open-source programs that you can download. You will learn how to load your dataset in Spark and perform basic cleaning techniques, such as removing columns with a high proportion of missing values and removing rows with missing values. You will then create a machine learning pipeline with a random forest regression model. You will use cross-validation and parameter tuning to select the best model from the pipeline. Lastly, you will evaluate your model’s performance using various metrics. A pipeline in Spark combines multiple execution steps in the order of their execution. So rather than executing the steps individually, you can put them in a pipeline to streamline the machine learning process. You can save this pipeline, share it with your colleagues, and load it back again effortlessly. Note: You should have a Gmail account, which you will use to sign in to Google Colab. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions....
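
To make the pipeline idea in the overview concrete, here is a minimal plain-Python sketch of stages that run in order. It is not the `pyspark.ml` API and not the course's code: `ToyPipeline`, `DropMissingRows`, and `MeanPredictor` are hypothetical names, and the mean predictor is only a stand-in for a real model such as a random forest regressor.

```python
from statistics import mean


class DropMissingRows:
    """Cleaning stage: removes rows that contain a missing (None) value."""

    def fit(self, rows):
        return self  # nothing to learn for this stage

    def transform(self, rows):
        return [r for r in rows if None not in r.values()]


class MeanPredictor:
    """Stand-in model (hypothetical): predicts the mean of the training label."""

    def __init__(self, label):
        self.label = label
        self.mean_ = None

    def fit(self, rows):
        self.mean_ = mean(r[self.label] for r in rows)
        return self

    def transform(self, rows):
        return [{**r, "prediction": self.mean_} for r in rows]


class ToyPipeline:
    """Runs its stages in the order given, feeding each output to the next stage."""

    def __init__(self, stages):
        self.stages = stages

    def fit(self, rows):
        for stage in self.stages:
            rows = stage.fit(rows).transform(rows)
        return self

    def transform(self, rows):
        for stage in self.stages:
            rows = stage.transform(rows)
        return rows


# Made-up example data; one row has a missing label.
data = [
    {"x": 1.0, "y": 10.0},
    {"x": 2.0, "y": None},
    {"x": 3.0, "y": 20.0},
]

pipeline = ToyPipeline([DropMissingRows(), MeanPredictor(label="y")]).fit(data)
predictions = pipeline.transform(data)
```

Because the cleaning step and the model live in one object, a single `fit` call and a single `transform` call replace running each step by hand, which is the convenience the overview describes.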

Top reviews

1 - 8 of 8 Reviews for Building Machine Learning Pipelines in PySpark MLlib

By Andrés M

May 7, 2021

I never write reviews, but ... please DON'T TAKE THIS PROJECT. Terrible project: no theoretical explanation, no explanation of functions, no complete project using pipelines (only 2 lines). The installation of the libraries is not correct; I spent about two days trying to install them. Poor English, zero explanations in general. It's a shame that people with a high level of education are trying to scam people. I regret 100% paying for this. I DID NOT LEARN ANYTHING.

By Jeremy S

January 26, 2022

This project gives a good overview of the basic commands of PySpark, as well as a pretty decent glimpse of the functions and methods in MLlib. It will take you through the methods required to split your training/validation/testing data, train the model, cross-validate it, and evaluate it. There is also a brief section on cleaning data, though the majority of the lesson is focused on the random forest pipeline. Note that this project will not teach you machine learning basics or Random Forest techniques whatsoever. Similarly, this project will not teach you the basics of databases or SQL. If your sole intention is to learn PySpark's MLlib, then this course is good, but don't expect more. Finally, the project is hosted on Coursera's Python notebook service, Rhyme. This course will not show you how to set up PySpark or any of the supporting installations or environment variables on your own computer. Similarly, you will not be able to easily download the dataset used in this project. Even "downloading" the dataset only downloads it to the virtual machine running Rhyme, though the VM has access to the internet (hint hint). For me, this project was worth the $10 USD I paid.

By Aruparna M

February 21, 2021

The dataset provided was wrong. It was not the exact one that was demonstrated by the instructor!

By 19BST035-HARI K R B B C

September 25, 2020

This course is very useful. Its big advantage is that it is short. Read short, learn big.

By Cheikh B

March 27, 2021

Awesome project and very good explanation. Thank you for this project.

By Leonardo E

November 21, 2020

pretty useful, actually.

By MD R I

October 5, 2020

helpful project

By Sankirna J

May 2, 2022

Good project to get you started with some Spark code. It was very short and hands-on. The instructor doesn't really talk about the library or how things work. We are expected to follow the guidelines in the companion video and basically replicate the code the instructor provides, which is very clear and concise.

I would recommend this project as a quick hands-on exercise, but I did not feel like I learned a lot from it because of its very simple, guided nature.