Learner Reviews and Feedback for Big Data Essentials: HDFS, MapReduce and Spark RDD by Yandex

4.0
541 ratings
144 reviews

About the Course

Have you ever heard about such technologies as HDFS, MapReduce, Spark? Always wanted to learn these new tools but missed concise starting material? Don’t miss this course either! In this 6-week course you will:

- learn some basic technologies of the modern Big Data landscape, namely: HDFS, MapReduce and Spark;
- be guided both through systems internals and their applications;
- learn about distributed file systems, why they exist and what function they serve;
- grasp the MapReduce framework, a workhorse for many modern Big Data applications;
- apply the framework to process texts and solve sample business cases;
- learn about Spark, the next-generation computational framework;
- build a strong understanding of Spark basic concepts;
- develop skills to apply these tools to creating solutions in finance, social networks, telecommunications and many other fields.

Your learning experience will be as close to real life as possible with the chance to evaluate your practical assignments on a real cluster. No mocking, a friendly considerate atmosphere to make the process of your learning smooth and enjoyable. Get ready to work with real datasets alongside with real masters!

Special thanks to:

- Prof. Mikhail Roytberg, APT dept., MIPT, who was the initial reviewer of the project, the supervisor and mentor of half of the BigData team. He was the one, who helped to get this show on the road.
- Oleg Sukhoroslov (PhD, Senior Researcher at IITP RAS), who has been teaching MapReduce, Hadoop and friends since 2008. Now he is leading the infrastructure team.
- Oleg Ivchenko (PhD student APT dept., MIPT), Pavel Akhtyamov (MSc. student at APT dept., MIPT) and Vladimir Kuznetsov (Assistant at P.G. Demidov Yaroslavl State University), superbrains who have developed and now maintain the infrastructure used for practical assignments in this course.
- Asya Roitberg, Eugene Baulin, Marina Sudarikova. These people never sleep to babysit this course day and night, to make your learning experience productive, smooth and exciting....

Top reviews

YH
Nov 21, 2018

Everything in this course is new to me, but it provides plenty of practice, so I can gradually get familiar with all this new material. I find it a bit challenging, but overall it's quite good.

SH
May 9, 2019

The course takes you from the basic level, step by step. But it is quite fast for beginners; you may need to pause the video in between and try to understand the concept.

76 - 100 of 141 Reviews for Big Data Essentials: HDFS, MapReduce and Spark RDD

By Павел С

Dec 11, 2018

I think students could choose MapReduce or Spark. And about the shortest path task: the code provided by the authors runs out of memory when checked on the cluster. After a lot of time playing with Spark parameters and cache/persist I found a solution without calculating all distances, but... Also, there was no information about Spark executor parameters in the course...

A simple hint could save a lot of stupidly wasted time.

But it's not major; anyway, thanks!

By Кряжевских С В

Oct 7, 2019

Practice work in this course is divided into two parts. First, you try to solve an assignment in your home Docker environment. It's really interesting to do, even though the assignments are not very clear. Second, you try to submit the result to the course's grader system. For me, the grader is like Major Payne. You will get an amazing experience of working with a production cluster through a poorly suited environment.

By Marco G

Dec 5, 2018

Interesting, useful, informative, accessible (and sometimes funny!) lectures.

Stimulating assignments.

Fast responses from instructors/mentors.

Unfortunately, I often spent more time trying to get my assignments to pass the automatic grader than on solving them. This made the course a bit frustrating at times.

By Oliver P

Mar 31, 2020

Overall, a sound introduction to the topic. For the first time, I understood these technologies. From time to time the tutorials and introduction videos were a bit too quick and too sketchy to understand the content properly.

By Terry A

Feb 28, 2019

Good general overview and start to the subject. Frustrated by consistent issues with the development environment and/or the ability to debug. Responses to questions and mentor assistance are seriously lacking.

By Waldemar D

May 19, 2019

Good course, covering a lot of foundations for Big Data and for Hadoop/Spark. Also one of the few that focus on the Data Engineering perspective rather than Data Science. Learned a lot here!

By Gregory R

Apr 27, 2018

Great course! Please follow up on the discussion boards more. Otherwise, happy I took it.

Also, looking forward to the entire specialization being ready, like course #4 about real-time streaming.

By DIEGO A R

Aug 1, 2018

Excellent course. It lacks feedback from the instructors, but overall, even though the content is dense and requires more hours than the course states, it is very good.

By Alois T

Nov 5, 2020

The course is pretty OK, but beware of having to bend to the constraints of the automatic corrector. You can spend many hours just "fixing" your code despite having the correct result.

By Taras P

May 12, 2018

Materials are good, but there were a lot of problems with understanding the assignments clearly and with the infrastructure. I would also like to take this course in Scala.

By Mahendra A

Mar 16, 2020

Overall the course was good and informative. Sometimes it feels a little bit tough to grasp, maybe because it's an entirely different domain for me.

By Tomiwa k

Nov 8, 2019

The curriculum is fine; I learnt new things. The authors abandoned this course, with no maintenance for the grading system. This should be fixed.

By Simon V L

Jan 31, 2018

The content of the course is really good. The assignments should be made a lot clearer, and the Jupyter grading tool is full of bugs.

By Casper Y

Feb 18, 2018

The practice exercises are practical and useful. However, there is an initial learning curve to get used to the grading tools.

By Abijith K

Feb 29, 2020

Expected more depth in Spark architecture, but it was kept at a high level.

The course material on HDFS was really good.

By David Z

Feb 17, 2019

The content was a nice introductory course. The only thing that could be better is the grading system.

By Vladimir

Dec 2, 2017

Good course, but the description of practical tasks is not always clear.

By Kirill L

Jan 31, 2018

Only four because of the grading tool. The content is very interesting.

By Shreeharsha G

Jun 27, 2018

Challenging!! But it needs some more help on Slack and a more active community.

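By Rain

May 23, 2019

The course content is actually quite well designed; together with the materials, I gained a good understanding of the basic design ideas behind MapReduce and HDFS. As for the course's programming exercises, I'd rather not comment.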

By Alexander K

Mar 5, 2018

Requires intermediate skills and ability to work on your own.

By Bo T

Sep 6, 2018

The assignment cannot be submitted correctly. Really disappointed!

By SAI V K

Oct 29, 2019

The course is good, but the grader doesn't work properly.

By yunwoo n

May 16, 2018

Good course, but the grading system has some trouble.

By Martin T

Feb 5, 2018

Lectures are very good and I learned a lot.