課程信息
3.9
67 個評分
13 個審閱
專項課程

第 2 門課程(共 5 門),位於

100% 在線

100% 在線

立即開始,按照自己的計劃學習。
可靈活調整截止日期

可靈活調整截止日期

根據您的日程表重置截止日期。
高級

高級

完成時間(小時)

完成時間大約為41 小時

建議:6 weeks of study, 6-8 hours/week...
可選語言

英語(English)

字幕:英語(English)...

您將獲得的技能

GraphsHiveApache HiveApache Spark
專項課程

第 2 門課程(共 5 門),位於

100% 在線

100% 在線

立即開始,按照自己的計劃學習。
可靈活調整截止日期

可靈活調整截止日期

根據您的日程表重置截止日期。
高級

高級

完成時間(小時)

完成時間大約為41 小時

建議:6 weeks of study, 6-8 hours/week...
可選語言

英語(English)

字幕:英語(English)...

教學大綱 - 您將從這門課程中學到什麼

1
完成時間(小時)
完成時間為 12 分鐘

Welcome to the Second Course: Big Data Analysis

...
Reading
8 個視頻(共 12 分鐘)
Video8 個視頻
What is BigData Analysis?1分鐘
Tools For BigData Analysis1分鐘
Graph Data Analysis2分鐘
Meet Alexey Dral2分鐘
Meet Pavel Mezentsev分鐘
Meet Natalia Pritykovskaya分鐘
Meet Pavel Klemenkov分鐘
完成時間(小時)
完成時間為 3 小時

Big Data SQL: Hive

...
Reading
15 個視頻(共 105 分鐘), 1 個閱讀材料, 3 個測驗
Video15 個視頻
HTTP Web Service: Access Log Format4分鐘
Business Use Cases: Solution with Hive6分鐘
(optional) SQL: likbez10分鐘
Hive Data Definition Language (DDL)11分鐘
Hive Data Manipulation Language (DML)6分鐘
Hive Analytics: RegexSerDe, Views7分鐘
(optional) Regular Expressions, Likbez9分鐘
Hive Analytics: UDF, UDAF, UDTF7分鐘
Hive Streaming4分鐘
Hive PTF (Window Functions)5分鐘
Hive Optimization: Partitioning, Bucketing and Sampling8分鐘
Hive Map-Side Joins: Plain, Bucket, Sort-Merge5分鐘
Hive Optimization: Data Skew4分鐘
Hive Optimization: Row-Columnar File Formats, Compression8分鐘
Reading1 個閱讀材料
Slack Channel is the quickest way to get answers to your questions10分鐘
Quiz3 個練習
Hive: SQL over Hadoop MapReduce20分鐘
Hive Analytics with UDF and Streaming20分鐘
Hive final20分鐘
2
完成時間(小時)
完成時間為 7 小時

Big Data SQL: Hive (practice week)

...
Reading
3 個視頻(共 11 分鐘), 6 個閱讀材料, 5 個測驗
Video3 個視頻
How to Install Docker on Windows 7, 8, 104分鐘
How to submit your first Hadoop assignment3分鐘
Reading6 個閱讀材料
Assignments. General requirements10分鐘
Hive assignment. Intro and instructions10分鐘
Grading System: Instructions and Common Problems10分鐘
Docker Installation Guide10分鐘
Copy of Assignments. General requirements10分鐘
Copy of Assignments. General requirements10分鐘
3
完成時間(小時)
完成時間為 2 小時

Spark SQL and Spark Dataframe

...
Reading
14 個視頻(共 82 分鐘), 2 個測驗
Video14 個視頻
What is Pandas DataFrame and how to create it4分鐘
How to process a DataFrame as SQL4分鐘
Working with Hive4分鐘
Reading and Writing Files7分鐘
RDD vs. DF vs. SQL3分鐘
Projection and Filtering5分鐘
Functions5分鐘
Aggregates6分鐘
Join8分鐘
User Defined Functions8分鐘
Time Processing4分鐘
Window Functions7分鐘
Two-Dimensional Distributions4分鐘
Quiz2 個練習
Introducing DataFrame and SQL16分鐘
Spark SQL and Spark Dataframe18分鐘
4
完成時間(小時)
完成時間為 4 小時

Graph Analysis from Big Data Perspective

...
Reading
13 個視頻(共 83 分鐘), 5 個測驗
Video13 個視頻
Graph representation7分鐘
Counting common friends. Part I2分鐘
Counting common friends. Part II10分鐘
Counting common friends. Part III5分鐘
GraphFrames: Introduction6分鐘
Motif Finding: DSL6分鐘
Motif Finding: Counting Mutual Friends6分鐘
Motif Finding: Under The Hood. Part 114分鐘
Motif Finding: Under The Hood. Part 24分鐘
Triangles Count: Introduction3分鐘
Triangles Count: Edge Lists6分鐘
Triangles Count: GraphFrame6分鐘
Quiz4 個練習
Graph Representations10分鐘
Motif Finding18分鐘
Triangles Count8分鐘
Graph Analysis from Big Data Perspective20分鐘
3.9
職業方向

50%

完成這些課程後已開始新的職業生涯
工作福利

83%

通過此課程獲得實實在在的工作福利

熱門審閱

創建者 SMNov 13th 2018

content of the course is remarkable and the way they explained concepts is very lucid. I just want to give suggestions please give link to the data set they are using for illustrating the concepts.

創建者 SSFeb 3rd 2018

I wish I could give more rating than 5 :). Excellent course. Thanks so much for such an excellent course. All the instructors are great.

講師

Avatar

Pavel Klemenkov

Chief Data Scientist
NVIDIA
Avatar

Pavel Mezentsev

Senior Data Scientist
PulsePoint inc
Avatar

Alexey A. Dral

Founder and Chief Executive Officer
BigData Team

關於 Yandex

Yandex is a technology company that builds intelligent products and services powered by machine learning. Our goal is to help consumers and businesses better navigate the online and offline world....

關於 Big Data for Data Engineers 專項課程

This specialization is made for people working with data (either small or big). If you are a Data Analyst, Data Scientist, Data Engineer or Data Architect (or you want to become one) — don’t miss the opportunity to expand your knowledge and skills in the field of data engineering and data analysis on the large scale. In four concise courses you will learn the basics of Hadoop, MapReduce, Spark, methods of offline data processing for warehousing, real-time data processing and large-scale machine learning. And Capstone project for you to build and deploy your own Big Data Service (make your portfolio even more competitive). Over the course of the specialization, you will complete progressively harder programming assignments (mostly in Python). Make sure, you have some experience in it. This course will master your skills in designing solutions for common Big Data tasks: - creating batch and real-time data processing pipelines, - doing machine learning at scale, - deploying machine learning models into a production environment — and much more! Join some of best hands-on big data professionals, who know, their job inside-out, to learn the basics, as well as some tricks of the trade, from them. Special thanks to Prof. Mikhail Roytberg (APT dept., MIPT), Oleg Sukhoroslov (PhD, Senior Researcher, IITP RAS), Oleg Ivchenko (APT dept., MIPT), Pavel Akhtyamov (APT dept., MIPT), Vladimir Kuznetsov, Asya Roitberg, Eugene Baulin, Marina Sudarikova....
Big Data for Data Engineers

常見問題

  • 注册以便获得证书后,您将有权访问所有视频、测验和编程作业(如果适用)。只有在您的班次开课之后,才可以提交和审阅同学互评作业。如果您选择在不购买的情况下浏览课程,可能无法访问某些作业。

  • 您注册课程后,将有权访问专项课程中的所有课程,并且会在完成课程后获得证书。您的电子课程证书将添加到您的成就页中,您可以通过该页打印您的课程证书或将其添加到您的领英档案中。如果您只想阅读和查看课程内容,可以免费旁听课程。

還有其他問題嗎?請訪問 學生幫助中心