Chevron Left
Big Data Essentials: HDFS, MapReduce and Spark RDD に戻る

Yandex による Big Data Essentials: HDFS, MapReduce and Spark RDD の受講者のレビューおよびフィードバック

4.0
421件の評価
115件のレビュー

コースについて

Have you ever heard about such technologies as HDFS, MapReduce, Spark? Always wanted to learn these new tools but missed concise starting material? Don’t miss this course either! In this 6-week course you will: - learn some basic technologies of the modern Big Data landscape, namely: HDFS, MapReduce and Spark; - be guided both through systems internals and their applications; - learn about distributed file systems, why they exist and what function they serve; - grasp the MapReduce framework, a workhorse for many modern Big Data applications; - apply the framework to process texts and solve sample business cases; - learn about Spark, the next-generation computational framework; - build a strong understanding of Spark basic concepts; - develop skills to apply these tools to creating solutions in finance, social networks, telecommunications and many other fields. Your learning experience will be as close to real life as possible with the chance to evaluate your practical assignments on a real cluster. No mocking, a friendly considerate atmosphere to make the process of your learning smooth and enjoyable. Get ready to work with real datasets alongside with real masters! Special thanks to: - Prof. Mikhail Roytberg, APT dept., MIPT, who was the initial reviewer of the project, the supervisor and mentor of half of the BigData team. He was the one, who helped to get this show on the road. - Oleg Sukhoroslov (PhD, Senior Researcher at IITP RAS), who has been teaching MapReduce, Hadoop and friends since 2008. Now he is leading the infrastructure team. - Oleg Ivchenko (PhD student APT dept., MIPT), Pavel Akhtyamov (MSc. student at APT dept., MIPT) and Vladimir Kuznetsov (Assistant at P.G. Demidov Yaroslavl State University), superbrains who have developed and now maintain the infrastructure used for practical assignments in this course. - Asya Roitberg, Eugene Baulin, Marina Sudarikova. These people never sleep to babysit this course day and night, to make your learning experience productive, smooth and exciting....

人気のレビュー

YH

Nov 22, 2018

Everything in this course is new to me, but it provides me with many practice so I can gradually get familiar with all these new stuff. I find it a bit challenging, but overall it's quite good.

SH

May 10, 2019

The course takes you from basic level , step level .But It is quite fast for beginners , you may need pause video in between and try to understand the concept.

フィルター:

Big Data Essentials: HDFS, MapReduce and Spark RDD: 51 - 75 / 112 レビュー

by Garvish

Dec 27, 2017

This is really Intermediate level course.

by navneet k

Nov 28, 2018

Awesome content...great learning ...:)

by ANTUL K

May 27, 2019

Great Content if you are a beginner.

by shubham m

Aug 20, 2019

Very Nice..............Intraction

by kebize m

Mar 21, 2019

<Good learning >

by Shaik M

Jan 29, 2020

good knowledge

by Aman A

Jul 31, 2019

Great Course.

by Rodrigo R A d S

Feb 05, 2018

Excellent!

by Alok K

Sep 16, 2019

Very Good

by Anshika M

Jun 19, 2019

EXCELLENT

by Marwen B A

Nov 04, 2019

great

by Minh T

Aug 24, 2019

Great

by swapnil c

Dec 29, 2018

none

by Mohamed H

Dec 29, 2019

The course is very useful and gives you the basics you need about HDFS, Map-Reduce in python (there is no java in this course) and pyspark. The assignments are straightforward, however you may face issues in the docker and in the grader system. The cons of this course is that sometimes information is given in a fast pace which somehow can make you get confused and unable to fully digest the material. Also there is no interaction at all from the instructors, it'll be nice if they can keep up with students' questions and issues in the future!

by Dorofei N

Feb 14, 2020

Потратил больше времени на то, чтобы Grader правильно принял решения, чем времени на решение задачи. Потратить 3-4 часа на решение исходной задачи и потратить 10 часов (включая форум и Slack) на то чтобы ответ правильно принять. Особенно на задаче с Твиттер датасетом, ругается на количество редусеров, но оказывается, надо было логи yarn тоже выводить.

Хорошо было бы добавить еще одну проверку, которая проверяет выводятся ли логи yarn и сообщать, что его нет.

Если бы не эта проблема, поставил бы 5 звезд.

by Павел С

Dec 11, 2018

I think students could choose MapReduce or Spark. And about shortest path task. Provided by authors code runs out of memory while checking on cluster. After a lot of time playing with spark paramets and cache/persist i found solution without calculating all distances, but... Also there was no information about spark executors parameters on course...

Simple hint could save a lot of stupidly wasted time.

But it's not major, anyway thanks!

by Кряжевских С В

Oct 07, 2019

Practice work in this course is divided in two part. First, you try to solve an assignment into your home Docker environment. It's really interesting to do it in spite of the assignments is not very clear. Second, you try to put the result into the course's grader system. For me, Grader it's like a Major Payne. You will get an amazing experience to work with production cluster through not well suited environment.

by Marco G

Dec 05, 2018

Interesting, useful, informative, accessible (and sometimes funny!) lectures.

Stimulating assignments.

Fast responses from instructors/mentors.

Unfortunately, I often spent more time trying to get my assignments to pass the automatic grader than on solving them. This made the course a bit frustrating at times.

by Terry A

Mar 01, 2019

Good general overview, start to the subject. Frustrated at consistent issues with development environment and/or ability to debug. Responses to questions and mentor assistance is seriously lacking.

by Waldemar D

May 19, 2019

good course, covering a lot of foundations for Big Data and for Hadoop/Spark. Also one of the few that focus on Data Engineering perspective rather than Data Science. Learned a lot here!

by Gregory R

Apr 27, 2018

Great course! Please, follow up with discussion boards more. Otherwise, happy I took it.

Also, looking forward to the entire specialization ready, like course #4 about real Time Streaming.

by DIEGO A R

Aug 02, 2018

Excelente curso, falta más realmentación por parte de los profesores, pero en general aunque el contenido es Denso y se requieren más horas de lo estipulado en el curso, es muy bueno.

by Taras P

May 12, 2018

Materials are good, but there was a lot of problem with assignment clear understanding and infrastructure. Also would like to pass this course on Scala.

by Tomiwa k

Nov 08, 2019

the curriculum is fine, I learnt new things. the authors abandunded this course, no maintenance for the grading system. this shows be fixed

by Simon V L

Jan 31, 2018

The content of the course is really good. THe assignments should be made a lot clearer and the jupyter grading tool is full of bugs.