Get started with custom lists to organize and share courses.

Sign up

Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Core Concepts in Data Analysis

Higher School of Economics via Coursera

8 Reviews 374 students interested

Taken this course? Share your experience with other students. Write review

Overview

Sign up to Coursera courses for free Learn how

The course was created with the support of Sberbank
The course was created with the support of Sberbank

This is an unconventional course in modern Data Analysis, Machine Learning and Data Mining. Its contents are heavily influenced by the idea that data analysis should help in enhancing and augmenting knowledge of the domain as represented by the concepts and statements of relation between them. According to this view, two main pathways for data analysis are summarization, for developing and augmenting concepts, and correlation, for enhancing and establishing relations. The term summarization embraces here both simple summaries like totals and means and more complex summaries: the principal components of a set of features and cluster structures in a set of entities. Similarly, correlation covers both bivariate and multivariate relations between input and target features including Bayes classifiers.

The view of the data as a subject of computational data analysis that is adhered to here has emerged quite recently. Typically, in sciences and in statistics, a problem comes first, and then the investigator turns to data that might be useful in advancing towards a solution. Yet nowadays the situation is reversed frequently, especially with the advent of Big Data. Typical questions then are: Take a look at this data set - what sense can be made out of it? – Is there any structure in the data set? Can these features help in predicting those? This is more reminiscent to a traveler’s view of the world rather than that of a scientist. The scientist sits at his desk, gets reproducible signals from the universe and tries to accommodate them into a great model of the universe. The traveler deals with what come on their way – here is the data analysis niche.  A textbook by the instructor along these lines has been published by Springer-London in 2011: “Core concepts in data analysis is clean and devoid of any fuzziness. The author presents his theses with a refreshing clarity seldom seen in a text of this sophistication. … To single out just one of the text’s many successes: I doubt readers will ever encounter again such a detailed and excellent treatment of correlation concepts. (Computing Reviews of ACM, June 2011).”

Syllabus

Week 1. Intro: Examples of data and data analysis problems; visualization.       

                     

Week 2. 1D analysis. Feature scales. Histogram. Two common types of histograms: Gaussian and Power Law. Central values. Minkowski distance and data recovery view. Validation with Bootstrap.           

                       

Week 3-4. 2D analysis cases:

(Both quantitative: Scatter-plot, linear regression, correlation and determinacy coefficients: meaning and properties. Both nominal: Contingency table, Quetelet index, Pearson chi-squared coefficient, its double meaning and visualization).                                                              

Week 5-6. Learning multivariate correlations

(Bayes approach and Naïve Bayes classifier with a Bag-of-words text model; Decision trees and criteria for building them.)                      

                       

Week 7. Principal components (PCA) and SVD

(SVD model behind PCA: student marks as the product of subject factor scores and subject loadings. Application to deriving a hidden underlying factor. Data visualization with PCA. Conventional PCA and data normalization issues.)

 

Week 8. Clustering with k-means

(K-Means iterations and K-Means features   

K-Means criterion. Anomalous clusters and intelligent K-Means.)

Taught by

Boris Mirkin

Tags

Help Center

Most commonly asked questions about Coursera Coursera

Reviews for Coursera's Core Concepts in Data Analysis
3.3 Based on 8 reviews

  • 5 stars 38%
  • 4 star 13%
  • 3 star 13%
  • 2 star 13%
  • 1 stars 25%

Did you take this course? Share your experience with other students.

Write a review
  • 1
Anonymous
3.0 5 years ago
Bart completed this course.
I'm dropping this course after 5 weeks.

There are good bits about this course, but you can probably read about those in other reviews. I'll focus on the bad bits.

First of all, it does not make sense to follow this kind of quality course after the high quality courses "Data Analysis and Statistical Inference" on coursera and MIT's "Introduction to Probability" on Edx which cover similar topics, but in much greater depth and with much more rigour.

I'm sure Boris Mirkin is very knowledgeable, but i.m.h.o. he lacks the educational skills.…
2 people found
this review helpful
Was this review helpful to you? Yes
Gaetano P
5.0 5 years ago
by Gaetano completed this course, spending 5 hours a week on it and found the course difficulty to be medium.
It was really a pleasure to take this course so many years after my university degree on subjects that I went through in those years. I was really interested in the new approaches regarding Principal Components Analysis and the K-means clustering as I studied these subjects with the classical approaches. Maybe something could be improved in the english statements of tests in order to avoid misunderstandings but all the staff made a great effort to meet the requests for clarification.
2 people found
this review helpful
Was this review helpful to you? Yes
Robert S
1.0 3 years ago
by Robert is taking this course right now, spending 4 hours a week on it and found the course difficulty to be medium.
The instructor - however knowledgeable he might be - is terrible. He often rambles - about unimportant anecdotes - and doesn't seem to have a clear plan about what to say. There are many other free online courses covering similar subjects, so check out some of them instead of wasting your time on this one.
Was this review helpful to you? Yes
Karolos G
5.0 2 years ago
by Karolos completed this course, spending 5 hours a week on it and found the course difficulty to be hard.
I was impressed by two things:

1. the clarity of the instructor, even when discussing difficult subjects

2.the explanation of the common route of seemingly different concepts

Great if you want to get deep knowledge which is also immediately applicable
Was this review helpful to you? Yes
Emma E
1.0 5 years ago
Emma is taking this course right now.
I couldn't get the video lectures to work on my laptop - it kept crashing. I will probably try again in the future
0 person found
this review helpful
Was this review helpful to you? Yes
Sajan S
5.0 3 years ago
by Sajan completed this course.
Was this review helpful to you? Yes
Rafael P
2.0 4 years ago
Rafael completed this course.
Was this review helpful to you? Yes
Mark B
4.0 3 years ago
by Mark completed this course.
Was this review helpful to you? Yes
  • 1

Class Central

Get personalized course recommendations, track subjects and courses with reminders, and more.

Sign up for free

Never stop learning Never Stop Learning!

Get personalized course recommendations, track subjects and courses with reminders, and more.