Coursera: Data-driven Astronomy

 with  Tara Murphy and Simon Murphy
Science is undergoing a data explosion, and astronomy is leading the way. Modern telescopes produce terabytes of data per observation, and the simulations required to model our observable Universe push supercomputers to their limits. To analyse this data scientists need to be able to think computationally to solve problems. In this course you will investigate the challenges of working with large datasets: how to implement algorithms that work; how to use databases to manage your data; and how to learn from your data with machine learning tools. The focus is on practical skills - all the activities will be done in Python 3, a modern programming language used throughout astronomy.

Regardless of whether you’re already a scientist, studying to become one, or just interested in how modern astronomy works ‘under the bonnet’, this course will help you explore astronomy: from planets, to pulsars to black holes.

Course outline:
Week 1: Thinking about data
- Principles of computational thinking
- Discovering pulsars in radio images

Week 2: Big data makes things slow
- How to work out the time complexity of algorithms
- Exploring the black holes at the centres of massive galaxies

Week 3: Querying data using SQL
- How to use databases to analyse your data
- Investigating exoplanets in other solar systems

Week 4: Managing your data
- How to set up databases to manage your data
- Exploring the lifecycle of stars in our Galaxy

Week 5: Learning from data: regression
- Using machine learning tools to investigate your data
- Calculating the redshifts of distant galaxies

Week 6: Learning from data: classification
- Using machine learning tools to classify your data
- Investigating different types of galaxies

Each week will also have an interview with a data-driven astronomy expert.

Note that some knowledge of Python is assumed, including variables, control structures, data structures, functions, and working with files.


Thinking about data
This module introduces the idea of computational thinking, and how big data can make simple problems quite challenging to solve. We use the example of calculating the median and mean stack of a set of radio astronomy images to illustrate some of the issues you encounter when working with large datasets.

Big data makes things slow
In this module we explore the idea of scaling your code. Some algorithms scale well as your dataset increases, but others become impossibly slow. We look at some of the reason for this, and use the example of cross-matching astronomical catalogues to demonstrate what kind of improvements you can make.

Querying your data
Most large astronomy projects use databases to manage their data. In this module we introduce SQL - the language most commonly used to query databases. We use SQL to query the NASA Exoplanet database and investigate the habitability of planets in other solar systems.

Managing your data
This module introduces the basic principles of setting up databases. We look at how to set up new tables, and then how to combine Python and SQL to get the best out of both approaches. We use these tools to explore the life of stars in a stellar cluster.

Learning from data: regression
This module introduces the idea of machine learning. We look at standard methodology for running machine learning experiments, and then apply this to calculating redshifts of distant galaxies using decision trees for regression.

Learning from data: classification
In this final module we explore the limitations of decision tree classifiers. We then look at ensemble classifiers, using the random forest algorithm to classify images of galaxies into different types.

0 Student
Cost Free Online Course (Audit)
Pace Upcoming
Subject Astronomy
Provider Coursera
Language English
Certificates Paid Certificate Available
Calendar 6 weeks long
Sign up for free? Learn how
+ Add to My Courses
Learn Data Analysis

Learn to become a Data Analyst. Job offer guaranteed or get a full refund.

Become a Data Scientist

Learn Python & R at your own pace. Start now for free!

FAQ View All
What are MOOCs?
MOOCs stand for Massive Open Online Courses. These are free online courses from universities around the world (eg. Stanford Harvard MIT) offered to anyone with an internet connection.
How do I register?
To register for a course, click on "Go to Class" button on the course page. This will take you to the providers website where you can register for the course.
How do these MOOCs or free online courses work?
MOOCs are designed for an online audience, teaching primarily through short (5-20 min.) pre recorded video lectures, that you watch on weekly schedule when convenient for you.  They also have student discussion forums, homework/assignments, and online quizzes or exams.

0 reviews for Coursera's Data-driven Astronomy

Write a review

Write a review

How would you rate this course? *
How much of the course did you finish? *
Create Review