Data-driven Astronomy

The University of Sydney via Coursera

Go to class Write review

Details

Go to class

Provider

Coursera
Pricing

Free Online Course (Audit)
Languages

English
Certificate

Paid Certificate Available
Duration & workload

23 hours 56 minutes
Sessions

On-Demand
Level

Intermediate
Subtitles

Arabic, French, Portuguese, Italian, German, Russian, English, Spanish, Kazakh, Hindi, Swedish, Korean, Greek, Chinese, Thai, Ukrainian, Japanese, Indonesian, Polish, Dutch, Turkish, Hungarian, Bengali, Pashto, Urdu, Azerbaijani, Farsi

Found in

Overview

Class Central Tips

Science is undergoing a data explosion, and astronomy is leading the way. Modern telescopes produce terabytes of data per observation, and the simulations required to model our observable Universe push supercomputers to their limits. To analyse this data scientists need to be able to think computationally to solve problems. In this course you will investigate the challenges of working with large datasets: how to implement algorithms that work; how to use databases to manage your data; and how to learn from your data with machine learning tools. The focus is on practical skills - all the activities will be done in Python 3, a modern programming language used throughout astronomy. Regardless of whether you’re already a scientist, studying to become one, or just interested in how modern astronomy works ‘under the bonnet’, this course will help you explore astronomy: from planets, to pulsars to black holes. Course outline: Week 1: Thinking about data - Principles of computational thinking - Discovering pulsars in radio images Week 2: Big data makes things slow - How to work out the time complexity of algorithms - Exploring the black holes at the centres of massive galaxies Week 3: Querying data using SQL - How to use databases to analyse your data - Investigating exoplanets in other solar systems Week 4: Managing your data - How to set up databases to manage your data - Exploring the lifecycle of stars in our Galaxy Week 5: Learning from data: regression - Using machine learning tools to investigate your data - Calculating the redshifts of distant galaxies Week 6: Learning from data: classification - Using machine learning tools to classify your data - Investigating different types of galaxies Each week will also have an interview with a data-driven astronomy expert. Note that some knowledge of Python is assumed, including variables, control structures, data structures, functions, and working with files.

Syllabus

Thinking about data

This module introduces the idea of computational thinking, and how big data can make simple problems quite challenging to solve. We use the example of calculating the median and mean stack of a set of radio astronomy images to illustrate some of the issues you encounter when working with large datasets.

Big data makes things slow

In this module we explore the idea of scaling your code. Some algorithms scale well as your dataset increases, but others become impossibly slow. We look at some of the reason for this, and use the example of cross-matching astronomical catalogues to demonstrate what kind of improvements you can make.

Querying your data

Most large astronomy projects use databases to manage their data. In this module we introduce SQL - the language most commonly used to query databases. We use SQL to query the NASA Exoplanet database and investigate the habitability of planets in other solar systems.

Managing your data

This module introduces the basic principles of setting up databases. We look at how to set up new tables, and then how to combine Python and SQL to get the best out of both approaches. We use these tools to explore the life of stars in a stellar cluster.

Learning from data: regression

This module introduces the idea of machine learning. We look at standard methodology for running machine learning experiments, and then apply this to calculating redshifts of distant galaxies using decision trees for regression.

Learning from data: classification

In this final module we explore the limitations of decision tree classifiers. We then look at ensemble classifiers, using the random forest algorithm to classify images of galaxies into different types.

Taught by

Tara Murphy and Simon Murphy

Reviews

5.0 rating, based on 2 Class Central reviews

4.8 rating at Coursera based on 1336 ratings

Start your review of Data-driven Astronomy

Arnaud Dion

This is real astronomy ! A fantastic approach to current research subject. If you want to learn astronomy from the ground up, take an introductory course before this one. It starts directly to studying pulsars statistics, and most important, how to detect and study it. All the worshops are in Python, using a web notebook. But it's neither an introductory course on Python. So, it' better to have a minimum knowledge on programming and Python language. But, if you have the prequisites, and are interested to do computation for astronomy using large datasets, this is the course. The techniques can also been extended to other computational intensive domains.
Kristina Šekrst

This is a wonderful cross-section of machine learning advancement and astrophysics. With no background in astronomy or with no background in programming, it is easy to stick around. And if you have knowledge of both, then you'll certainly enjoy your stay since there are advanced assignments as well. The course uses Python and Grok interactive learning tool, and you'll tackle a bit with SQL and machine learning. The instructors were pretty much omnipresent in the forums and helped out everybody, which is praiseworthy. I had fun while learning new things, and I hope to see a sequel to such a novel course.

Go to class

BloomTech’s Downfall: A Long Time Coming

Most common

Popular subjects

Popular courses

Data-driven Astronomy

Overview

Syllabus

Taught by

Tags

Reviews

BloomTech’s Downfall: A Long Time Coming

Taught by

Tags

Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud

IBM Data Science

Machine Learning Foundations for Product Managers

AI Skills for Engineers: Data Engineering and Data Pipelines

Data Science Interview Prep

IBM Data Science

10 Best Applied AI & ML Courses

10 Best Free SQL Courses

1700 Coursera Courses That Are Still Completely FREE

250 Top FREE Coursera Courses of All Time

Massive List of MOOC-based Microcredentials

Never Stop Learning.