Introduction to Data Science

Exploratory data analysis and interactive visualisation, unsupervised learning, dimensionality reduction and feature extraction, supervised learning and more.


  • Level: beginner
  • Duration: 2-day course
  • Delivered: in-house

What you will learn

The course is interactive and hands-on: you will learn by working through concrete problems with a real dataset. You will be taught by academic and industry experts who share their practical experience of the field.

  • Preprocessing (scaling, log transformations, imputation, one-hot encoding)
  • Exploratory data analysis and interactive visualisation
  • Unsupervised learning (k-means clustering, hierarchical clustering)
  • Dimensionality reduction and feature extraction (PCA, t-SNE)
  • Supervised learning (k-nearest neighbours, decision trees)

Languages and libraries

  • Python programming language
  • NumPy and pandas for data manipulation
  • Scikit-learn for machine learning algorithms
  • Matplotlib and seaborn for data visualisation

OUTLINE

Day One

Data Science Essentials

Session 1

Introduction to Data Science

  • Overview of Data Science and Machine Learning
  • Supervised vs. Unsupervised Learning
  • Industrial Applications

Session 2

Working with real-world data

  • Loading and manipulating data in Python with pandas
  • Data cleaning and pre-processing
  • Exploratory data analysis (EDA)
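
To give a flavour of this session, here is a minimal sketch of typical cleaning and pre-processing steps with pandas (imputation, a log transformation, scaling and one-hot encoding). The toy table and its column names (income, city) are hypothetical stand-ins for a real dataset, not the course material itself.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data standing in for a real dataset
df = pd.DataFrame({
    "income": [30000.0, 45000.0, np.nan, 120000.0],
    "city": ["Cambridge", "London", "London", "Leeds"],
})

# Imputation: fill the missing income with the column median
df["income"] = df["income"].fillna(df["income"].median())

# Log transformation: compress the long right tail of the income column
df["log_income"] = np.log1p(df["income"])

# Scaling: standardise to zero mean and unit variance
mean = df["log_income"].mean()
std = df["log_income"].std(ddof=0)
df["log_income_scaled"] = (df["log_income"] - mean) / std

# One-hot encoding: one indicator column per city
encoded = pd.get_dummies(df, columns=["city"])

# Quick exploratory summary of the numeric columns
print(encoded.describe())
```

The median is used for imputation here because it is less sensitive to extreme values than the mean; other strategies may suit other datasets better.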

Session 3

Principal Component Analysis (PCA)

  • What is PCA and why you need it
  • Applying PCA in Python with scikit-learn
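
As an illustration of what applying PCA can look like, the sketch below projects a four-feature dataset onto its first two principal components. The Iris dataset bundled with scikit-learn is an assumption chosen for convenience; the course may use different data.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Assumed example data: 150 flowers described by 4 numeric features
X, y = load_iris(return_X_y=True)

# PCA is sensitive to feature scale, so standardise first
X_scaled = StandardScaler().fit_transform(X)

# Keep the two directions that capture the most variance
pca = PCA(n_components=2)
X_2d = pca.fit_transform(X_scaled)

# Fraction of the total variance explained by each component
print(pca.explained_variance_ratio_)
```

The two-column result can be plotted directly as a scatter plot, which is one common reason to reduce dimensionality before exploring data.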

Day Two

Unsupervised learning and supervised learning

Session 1

Unsupervised learning

  • k-means clustering
  • Hierarchical cluster analysis
  • Density-based clustering
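
The three families of clustering methods listed above can be sketched with scikit-learn as follows. The synthetic blobs and every parameter value (number of clusters, eps, min_samples) are illustrative assumptions; DBSCAN is used here as one example of a density-based method.

```python
from sklearn.cluster import DBSCAN, AgglomerativeClustering, KMeans
from sklearn.datasets import make_blobs

# Synthetic 2-D data with three well-separated groups (assumed example)
X, _ = make_blobs(n_samples=300, centers=3, cluster_std=0.6, random_state=0)

# k-means: the number of clusters is chosen up front
kmeans_labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

# Hierarchical (agglomerative) clustering: merge points bottom-up
hier_labels = AgglomerativeClustering(n_clusters=3).fit_predict(X)

# Density-based clustering: clusters are dense regions, label -1 marks noise
dbscan_labels = DBSCAN(eps=0.5, min_samples=5).fit_predict(X)

print(sorted(set(kmeans_labels)), sorted(set(hier_labels)))
```

Unlike the first two methods, DBSCAN does not take the number of clusters as an input, so the number it finds depends on eps and min_samples.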

Session 2

Supervised learning

  • The k-nearest neighbours (KNN) algorithm
  • Decision Tree classifier
  • Overfitting and Validation
  • Hyperparameter tuning
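
The topics in this session fit together roughly as in the sketch below: hold out a test set, tune a hyperparameter by cross-validation on the training set only, then compare models on the held-out data. The Iris dataset, the candidate values for n_neighbors and the tree depth are assumptions made for illustration.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Validation: keep a test set aside so overfitting can be detected
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Hyperparameter tuning: pick k by 5-fold cross-validation on the training set
search = GridSearchCV(
    KNeighborsClassifier(),
    param_grid={"n_neighbors": [1, 3, 5, 7, 9]},
    cv=5,
)
search.fit(X_train, y_train)
knn_test_acc = search.score(X_test, y_test)

# Decision tree with a limited depth to reduce overfitting
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)
tree_test_acc = tree.score(X_test, y_test)

print(search.best_params_, knn_test_acc, tree_test_acc)
```

A large gap between training accuracy and test accuracy is the usual sign of overfitting; the test set is only touched once, after tuning is finished.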

Prerequisites

  • Elementary Python programming and use of the command line. You can acquire these skills at our Python bootcamp.
  • Basic probability and linear algebra.

Audience

Individuals who want to master new technical skills and learn the latest techniques and industry best practices to work effectively with Data Science teams.


Get in touch

Get in touch to discuss team size, pricing and your tech requirements. Send an email to training@cambridgespark.com or fill in our contact form. We’ll be sure to get back to you soon.
