Data Science Hands-On with Open Source Tools

Login to enroll
  • Course Number
  • Classes Start
    Any time, Self-paced

About This Course

Get started with some of the most popular tools for collaborative data science, including RStudio IDE, Jupyter Lab and Jupyter Noteboooks, Apache Zeppelin notebooks. Use the tools directly on Skills Network Labs, a free virtual lab environment that brings powerful open data science tools together so you can analyze, visualize, explore, clean data, run models and create apps without needing to download, install and maintain software. All you need is a modern web browser and your desire to learn.

Course Syllabus

  • Module 1 - Introducing Skills Network Labs (formerly Cognitive Class Labs)
    • What is Cognitive Class Labs (Data Scientist Workbench)?
    • Skills Network Labs account features
    • Creating a Skills Network Labs account
    • Managing data within My Data
    • LAB: Getting Started with Skills Network Labs
  • Module 2 - Introducing Jupyter Notebooks
    • What are Jupyter notebooks?
    • Getting started with Jupyter
    • Data and Notebooks in Jupyter
    • Sharing your Jupyter Notebooks and data
    • Apache Spark in Jupyter Notebooks
    • LAB: Getting Started with Jupyter Notebooks
  • Module 3 - Introducing Zeppelin Notebooks
    • What are Zeppelin Notebooks?
    • Zeppelin for Scala
    • Getting started with Zeppelin
    • Managing your Interpreters in Zeppelin
    • Apache Spark in Zeppelin Notebooks
    • LAB: Getting Started with Apache Zeppelin Notebooks
  • Module 4 - Introducing RStudio IDE
    • What is RStudio IDE?
    • Uploading files, Installing Packages and loading libraries in RStudio IDE
    • Getting started with RStudio IDE
    • RStudio Environment and History
    • Apache Spark in RStudio IDE
    • LAB: Getting Started with RStudio IDE

General Information

  • This course is free.
  • It is self-paced.
  • It can be taken at any time.
  • It can be audited as many times as you wish.

Recommended skills prior to taking this course

  • None


  • None

Course Staff

Polong Lin, Data Science Bootcamp instructor

Polong Lin

Polong Lin is a Data Scientist at IBM in Canada. Under the Emerging Technologies division, Polong is responsible for educating the next generation of data scientists through BDU. Polong is a regular speaker in conferences and meetups, and holds a M.Sc. in Cognitive Psychology.
Dr. Saeed Aghabozorgi, Data Science Bootcamp instructor

Saeed Aghabozorgi

Saeed Aghabozorgi, PhD is a Data Scientist in IBM with a track record of developing enterprise level applications that substantially increases clients’ ability to turn data into actionable knowledge. He is a researcher in data mining field and expert in developing advanced analytic methods like machine learning and statistical modelling on large datasets.
CodyClear Messages

Failed to send message. Retry.

Chat With Cody