Intro to Apache Spark

Databricks hosted this webinar introducing Apache Spark, the platform that Databricks is based upon.

Abstract: scikit-learn is one of the most popular open-source machine learning libraries among data science practitioners.

This workshop will walk through what machine learning is, the different types of machine learning, and how to build a simple machine learning model. This workshop focuses on the techniques of applying and evaluating machine learning methods, rather than the statistical concepts behind them. We will be using data released by the New York Times (

Prior basic Python and pandas experience is required.

Previous webinars in the series:

  • Watch Part1, Intro to Python: ( to learn about python)
  • Watch Part 2, Data Analysis with pandas:
  • Watch Part 3, Machine Learning:


#DataScientist, #DataEngineer, Blogger, Vlogger, Podcaster at . Back @Microsoft to help customers leverage #AI Opinions mine. #武當派 fan. I blog to help you become a better data scientist/ML engineer Opinions are mine. All mine.