My PySpark Essential Training course just launched on LinkedIn Learning!

I’m excited to share that my new course on PySpark Essential Training is now live on LinkedIn Learning!

I was approached by the LinkedIn Learning team after they found some of my Python courses on YouTube, and we brainstormed several ideas for topics together. While I hadn’t used PySpark in a production setting, I was really excited to dive into it and develop a new course for them! This is the first time I’ve created an on-demand course that actually pays royalties based on streams, which is a lot more scalable than the quarterly live trainings I’ve been hosting for O’Reilly.

In the LinkedIn Learning course, I provide a structured and hands-on introduction to PySpark – perfect for data engineers, analysts, and anyone looking to scale their data processing skills. The course covers:

  • The core concepts of Spark and PySpark.
  • Installing PySpark, loading, manipulating, and analyzing large datasets in a notebook environment.
  • How PySpark fits into a wider data engineering ecosystem.
  • Best practices about executing PySpark in a production environment.

You can stream the course here with a LinkedIn Learning subscription – and feel free to like and bookmark the course to provide feedback!