Apache Spark

SWAN: Spark connector and monitor

These notebooks show how SWAN and Spark are used for analytics and machine learning at CERN.

Analyzing monitoring data

Analyzing LHC logging data

Processing ROOT (NanoAOD) files with Distributed ROOT RDataFrame in Python and Spark (PySpark)

Physics analysis with Apache Spark using Coffea and Laurelin packages

Machine Learning with Apache Spark

Handwritten Digit Classification using Apache Spark and BigDL

Processing LHCb Opendata with Spark and ROOT

Spark Course

This gallery shows examples of using Apache Spark within SWAN. The full tutorial given at CERN by Prasanth Kothuri and Kacper Surdy can be found on Indico.



Data Frame Tutorial

RDD Tutorial

JDBC Tutorial

