Executive Summary

High-level metrics and project methodology.

Project Overview

This project implements an automated, end-to-end "Transit & Weather Statistic" data pipeline focused on the Turku region. The analytical core of this project tests the hypothesis that adverse weather conditions (meaning heavy precipitation, freezing temperatures) can cause a quantifiable shift in human behavior. Specifically, a decrease in public transit usage and a concurrent increase in personal vehicle traffic, which manifests as an increased volume of parking ticket violations.


Architecture & Tech Stack

The system is designed with a strict separation of concerns, utilizing a modern data lakehouse system alongside a compiled microservice backend and a reactive frontend dashboard.

  • Data Engineering & ETL: Databricks, PySpark
  • Machine Learning: Apache Spark MLlib
  • Backend API: Go (Golang)
  • Frontend: Vue.js, Nuxt, NuxtUI

Data Sources (2023)

  • Weather Data (FMI): Daily precipitation and temperature statistics collected from the Turku Artukainen measuring station.
  • Public Transit Data (Föli): Daily passenger boarding counts registered via ticketing machines across the Turku city limits.
  • Parking Fine Data: Records of parking violations and tickets issued within the Turku area.