Built a statistical anomaly detection system for a Dutch water consortium to monitor Rhine basin contamination. The system replaced days of manual analysis with instant automated alerts across 40+ discharge facilities and monitoring stations. Operators receive alerts with geospatial correlations and substance classifications, enabling them to investigate and respond quickly. Successfully detected a major PFBS discharge event (27,000 ng/l) and validated 233 facility-monitoring correlations across 24 water bodies.
A consortium for Dutch water companies needed to detect contamination in the Rhine before it reached Dutch borders. Their existing workflow relied on manual analysis of complex time-series water quality data from multiple Rhine basin sources. This approach was time-consuming, prone to bottlenecks, and often delayed detection of critical contamination events, potentially putting public water safety at risk.
The real need wasn't just faster analysis, it was an automated early warning system that could monitor 40+ facilities continuously with minimal false alarms.
Key considerations:
I designed and built a statistical anomaly detection system with the following components:
The system was designed to flag anomalies automatically while providing operators with the context they need to investigate and respond.
Python • Pandas • NumPy • Statistical Analysis • Geospatial Mapping • Custom Data Pipeline • PDF Processing • Time-Series Analysis