I design and build end-to-end analytics platforms — from raw data pipelines and GCP medallion architectures through dimensional models, ML, and interactive dashboards. 12+ years across Snowflake, dbt, BigQuery, Airflow, Apache Spark, and modern BI stacks. Based in South Africa.
Featured Work
Each project covers the full data lifecycle — ingestion, modelling, ML, and visualisation.
Databricks · Medallion Architecture · ML · Data Warehouse
Full-stack data engineering and ML platform for South Africa's leading parcel pickup network. Combines a Databricks Medallion Pipeline (Bronze → Silver → Gold → XGBoost, serverless, MLflow) with a Snowflake + dbt data warehouse and 15 trained ML models — end-to-end from dirty data ingestion to R502K/year RTS cost prediction.
Dimensional Modelling · Snowflake · SCD Type 2
Star-schema data warehouse for an employee wellbeing platform. Models corporate counselling sessions, medication sales, and patient experience across 16 tables — with SCD Type 2 history for employee and consent records.
Business Intelligence · CCPB Analytics
Multi-dimensional business intelligence platform for consumer channel and price-by-brand analytics. Covers revenue, trade spend ROI, product health scoring, and a full SKU master — all self-contained HTML dashboards.
Web Analytics · BigQuery · GTM
Full Google Analytics 4 implementation showcase covering event tracking, BigQuery exports, datalayer architecture, ecommerce funnel analysis, and consent mode quality validation — with SQL query library and GTM templates.
GCP · Medallion Architecture · dbt · Airflow · Apache Beam
Production-grade Google Cloud marketing data platform ingesting GA4, Google Ads, Campaign Manager 360, DV360, and YouTube via BigQuery Data Transfer Service. Raw data flows through a Bronze → Silver → Gold medallion pipeline — Cloud Dataflow (Apache Beam) validates and quarantines, dbt stages and models, Apache Airflow orchestrates the daily DAG.
Background
I'm a Data Engineer, Data Architect, and Analytics Specialist based in South Africa, with 12+ years spanning the full analytics stack — from raw ingestion and cloud pipeline architecture through dimensional modelling, machine learning, and executive-ready dashboards.
My work focuses on building production-grade data platforms that serve real business decisions. I enjoy turning messy, high-volume datasets into clean, trusted, and insightful products — whether that's a Snowflake star schema, a GCP medallion pipeline, a Databricks lakehouse, or a trained ensemble ML model.
I design data architecture from scratch: source-to-target flows, Bronze/Silver/Gold layers, dimensional marts, SCD Type 2 history, data-quality quarantine patterns, orchestration, and BI-ready semantic layers.
Cloud-native experience includes BigQuery, Databricks, Delta Lake, Cloud Dataflow (Apache Beam), Apache Airflow, dbt, Apache Spark / PySpark, and BigQuery Data Transfer Service — alongside Snowflake, SQL Server, PostgreSQL, Microsoft Fabric, Power BI, and Tableau from enterprise BI roles.
Currently open to Data Engineering, Data Architecture, Analytics Engineering, and Senior BI roles — locally and internationally.
Design architecture from scratch — source mapping, medallion/lakehouse patterns, warehouse layers, SCD2 dimensions, data quality gates, and migration roadmaps
Snowflake & BigQuery DWH design, Databricks/Delta pipelines, dbt transformations, Airflow DAGs, Cloud Dataflow (Apache Beam), Spark/PySpark, Apache NiFi
Classification, regression, clustering, forecasting — XGBoost, LightGBM, scikit-learn, MLflow tracking, model validation, and explainability
Power BI, Tableau, SSAS Tabular, DAX, Excel pivot analytics, Chart.js dashboards, executive KPI reporting, and semantic-layer design
GA4 implementation, BigQuery event analysis, GTM datalayer architecture, consent mode quality
Technical Stack
Built over years of hands-on project work across the full data lifecycle.
Get in Touch
Whether you're looking for a data engineer, analytics specialist, or BI consultant — I'd love to hear about your project.