Open to Opportunities

Anthony Apollis Data Engineer &
Analytics Specialist

I design and build end-to-end analytics platforms — from raw data pipelines and GCP medallion architectures through dimensional models, ML, and interactive dashboards. 12+ years across Snowflake, dbt, BigQuery, Airflow, Apache Spark, and modern BI stacks. Based in South Africa.

12+
Years Experience
5
Portfolio Projects
30M+
Records Modelled
15
ML Models Built
R10.1M
Projected Saving

End-to-End Analytics Projects

Each project covers the full data lifecycle — ingestion, modelling, ML, and visualisation.

Dimensional Modelling · Snowflake · SCD Type 2

Lyra Wellbeing Analytics

Star-schema data warehouse for an employee wellbeing platform. Models corporate counselling sessions, medication sales, and patient experience across 16 tables — with SCD Type 2 history for employee and consent records.

1.4M+
Total rows
16
Tables
SCD2
History tracking
Snowflake SQL Server Python Star Schema SCD Type 2

Business Intelligence · CCPB Analytics

PenBev — BI Dashboard Suite

Multi-dimensional business intelligence platform for consumer channel and price-by-brand analytics. Covers revenue, trade spend ROI, product health scoring, and a full SKU master — all self-contained HTML dashboards.

4
Dashboards
CCPB
Channel analytics
BI
Platform
Business Intelligence Excel Chart.js HTML5 / JS CCPB

Web Analytics · BigQuery · GTM

GA4 Analytics Portfolio

Full Google Analytics 4 implementation showcase covering event tracking, BigQuery exports, datalayer architecture, ecommerce funnel analysis, and consent mode quality validation — with SQL query library and GTM templates.

7
SQL queries
GA4
Platform
GTM
Implementation
GA4 BigQuery GTM SQL Consent Mode

About Me

I'm a Data Engineer, Data Architect, and Analytics Specialist based in South Africa, with 12+ years spanning the full analytics stack — from raw ingestion and cloud pipeline architecture through dimensional modelling, machine learning, and executive-ready dashboards.

My work focuses on building production-grade data platforms that serve real business decisions. I enjoy turning messy, high-volume datasets into clean, trusted, and insightful products — whether that's a Snowflake star schema, a GCP medallion pipeline, a Databricks lakehouse, or a trained ensemble ML model.

I design data architecture from scratch: source-to-target flows, Bronze/Silver/Gold layers, dimensional marts, SCD Type 2 history, data-quality quarantine patterns, orchestration, and BI-ready semantic layers.

Cloud-native experience includes BigQuery, Databricks, Delta Lake, Cloud Dataflow (Apache Beam), Apache Airflow, dbt, Apache Spark / PySpark, and BigQuery Data Transfer Service — alongside Snowflake, SQL Server, PostgreSQL, Microsoft Fabric, Power BI, and Tableau from enterprise BI roles.

Currently open to Data Engineering, Data Architecture, Analytics Engineering, and Senior BI roles — locally and internationally.

🧭

Data Architecture

Design architecture from scratch — source mapping, medallion/lakehouse patterns, warehouse layers, SCD2 dimensions, data quality gates, and migration roadmaps

🏗️

Data Engineering

Snowflake & BigQuery DWH design, Databricks/Delta pipelines, dbt transformations, Airflow DAGs, Cloud Dataflow (Apache Beam), Spark/PySpark, Apache NiFi

🤖

Machine Learning

Classification, regression, clustering, forecasting — XGBoost, LightGBM, scikit-learn, MLflow tracking, model validation, and explainability

📊

Business Intelligence

Power BI, Tableau, SSAS Tabular, DAX, Excel pivot analytics, Chart.js dashboards, executive KPI reporting, and semantic-layer design

🌐

Web Analytics

GA4 implementation, BigQuery event analysis, GTM datalayer architecture, consent mode quality

Skills & Tools

Built over years of hands-on project work across the full data lifecycle.

Data Warehousing

Snowflake SQL Server PostgreSQL BigQuery Star Schema SCD Type 2 Dimensional Modelling Surrogate Keys

Data Architecture

Data Architect Architecture from Scratch Medallion / Lakehouse Source-to-Target Mapping Data Quality Gates Quarantine Patterns Migration Roadmaps Semantic Layers

Data Transformation

dbt dbt Tests Incremental Models SQL Python (Pandas) ETL / ELT Medallion Architecture Data Cleaning T-SQL Procedures SSIS

Machine Learning

XGBoost LightGBM scikit-learn Classification Clustering (K-Means) Time Series SHAP MLflow Snowpark ML Statsmodels

Visualisation & BI

Chart.js HTML/CSS/JS Excel (Advanced) Power BI Tableau DAX SSAS Tabular SSRS Looker Studio

Web Analytics

GA4 Google Tag Manager BigQuery Export Consent Mode Datalayer

Cloud & Orchestration

Databricks Databricks Workflows Delta Lake Microsoft Fabric OneLake Direct Lake Apache Airflow Cloud Dataflow Apache Beam BQ Data Transfer Apache Spark PySpark Apache NiFi Python Git / GitHub REST APIs Cloud Monitoring

Let's Work Together

Open to new opportunities

Whether you're looking for a data engineer, analytics specialist, or BI consultant — I'd love to hear about your project.