• Developed automated Python workflows integrated with ADF, improving pipeline reliability and reducing processing time by 40% • Utilized Azure Data Explorer and Kusto to analyze and manipulate data for various builds (Fast build, BuildXL, downlevel builds) • Developed data transformations in Azure Databricks using Spark, improving processing performance for telemetry dataset • Used Terraform to provision and manage cloud infrastructure components, enabling repeatable and version-controlled setup • Designed ETL pipelines in Apache Airflow to orchestrate automated ingestion and transformation of data from Azure Data Lake • Built scalable ETL data pipeline in Azure Data Factory to process build telemetry for Windows updates impacting 750 M+ devices • Utilized GitLab and GitHub for version control, code review, pull requests, and collaborative development across ADF pipelines • Built data ingestion, modeling, and governance workflows using Microsoft Fabric to support enterprise reporting and analytics • Maintained CI/CD pipelines in Azure DevOps Repos to automate deployments, ensure version control, and enforce code quality • Leveraged Generative AI tools like Azure OpenAI and GitHub Copilot to accelerate SQL development, exploratory data analysis • Used Azure Monitor and Log Analytics to track pipeline health, ingestion latency, and telemetry across the Azure data workflow • Developed scalable data flows for Azure Synapse and cloud storage environments, supporting real-time engineering dashboards
Data Science
British Airways
Credential ID: ZoMDchThcXpH7oAad
Data Analytics
Deloitte Australia
Credential ID: vmBCEwJAuryuNfC3r
Python
HackerRank
Introduction to Responsible AI
Introduction to Image Generation
Google Crash Course on Python
Credential ID: P4CUL4KA3BH6
Google Data Analytics Professional Certificate
Coursera
Credential ID: LLJ5NKJXHGBD
Google Data Analytics Capstone: Complete a Case Study
Coursera
Data Science Orientation
Coursera
Configure a semantic model
Microsoft
Credential ID: 6D634569C7C1DE79