Resume

Summary

Hi, I’m Matt Guan, a data platform engineer based in Raleigh, NC. I have a passion for developing analytics solutions and automating standard workflows.

Having worked in both development and consulting roles, I have a holistic view of how analytics can be implemented in a scalable manner to drive change within a business.

Skills Overview

  • Python: Experienced in developing Python packages using libraries such as pandas, pyspark, dask, boto3, and flask, with Git for version control.
  • DevOps: Experienced in writing CI jobs to deploy code to production and in building Docker images.
  • Data Engineering: Experienced in building tables, writing stored procedures, and writing complex SQL queries.
  • Data Visualization: Experienced in front-end and back-end development of large-scale data visualization projects using Tableau and Python packages such as bokeh, plotly, seaborn, and matplotlib.
  • Business Intelligence: Experienced in developing BI process flows with scalability and automation in mind.
  • Predictive Modeling: Experienced in building predictive models with robust statistical validation and applying them to a variety of business problems.
  • Other Skills: I have experience with R, SAS, Java, and C/C++, although Python is now my language of choice. I can do advanced Excel if I really need to.

Work Experience

Senior Data Engineer, Team Lead

Red Hat Aug 2018-Present – Raleigh, NC

  • Lead a team of 6 Data Engineers within a larger analytics team of Data Analysts and Data Scientists. Team lead responsibilities include leading sprint planning calls, advising in project meetings, training team members, performing code reviews, assisting with hiring, and representing the team in various cross-functional initiatives.
  • Collaborate with other Platform Data Engineers to build and maintain a distributed analytics platform for the larger team
  • Develop Python packages to streamline data engineering and data science workflows
  • Develop CI pipelines to test and deploy production code throughout the team analytics platform
  • Write large-scale ETL pipelines to be orchestrated in containerized environments via Airflow

Marketing Science Engineer

Valassis Digital Apr 2018-Aug 2018 – Raleigh, NC

  • Developed and implemented production-level streams to automate the Marketing Science Team’s workflow
  • Acted as the primary contributor to the Marketing Science codebase, ensuring coding best practices across the team
  • Designed and populated backend tables (Hive and Postgres) to support data products across the Operations organization
  • Built self-service analytics tools for the larger Operations organization

Marketing Scientist

Valassis Digital Jan 2017-Apr 2018 – Raleigh, NC

  • Set up and performed measurement studies for digital marketing campaigns
  • Generated robust post-campaign insights using a variety of data sources (point of sale, panel, demographic, interest, etc.)
  • Developed Python packages to streamline the workflows of the marketing science and insights data analysis teams
  • Built standardized, dynamically populated Tableau templates for use across multiple teams

Analytics Associate

Mu Sigma Sep 2015-Jan 2017 – Atlanta, GA

  • Consulted with clients on how to strategically leverage advanced analytics in order to meet their business needs
  • Collaborated with an offshore team in Bangalore, India, to build predictive models, create dashboard visualizations, and perform ad hoc analysis for clients
  • Worked with clients to overhaul their data ingestion process and set up a relational database using MySQL

Education

UNC Chapel Hill

Years: 2011-2015
Degrees: B.A. Economics, B.S. Biology, Chemistry Minor
Activities: Undergraduate Research, Resident Advisor, Peer Tutor, Chemistry Mentor, Buckley Public Service Scholar, Omicron Delta Epsilon

NC State University

Years: 2019-2022
Computer Programming Certificate
Relevant Coursework: Java I, Java II, Discrete Math, C/C++, Assembly Language, Operating Systems, Data Structures
