My name is Emma Todd, and I’m a data professional working in NYC.

Leveraging experience in both data science and engineering, I use Python, R, SQL, machine learning, statistical analysis, relational database management, and data-driven storytelling to help teams make better decisions.

Work Experience

June - Aug 2019
American Regent
Marketing and Supply Chain Analytics Intern

Nov 2020 - Jan 2022
Michael Kors
Customer Data Analyst

Jan 2022 - May 2023
Fender
Data Scientist

Aug 2022 - Nov 2022
Made by Gather
Data Science and Analytics Consultant

May 2023 - Present
Made by Gather
Senior Data and Business Intelligence Analyst

Favorite Languages, Tools and Libraries

Python
R
SQL
SSMS
Azure Data Factory
Snowflake
Fivetran
Alteryx
Tableau
plotly
matplotlib
seaborn
ggplot2
Scikit-learn
NumPy
XGBoost

Certifications

Completed (12/2021)
IBM Data Science, Coursera
Courses Completed:
  • What is Data Science?
  • Tools for Data Science
  • Data Science Methodology
  • Python for Data Science, AI & Development
  • Python Project for Data Science
  • Databases and SQL for Data Science with Python
  • Data Analysis with Python
  • Data Visualization with Python
  • Machine Learning with Python
  • Applied Data Science Capstone
In progress
IBM Data Engineering, Coursera
Courses Completed:
  • Introduction to Data Engineering
  • Python for Data Science, AI & Development
  • Python Project for Data Engineering
  • Introduction to Relational Databases (RDBMS)
  • Databases and SQL for Data Science with Python
  • Hands-On Introduction to Linux Commands and Shell Scripting
  • Relational Database Administration (DBA)
  • ETL and Data Pipelines with Shell, Airflow, and Kafka
  • Data Warehouse Fundamentals

Recent Projects

Python / R / Tableau / Machine Learning

Ensemble Machine Learning - Random Forest

Business value: Applied ensemble predictive modeling to drive informed decision making in new product development.

Predicted customer demand for new products based on the historical performance of established material attributes. Used a Random Forest ML model that demonstrated a 22% reduction in MAPE compared to a single regression tree. Visualized the results with matplotlib and Tableau to communicate the details of the optimized product launch plan.
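For readers unfamiliar with the metric, here is a minimal sketch of how MAPE (mean absolute percentage error) and a reduction between two models can be computed. The numbers and the helper names `mape` and `relative_reduction` are made up for illustration and are not the project's data or code. The sketch also assumes the reduction is measured relative to the baseline, which the summary above does not specify.

```python
def mape(actual, predicted):
    """Mean absolute percentage error, in percent.

    Assumes both sequences have the same length and no actual value is zero.
    """
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


def relative_reduction(baseline, improved):
    """Percent reduction of `improved` relative to `baseline`."""
    return 100.0 * (baseline - improved) / baseline


# Illustrative demand figures only (hypothetical, not project data).
actual = [100.0, 200.0, 400.0]
single_tree_pred = [120.0, 160.0, 440.0]  # stand-in for one regression tree
forest_pred = [110.0, 180.0, 420.0]       # stand-in for a Random Forest

tree_mape = mape(actual, single_tree_pred)
forest_mape = mape(actual, forest_pred)
reduction = relative_reduction(tree_mape, forest_mape)

print(f"single tree MAPE: {tree_mape:.2f}%")
print(f"forest MAPE:      {forest_mape:.2f}%")
print(f"relative reduction in MAPE: {reduction:.1f}%")
```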

Python / AWS / SageMaker

Web Scraping of E-commerce Sites

Business value: Scalable competitive business intelligence.

Used the Python BeautifulSoup package to scrape e-commerce websites. Built two script versions: a local version that retrieves pages through a Selenium driver with headless browsing, and an API version for automation in AWS (SageMaker).

R / Machine Learning / Correlation Testing

Macroeconomic Data Analysis

Business value: Analyzed and communicated how the business has reacted to past macroeconomic events, and projected those trends forward to set future expectations.

Created business forecasts using multiple linear regression models that take macroeconomic variables as predictors.

Python / R / GIS / Correlation Testing

Employee Retention Analysis

Business value: Leveraged company data for meaningful insights into retention challenges and recommended the best KPIs for future retention reporting and analysis.

Used correlation testing to identify the environmental factors with the greatest impact on employee retention. Identified inflection points in data density plots to communicate key retention parameters. Also used geospatial analysis to evaluate the effect of commute times on retention metrics.

Python / Streamlit / GIS

Streamlit Web App

Business value: Open source dashboarding and app development integrating statistical analysis and trend extraction.

Coded an interactive data dashboard in Python with the Streamlit library and hosted it as a web app on the Streamlit service.

Web Scraping / Python

Eclipse Data Visualization

Business value: Quickly analyze online data sets and extract seasonal trends.

Scraped eclipse data from Wikipedia with requests and BeautifulSoup to analyze magnitude seasonality and visualize upcoming eclipse events.

Tableau / GIS

Tableau Strava Dashboard

Business value: Advanced dashboarding and interactive visualization.

Created a Tableau dashboard with my personal Strava data.

R / GIS

Fender History Globe Visualization

Business value: Open source interactive visualization.  

Plotted Fender history on a globe using plotly in R.

Python / Machine Learning

NYC Air Quality K-means Cluster Analysis

Business value: Fast unsupervised ML clustering of similar events.

K-means cluster analysis in Python of NYC air quality data (from NYC Open Data) as it relates to certain environmental parameters.

Engineering and Process Schematics