Akash Gangadharan



Data Engineer with 4 years of experience architecting and developing robust data infrastructure and pipelines on cloud platforms such as AWS and GCP. Proficient across the full data engineering stack, including pipeline development, ETL processes, database design, and data modeling. Currently seeking data engineer or data scientist roles with greater technical leadership and involvement in strategic initiatives.


Experience

Data Research Assistant

Indiana University School of Medicine

• Acquired and integrated sociodemographic, environmental, and health data from diverse sources using advanced techniques like web scraping and spatial data enrichment.
• Performed complex analytics including descriptive statistics, trend analysis, and data visualization utilizing Python, SQL, Excel, Power BI, and STATA.
• Built dashboards and presentations to summarize key findings for stakeholders and leadership.

January 2024 - Present

Teaching Assistant

Indiana University School of Public Health

• Provided feedback and guidance to 60+ students and graded their work as TA for the Personal Leadership Development course.

August 2023 - Present

Microsoft Ambassador

University Information Technology Services

• Led 50+ student workshops on Microsoft Office 365 to promote its adoption.
• Used Microsoft Forms, MS SQL Server, and Power BI to analyze event participation, driving data-informed decisions that resulted in a 2x increase in student adoption of Microsoft technologies.

August 2023 - December 2023

Data Engineer Intern (AWS)

Rocket Companies

• Built a scalable ETL pipeline using PySpark and AWS Glue that processes 10M+ user records from raw to transformed state.
• Streamlined the data validation process in the ETL pipeline, reducing data load failures by 40%.
• Partnered cross-functionally with the MLOps team and designed the data architecture to integrate the Health Analytics Report into the Rocket Money app.

May 2023 - August 2023

Data Engineer

KPMG

• Led the migration of Oracle SQL scripts to BigQuery, efficiently orchestrating data loading and achieving project completion 15 days ahead of schedule.
• Transformed 15+ financial reports by automating end-to-end data pipelines in SSIS, cutting delivery time from 2 days to 5 minutes, enabling Excel exports, and automating stakeholder notifications.
• Streamlined data validation on GCP Cloud Storage by developing a Python script for row count analysis, reducing processing time from 1 hour to 5 minutes and improving the efficiency of data validation tasks by 70%.

May 2021 - July 2022

Data Engineer

Vodafone

• Generated more than £1M in revenue by delivering cross-sell, up-sell, and retention campaigns with the Marketing team.
• Automated weekly SQL updates with Python after a critical database error, enhancing accuracy and efficiency by 50%.
• Streamlined data processing by automating SQL and Salesforce integration, cutting task time from 5 days to 30 minutes and boosting campaign efficiency.

August 2018 - May 2021

Education

Indiana University Bloomington

Master of Science - Data Science

• Data Research Assistant at IU School of Medicine

• Teaching Assistant for Personal Leadership Development (L-102) at IU School of Public Health

• Professional Development Officer in the Data Science Club of Indiana University Bloomington

August 2022 - May 2024

University of Pune

Bachelor of Engineering - Mechanical Engineering
June 2013 - May 2017

Skills

Business Intelligence and Visualization Tools
  • SQL
  • MS Excel
  • Tableau
  • Power BI
  • Data Modeling
  • Teradata
  • NoSQL
  • MongoDB
  • MSSQL
Data Processing & ETL Frameworks
  • Python
  • Spark
  • GitHub
  • CircleCI
  • Apache Airflow
  • Hadoop
  • Hive
  • Snowflake
  • Kafka
Cloud Infrastructures
  • AWS Glue
  • AWS Athena
  • AWS S3
  • AWS QuickSight
  • GCP BigQuery
  • GCP Cloud Composer
  • GCP Cloud Storage
  • AWS EMR
  • AWS Kinesis
Certifications
  • Google Cloud Professional Machine Learning Engineer
  • Google Cloud Certified Professional Data Engineer
  • Apache Airflow Fundamentals
  • Apache Airflow DAG Authoring
  • Microsoft Azure Certified Data Scientist Associate
  • Microsoft Azure Certified AI Engineer Associate


Projects

YouTube Data Analysis Pipeline

Skills: AWS Glue, PySpark, AWS Athena, AWS QuickSight

Designed an end-to-end ETL data pipeline in AWS for YouTube data analysis, with visualization in AWS QuickSight.

[GitHub]

Sales Performance Dashboard

Skills: Excel

Built an interactive Sales Performance Dashboard in Excel as a side project.

[GitHub]