Data Research Assistant
Indiana University School of Medicine
• Acquired and integrated sociodemographic, environmental, and health data from diverse sources using advanced techniques like web scraping and spatial data enrichment.
• Performed complex analytics, including descriptive statistics, trend analysis, and data visualization, using Python, SQL, Excel, Power BI, and Stata.
• Built dashboards and presentations to summarize key findings for stakeholders and leadership.
January 2024 - Present
Teaching Assistant
Indiana University School of Public Health
• Provided feedback and guidance to 60+ students and graded their work as TA for the Personal Leadership Development course.
August 2023 - Present
Microsoft Ambassador
University Information Technology Services
• Led 50+ workshops on Microsoft Office 365 for students, promoting its adoption.
• Utilized Microsoft Forms, Microsoft SQL Server, and Power BI to analyze event participation, driving data-informed decisions that resulted in a 2x increase in student adoption of Microsoft technologies.
August 2023 - December 2023
Data Engineer Intern (AWS)
Rocket Companies
• Built a scalable ETL pipeline using PySpark and AWS Glue to process 10M+ user records from raw to transformed state.
• Streamlined the data validation process in the ETL pipeline, reducing data load failures by 40%.
• Partnered cross-functionally with the MLOps team and designed the data architecture to integrate the Health Analytics Report into the Rocket Money app.
May 2023 - August 2023
Data Engineer
KPMG
• Led the migration of Oracle SQL scripts to BigQuery, orchestrating data loading and completing the project 15 days ahead of schedule.
• Transformed 15+ financial reports by automating end-to-end data pipelines in SSIS, cutting delivery time from 2 days to 5 minutes, enabling Excel exports, and automating stakeholder notifications.
• Streamlined data validation on GCP Cloud Storage by developing a Python script for row count analysis, reducing processing time from 1 hour to 5 minutes and improving the efficiency of data validation tasks by 70%.
May 2021 - July 2022
Data Engineer
Vodafone
• Generated more than £1M in revenue by delivering cross-sell, up-sell, and retention campaigns with the Marketing team.
• Automated weekly SQL updates with Python after a critical database error, enhancing accuracy and efficiency by 50%.
• Streamlined data processing by automating SQL and Salesforce integration, cutting task time from 5 days to 30 minutes and boosting campaign efficiency.
August 2018 - May 2021