π A graduate student at Northeastern University, Boston
π Boston, Massachusetts
π Project I am working on : Building end-to-end Data Engineering Pipelines
π± Currently Learning: Apache Spark and Databricks
π¨βπ» All of my projects are available at https://github.com/MallikaGaikwad
π You can view my dashboards at https://public.tableau.com/app/profile/mallika.gaikwad/vizzes
π« You can reach me at [email protected]
π¨βπ» Business Intelligence Engineer Co-op
π Sunny Benefits Inc, South Carolina, USA
π January 2025-September 2025
- I implemented a medallion data architecture using dbt Cloud and AWS Redshift, collaborating with product and data teams to improve data quality and stabilize pipelines. This eliminated recurring failures and led to a 95% reporting accuracy.
- I also redesigned the EAV reporting layer using SQL and Jinja, consolidating multiple data sources into a single, optimized source for Tableau. By applying dimensional modeling principles, I improved dashboard performance and supported over 2 million member records.
- As part of pipeline optimization, I streamlined ELT processes using modular modeling in dbt Cloud, Agile development practices, and proactive problem-solving. This reduced refresh time from nine to six minutes, improving pipeline efficiency by 33%.
- To enhance visibility for leadership, I built and accelerated executive Tableau dashboards, integrating financial and transactional data from multiple sources. By applying data validation and storytelling techniques, I improved reporting speed by 40% and delivered clearer, more actionable insights.
- I also standardized schemas and unified data from 12+ sources on AWS Redshift and Postgres, leveraging Git-based version control and automated dbt tests to ensure data integrity and enable advanced analytics.
π¨βπ» Data Analyst
π Orient Technologies Pvt Ltd - Mumbai, India
π August 2021 - July 2023
- I spearheaded communication with over 10 key banking sector clients, ensuring alignment between dashboard scope and strategic business objectives while understanding analytical requirements.
- I orchestrated the management of extensive datasets by streamlining over 90,000 records across 55 columns through the use of advanced SQL queries, thereby enhancing data integrity and eliminating data loss risks.
- I implemented data preprocessing techniques using the Pandas library, resulting in a 95% accuracy rate in data optimization and facilitating more reliable analytical outcomes.
- I deployed over 150 dashboards and reports using ThoughtSpot and PowerBI, providing critical insights that empowered stakeholders to make data-driven decisions, consequently boosting operational efficiency.
- I drove collaboration with the data engineering team to automate data jobs and update dashboards with real-time data, ultimately reducing the client's Excel report workload.
π Master of Science in Data Analytics Engineering
π Northeastern University - Boston,Massachusetts, United States
π September 2023 - December 2025
- Foundations of Data Analytics
- Data Management for Analytics
- Data Mining Engineering
- Computation and Visualization
π Bachelor of Engineering In Computer Science
π Rajiv Gandhi Institute Of Technology - Mumbai, India
π August 2017 - June 2021
- Database Management Systems
- Data Warehousing and Mining
- Big Data Analytics
- Management Information Systems