Sudip P. Sudip-Pandit

Hi there 👋

🔭 I’m a Senior Data Engineer with 11+ Years of exeperince in software Engineering, 8+ Years in Hadoop, Pyspark, AWS/Azure Big Data Domain, Databricks, SNowflakes, ML and Gen AI.
🌱 I’m having 4+ years of expertise in healthcare industry.
👯 I have designed and deployed Kafka streaming applications for real-time data processing and analytics, integrating with Spark Structured Streaming.
🤔 I’m Skilled in ETL and ELT (DBT) processes, data warehousing, machine learning pipelines, and database management. Strong problem-solving abilities in Hadoop, Spark, Hive, AWS, Azure, Kafka, and NoSQL databases.
💬 With a strong commitment to innovation, I work closely with engineering, data science, and business teams to design and implement scalable, robust, and efficient data solutions.
📫 Proven expertise in designing, building, and optimizing scalable data pipelines and large-scale data infrastructures.
😄 Strong problem-solving abilities in Hadoop, Spark, Hive, AWS, Azure, Kafka, and NoSQL databases.
⚡ Built end to end data pipelines such as Data Reconcilliation, Data Profiling, Backward filling, data governance and Data quality.