- π Iβm a Senior Data Engineer with 11+ Years of exeperince in software Engineering, 8+ Years in Hadoop, Pyspark, AWS/Azure Big Data Domain, Databricks, SNowflakes, ML and Gen AI.
- π± Iβm having 4+ years of expertise in healthcare industry.
- π― I have designed and deployed Kafka streaming applications for real-time data processing and analytics, integrating with Spark Structured Streaming.
- π€ Iβm Skilled in ETL and ELT (DBT) processes, data warehousing, machine learning pipelines, and database management. Strong problem-solving abilities in Hadoop, Spark, Hive, AWS, Azure, Kafka, and NoSQL databases.
- π¬ With a strong commitment to innovation, I work closely with engineering, data science, and business teams to design and implement scalable, robust, and efficient data solutions.
- π« Proven expertise in designing, building, and optimizing scalable data pipelines and large-scale data infrastructures.
- π Strong problem-solving abilities in Hadoop, Spark, Hive, AWS, Azure, Kafka, and NoSQL databases.
- β‘ Built end to end data pipelines such as Data Reconcilliation, Data Profiling, Backward filling, data governance and Data quality.
π
At work
Data engineer with 11+ years of experience in building large scale data-processing, data-intensive applications, Machine Learning and cloud computing
Pinned Loading
-
100-Days-Of-ML-Code
100-Days-Of-ML-Code PublicForked from Avik-Jain/100-Days-Of-ML-Code
100 Days of ML Coding
-
druid
druid PublicForked from medb/druid
Apache Druid: a high performance real-time analytics database.
Java
-
-
-
Sqoop-Deep-Drive
Sqoop-Deep-Drive PublicI tried to put all the real time Real Time Sqoop Commands in this repository.
-
CCA-175-practice-tests-resource
CCA-175-practice-tests-resource PublicForked from proedu-organisation/CCA-175-practice-tests-resource
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.