Hi there π! I am Utkarsh Mathur, a.k.a. DataMathur, a 24-year-old new-grad with MS in Data Science from University at Buffalo, B.Tech. from IIT Roorkee, and 2 years of experience in Data Science, Software Engineering, and Data Engineering. I am currently working as a Data Scientist at Atriano, developing an educational story generation web app for kids using Large Language Models (LLMs) and recommendation algorithms.
I am passionate about leveraging my skills to develop scalable AI/ML software and services as well as picking up new skills on the way. My expertise spans programming (Python, C++, SQL, R, Perl, and JavaScript), machine learning, data engineering, and software development.
I am committed to continuous learning and passionate about creating impactful and practical solutions. I believe that with dedication, hard work, and a positive mindset, anything is achievable - a principle I've demonstrated throughout my academic and professional career.
Education π
University at Buffalo, State University of New York
Master of Science in Data Science
January 2023 β June 2024
Indian Institute of Technology Roorkee (IIT Roorkee)
Bachelor of Technology in Polymer Science (Chemical Engineering)
July 2018 β July 2022
Work Experience πΌ
- Machine Learning Engineer at Atriano (September 2024 - Present)
- Machine Learning Engineer, Imaging at ImagoAI (October 2024 - January 2025)
- Data Scientist at Quinbay (May 2022 - October 2022)
- Machine Learning Engineer at Hono (July 2021 - April 2022)
- Research Intern under Dr. Mayank Goswami (September 2020 - June 2021)
- Data Scientist at ImagoAI (April 2021 - May 2021)
- Research Intern under Dr. Kusum Deep (September 2021 - December 2021)
- Research Intern under Dr. Gaurav Manik (July 2019)
-
Python Programming: Develop end-to-end solutions for Machine Learning, Model Deployment, and Process Automations using Python.
-
Machine Learning: Develop and deploy ML solutions for Computer Vision, NLP, Regression, Classification, and Recommendation Systems using sklearn, PyTorch, TensorFlow, Keras, and XGBoost.
-
MLOps: Deploy ML solutions on cloud (AWS, GCP) using ETL/ELT pipelines, MLOps, CI/CD pipelines, and Model Monitoring Tools.
-
Data Analysis: Performing data analysis with PowerBI, Tableau, Python, SQL, and R to extract actionable insights
-
ETL/ELT Pipelines: Developing ETL pipelines for big data using AWS, SQL, Snowflake, and Databricks.
-
Software Developmemt: Fullstack deployment of developed solutions using Python, JavaScript (Node, Angular, React, MongoDB, Django), C++, and HTML/CSS.
-
Gen AI: Fine-tuning GenAI foundation models (LoRA, QLoRA), RAG with Knowledge Graphs and Vector Database augmentation for Agentic AI.
Key Projects π
- Graph Neural Networks and Large Language Models: A Literature Review (Capstone Project - University at Buffalo)
- Metaheuristic Optimization v/s Backpropagation (Course Project - University at Buffalo)
- Top Spotify Tracks Database (Course Project - University at Buffalo)
- Statistical Analysis (Course Project - University at Buffalo)
- Clustering and Time Series Analysis (Course Project - University at Buffalo)
Contact Me βοΈ
Feel free reach out at [email protected] and [email protected] to explore potential opportunities and collaborations.
For more information regarding means to contact me, please visit the contact page.