Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View benlooi1913's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report benlooi1913

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
benlooi1913/README.md

My Data Journey

Here's a summary of my data journey so far, highlighting what I've accomplished and what I'm working on:

Journey

  • Python Libraries: Explored pandas, numpy, matplotlib, seaborn, plotly, altair, sklearn, streamlit, prophet, neural prophet, etc.
  • Visualization: Learned about different graph types and customization with altair, and converting visuals to HTML for deployment. Also used streamlit to create quick web applications.
  • Machine Learning: Developed a Bayesian Network with a novel approach to accurately make prediction profiling on the manufacturing defect and understanding of causality discovery among the intricate interplay interaction of the process variables - real industry project.
  • Web Scraping: Exploring web-scrapping and breaking through the firewalls of reputable websites for data mining.
  • Algorithms and Data Types: Learning about binary search, linear search, array, linked list, big O notation, selection sort, stack, queue, quicksort, hashtable, collisions, load factor, hash function.
  • Mathematics: Exploring Bayesian statistics, hypothesis testing, probability sampling, statistical significance, designing tests, and inferential statistics.

Current Focus

  • Future Interests: Interested in learning Rust, MLOps, DevOps, cloud computing, and handling big data with Hadoop and Spark.
  • Building Data Pipelines: Used Python, AWS Lambda, AWS Redshift, CRON scheduling, and encrypting personally identifiable information (PII) data columns.
  • Web Development: Gaining experience in HTML, CSS, JavaScript, and AWS for computing, storage, network routing, and authorization.
  • Certification: Targeting to gain datacamp certificates in data engineer and data scientist in May 2024
  • Competition: Looking for teams in Kaggle Competition

Popular repositories Loading

  1. benlooi1913 benlooi1913 Public

    Config files for my GitHub profile.

  2. datascienceroutemap datascienceroutemap Public

    This repository serves as a storage for those projects I have done under the context of predictive business analytics. All of these projects are business insights delver, inclusive of recommender s…

    Jupyter Notebook

  3. Image-Classification Image-Classification Public

    Jupyter Notebook

  4. Open-CV Open-CV Public

    Python

  5. Text-and-Speech- Text-and-Speech- Public

  6. bayesian_network- bayesian_network- Public

    Project practicum - manufacturing defect causality discovery and prediction profiling using Bayesian Network

    Jupyter Notebook