Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View wischmcj's full-sized avatar

Block or report wischmcj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
wischmcj/README.md

Collin Wischmeyer

I'm a data engineer and environmental data enthusiast passionate about empowering scientists with tools to aquire and analyze data. My largest side projects focus on

  • Remote sensing data analysis and publishing related research
  • Using infrastructure-as-code tools for the orchestration and containerization of workflows (MlOps
  • Webscraping and archiving

πŸ“« Questions? Connect with me at:
LinkedIn β€’ [email protected]

πŸ”¬ Projects at a Glance

Section Technologies Project
Infrastructure-as-Code Ansible, Terraform, Bash ml_ops_tree_learn: Object detection MLOps
Prometheus, Grafana, NodeJS, Docker ArchiveTeam IaC: Distributed compute observation stack
Geospatial & Remote Sensing Open3D, PyTorch, OpenCV, Rasterio pyqsm: Image processing and spatial algorithms
NumPy, MatPlotLib, GeoPandas, GDAL canopyHydrodynamics: Simulating water movement within tree canopies
Data Engineering / DevOps DLT, DuckDB, Web Scraping, Streamlit LinkedInScraper: Automated data acquisition
GitOps, Pandocs, PyPI canopyHydrodynamics: Robust GitOps CI/CD workflows

|

πŸš€ Featured Projects

Note

Detailed project descriptions are available via dropdowns.

GitOps, NumPy, MatPlotLib, GeoPandas, GDAL, Pandocs, PyPI

canopy hydrodynamics visualization

Simulating water movement within tree canopiea under varied meteorological conditions. Identifies key structural traits:
  • Stemflow and throughfall generating areas of the canopy
  • The 'drip points' to which throughfall is directed - complete with their relative volumes
  • 'Divides' and 'confluences' within the canopy that dictate the flow of water through the canopy
Leverages GitOps for robust CI/CD capabilities.
  • automated linting and testing for all changes
  • dynamically created version upgrade branches
  • auto-generated method documentation
  • Versioned deployment automated for release branches

Prometheus, Grafana, NodeJS, Docker, Bash

at_observation_process

Infrastructure-as-code to provision and configure a multi-server, multi-container cluster with a modern observability stack. Utilized for the community archive project ArchiveTeam. Consists of:
  • Docker containerization monitored by CloudWatch
  • Prometheus for node management/aggregation
  • Graphana dashboards for visualization
  • a custom a node.js metrics server for exporting telemetry.

🌲 pyQSM

SciPy, Open3D OpenCV, Rasterio

pyqsm_example_isolation

Image processing and spatial algorithms to clean and segment trees and their components within terrestrial LiDAR point clouds. Key functionality includes:
  • Tree Isolation: Separating individual trees from surrounding man-made objects and other vegetation.
  • Epiphyte Segmentation: Isolating and analyzing different parts of trees (trunk, branches, leaves) as well as plants in and around the trees.
  • Ray Casting Similations: Creation of 3D meshes representing objects and examining their characteristics via tensor intersection calculations.

  • Laspy, Terraform, PyTorch, Open3D

    object_isolated_w_mlo-tl

    An MLOps pipeline for configuration and deployment of a convolutional neural-net on GPU-enabled, cloud-hosted clusters. Automates the provisioning of Digital Ocean GPU droplets to allow users to leverage CUDA friendly compute. Designed as a 'one-click' solution enabling researchers without specialized hardware to process LiDAR data at minimal cost.

    πŸ•ΈοΈ linkedInScraper

    DLT, DuckDB, Web Scraping, Streamlit

    LI Scraper Streamlit UI]

    A DLT pipeline leveraging a LinkedIn's 'hidden' Voyager API to retrieve job and company data.
    • Built on DLT which provides a UI for viewing pipeline status, exploring data
    • Custom DLT source automatically handles REST requests, pagination, data extraction and relational DB storage
    • Predefined endpoints/available datasets
      • `get_companies`: scrape followed companies via GraphQL profile components
      • `get_job_urls`: fetch job cards per company
      • `get_descriptions`: fetch job descriptions and details
      • crawler
    • Extensible, with additional resources configured via json

    Pinned Loading

    1. canopyHydrodynamics canopyHydrodynamics Public

      A productionalized library supporting NSF funded research initiatives. Utilities for quantifying and visualizing water distribution within tree canopies

      Python 2 1

    2. pyQSM pyQSM Public

      (WIP) A python library for processessing TLS lidar scans; producing triangular and tetra-meshes for the application of raytracing and prediction of environmental conditions

      Python

    3. PenguaLab-IaC PenguaLab-IaC Public

      IaC that defines the entirety of my k3s based 'homelab'! Utilizes Terraform, Ansible, Docker, Rancher to configure a k3s cluster.

      HCL

    4. archiveteam-digitalocean-IaC archiveteam-digitalocean-IaC Public

      Shell

    5. ml_ops_tree_learn ml_ops_tree_learn Public

      OpenTofu (Terraform) and Ansible IaC for the creation of a gpu droplet in Digital Ocean, the installation of apt and pip prerequisites and the running TreeLearn's UNet pipeline. Enables users to se…

      Python

    6. linkedInScraper linkedInScraper Public

      Reverse engineering the Voyager API *for educational purposes*

      Python 3