Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View dimitris-markopoulos's full-sized avatar
🎃
Focusing
🎃
Focusing

Block or report dimitris-markopoulos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Profile views Python VSCode

Dimitris Markopoulos' Github

Machine Learning • Quantitative Finance

🎓 M.A. in Statistics at Columbia University (4.19/4.33)
🎓 B.S. in Applied Mathematics & Statistics from Stony Brook University (3.99/4.0)


Project Summary
self-instruct-gpt4mini Current working project
xrt-trading-case-study Built and backtested deviation- and volatility-based trading strategies on XRT, analyzing hit rates, Sharpe ratios, and robustness, concluding no persistent alpha after risk adjustments.
mnist-image-classification Comparing Lasso, Naive Bayes, Ridge, SVM, and Group Lasso
tcga-brca-analysis Unsupervised analysis of TCGA Breast Cancer (BRCA) gene-expression data
trees-ensembles-neural-networks Decision trees, boosting, and neural networks on UCI income data with tuning, overfitting analysis, and feature interpretation.
algorithmic-trading Systematic trading pipelines using ML and time-series cross-validation
latent-semantic-clustering UMAP + EM-GMM clustering of book chapters via NLP frequency vectors
quantitative-finance BSM & Heston option pricing, Monte Carlo simulations, VaR, etc
crime-predictor-analysis Predicting crime using UCI community features; LASSO, Ridge, Elastic Net, kernel regression + manually implemented CV
sepsis-prediction Cleaned & merged using SQL, then applied ML pipeline to CUMC + NYP secure patient-level dataset; HIPAA-compliant experiments using Azure Secure Environment; certified.


Supervised Learning & Statistical Modeling: LASSO, Ridge, Elastic Net, Logistic Regression, LDA, ARIMA, Group Lasso, etc
Dimensionality Reduction & Feature Analysis: PCA, UMAP, t-SNE, Spectral Embedding, MDS, NMF, Kernel PCA
Unsupervised Learning & Clustering: KMeans++, Gaussian Mixture Models (GMM), Spectral Clustering, Hierarchical Clustering

Languages: Python, SQL, HTML, R, MATLAB
Libraries: PyTorch, TensorFlow, scikit-learn, XGBoost, Numpy, Pandas, Statsmodels
Visualization: Plotly, Matplotlib, Seaborn, Streamlit
Workflow: VSCode, Git/GitHub, Google Colab (for GPU compute), LaTeX
Infra: Azure, APIs, GitHub Actions

🌐 Connect With Me
LinkedIn
GitHub
[email protected]

GitHub followers


"Averaged over all possible data-generating distributions, every classification algorithm has the same error rate."
— David H. Wolpert, No Free Lunch Theorems for Optimization

Pinned Loading

  1. xrt-trading-case-study xrt-trading-case-study Public

    This project explores a systematic trading case study on the SPDR S&P Retail ETF (XRT).

    Python

  2. mnist-image-classification mnist-image-classification Public

    Comparative analysis of MNIST digit classification and dimensionality reduction methods. Part 1: supervised models (Logistic, SVM, LDA, Group LASSO). Part 2: unsupervised nonlinear embeddings (UMAP…

    Jupyter Notebook

  3. quantitative-finance quantitative-finance Public

    A collection of quantitative finance projects covering option pricing, risk analysis, volatility modeling, and investment strategies. Includes Monte Carlo simulations, Black-Scholes & Heston models…

    Jupyter Notebook 2 1

  4. latent-semantic-clustering latent-semantic-clustering Public

    Clustering book chapters with unsupervised ML—custom EM-GMM, sklearn baselines, and dimensionality reduction.

    Jupyter Notebook

  5. trees-ensembles-neural-networks trees-ensembles-neural-networks Public

    Machine learning models including decision trees, random forests, adaboost, gradient boosting, and neural networks applied to structured data for classification tasks.

    Jupyter Notebook

  6. crime-predictor-analysis crime-predictor-analysis Public

    Predicting violent crime rates using high-dimensional community data from the UCI dataset. Implements a structured machine learning pipeline with extensive preprocessing, multiple feature selection…

    Jupyter Notebook