yfhsu

Code for our paper "ModelObfuscator: Obfuscating Model Information to Protect Deployed ML-Based Systems", published at ISSTA'23

C++ 19 7 Updated May 18, 2024

Repository for sample controller. Complements sample-apiserver

Go 3,463 1,200 Updated Feb 12, 2026

Bootstrap Kubernetes the hard way on Vagrant on Local Machine. No scripts.

Shell 5,097 4,820 Updated Nov 17, 2025

source provider for parquet-go

Go 109 88 Updated Oct 21, 2024

cuDF - GPU DataFrame Library

C++ 9,485 1,009 Updated Feb 13, 2026

k-Nearest Neighbors algorithm on Spark

Scala 240 107 Updated Nov 14, 2023

A system for quickly generating training data with weak supervision

Python 5,937 854 Updated May 2, 2024

The Google Cloud Developer's Cheat Sheet

8,207 1,912 Updated Apr 6, 2024

JWT support for Scala. Bonus extensions for Play, Play JSON, Json4s, Circe, uPickle, Spray and Argonaut

Scala 677 143 Updated Feb 9, 2026

The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020

Jupyter Notebook 606 68 Updated Jun 4, 2020
Python 25 6 Updated Dec 8, 2022

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Jupyter Notebook 2,694 732 Updated Feb 5, 2026

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Jupyter Notebook 29,847 13,246 Updated Jun 13, 2024

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,677 1,403 Updated Jan 28, 2026

Step-by-step Deep Learning Tutorials on Apache Spark using BigDL

Jupyter Notebook 210 124 Updated Jan 3, 2023

Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine-tuning. State-of-the-art performance on 3 biomedical datasets

Python 78 19 Updated Jul 16, 2022

1 line for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.

Python 958 138 Updated Jan 28, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,154 6,658 Updated Sep 30, 2025

State-of-the-Art Text Embeddings

Python 18,249 2,762 Updated Feb 9, 2026

A library for efficient similarity search and clustering of dense vectors.

C++ 39,087 4,232 Updated Feb 13, 2026

Public runnable examples of using John Snow Labs' NLP for Apache Spark.

Jupyter Notebook 1,076 617 Updated Feb 6, 2026

notes about machine learning

HTML 3,304 900 Updated Nov 22, 2021

State of the Art Natural Language Processing

Scala 4,108 740 Updated Feb 12, 2026

This is a CoNLL formatted version of the OntoNotes 5.0 release.

189 99 Updated Jan 13, 2015

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Java 10,044 2,719 Updated Feb 10, 2026

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Python 6,943 934 Updated Feb 13, 2026

Models and Pipelines for the Spark NLP library

Jupyter Notebook 113 44 Updated Aug 12, 2021

In this notebook, we build an abstractive text summarizer from scratch in Python, using deep learning with Keras

Jupyter Notebook 209 233 Updated Apr 1, 2022

Open source annotation tool for machine learning practitioners.

Python 10,542 1,830 Updated Feb 11, 2026