- San Francisco, CA
- http://kathyqian.com
Stars
A list of blogs, videos, and other content that provides advice on building experimentation and A/B testing platforms
Blazingly fast cleaning swear words (and their leetspeak) in strings
Utils for streaming large files (S3, HDFS, gzip, bz2...)
📛 Fuzzy Name Matching with Machine Learning
Python Causal Impact Implementation Based on Google's R Package. Built using TensorFlow Probability.
An experiment to standardize individual donor names in campaign finance data using simple graph theory and machine learning.
🎠A carousel component for Vue.js
A pure vue native horizontal list implementation for mobile/touch and responsive web.
Uplift modeling and causal inference with machine learning algorithms
List of newsrooms around the world that are using software engineering, data science, osint, and various tech to elevate reporting.
An automated, programming-free web scraper for interactive sites
Zip code boundaries for each of the 50 states
Python function caching that prevents re-entrant calls
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
Cytoscape.js based network visualizer for Jupyter Notebook
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
The first RESTful API for the Federal Election Commission. We're aiming to make campaign finance more accessible for journalists, academics, developers, and other transparency seekers.
An implementation of figlet written in Python
A Python module for common interactive command line user interfaces
TinyDB is a lightweight document oriented database optimized for your happiness :)
fast python port of arc90's readability tool, updated to match latest readability.js!
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs: