- 
                  University of Waterloo
- Nearby data lake
- https://cs.uwaterloo.ca/~jimmylin/
- @lintool
- 
  cs451-2025f PublicCS 451/651: Data-Intensive Distributed Computing (Fall 2025) 
- 
  bigcows PublicScrapes citation statistics from Google Scholar 
- 
  
  
- 
  
  
- 
  
  
- 
  The Art and Science of Empirical Computer Science (Fall 2023) 8 UpdatedNov 20, 2023 
- 
  cs-big-cows PublicList of people with great achievements in Computer Science 
- 
  The Art and Science of Empirical Computer Science (Fall 2022) 21 UpdatedSep 1, 2023 
- 
  csranking-aica PublicVisualizations of top Canadian universities for AI research by CSRankings HTML UpdatedApr 10, 2022 
- 
  
  
- 
  IR-Reproducibility2 PublicThe Replicability of IR Replicability Experiments 
- 
  MapReduceAlgorithms PublicData-Intensive Text Processing with MapReduce 
- 
  
  
- 
  
- 
  
  
- 
  
  
- 
  
  
- 
  My data is bigger than your data! 
- 
  robust04-analysis PublicMeta-Analysis of Robust04 Papers (Yang et al., SIGIR 2019) 
- 
  
  
- 
  bigdata-2018f PublicCS 451/651 Data-Intensive Distribute Computing (Fall 2018) at the University of Waterloo 
- 
  UROC-projects PublicUndergraduate Research Opportunities Conference sponsored by the University of Waterloo 
- 
  bespin PublicReference implementations of data-intensive algorithms in MapReduce and Spark 
- 
  TS4 PublicForked from ylwang99/TS4Tweet Streaming Selective Search with Spark 
- 
  
  
- 
  
  
- 
  bigdata-2018w PublicCS 451/651 431/631 Data-Intensive Distribute Computing (Winter 2018) at the University of Waterloo 
- 
  
  
- 
  
  
- 
  c-bfscan PublicImplementations of brute force scans for document retrieval in C