- Create sample parquet files for article relevance and entity extraction
- Write a script to load and merge the 2 parquet files
- Update the saving process to update the parquet file instead
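
A minimal sketch of what the merge and save steps could look like, assuming pandas with a parquet engine such as pyarrow is available. The join key `article_id`, the helper names, and the one-row-per-article layout are all hypothetical; the real schema may differ. Since a parquet file cannot be appended to in place, the "update" here reads the existing file, merges in the new rows, and rewrites it.

```python
from pathlib import Path

import pandas as pd

KEY = "article_id"  # hypothetical join key; replace with the real identifier column


def merge_frames(relevance: pd.DataFrame, entities: pd.DataFrame, key: str = KEY) -> pd.DataFrame:
    """Outer-join the two tables so articles present in only one file are kept."""
    return relevance.merge(entities, on=key, how="outer")


def load_and_merge(relevance_path, entities_path, key: str = KEY) -> pd.DataFrame:
    """Load both parquet files and merge them on the shared key."""
    relevance = pd.read_parquet(relevance_path)
    entities = pd.read_parquet(entities_path)
    return merge_frames(relevance, entities, key)


def upsert_parquet(new_rows: pd.DataFrame, path, key: str = KEY) -> pd.DataFrame:
    """Update the parquet file at `path` with `new_rows` instead of replacing its contents.

    Rows whose key already exists in the file are overwritten by the new rows;
    rows with unseen keys are added.
    """
    path = Path(path)
    if path.exists():
        existing = pd.read_parquet(path)
        combined = pd.concat([existing, new_rows], ignore_index=True)
        # keep="last" lets the newly supplied rows win over the stored ones
        combined = combined.drop_duplicates(subset=key, keep="last")
    else:
        combined = new_rows
    combined.to_parquet(path, index=False)
    return combined
```

The outer join is a deliberate choice for sample data: it keeps articles that have a relevance score but no extracted entities (and the reverse), leaving the missing columns empty rather than dropping the row.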