A Node.js + Puppeteer project that scrapes product information from Plaisio.gr, using a list of product URLs provided in a .txt file. It extracts product metadata such as title, price, availability, category, and brand.
Here's what's included in the repository:
Main scraping script using Puppeteer:
- Reads URLs from
inputURLS.txt - Navigates to each product page
- Scrapes key product data using CSS selectors
- Writes the output to
productData.json
Plain text file with one product URL per line. Example:
The final scraped data output — an array of objects like:
[
{
"Τίτλος": "@Work Αυτοκόλλητα Χαρτάκια Neon 75x75mm 400φύλλα",
"Τιμή": "2,19 €",
"Διαθεσιμότητα": "Διαθέσιμο σε καταστήματα",
"Κατηγορία": "Post It - Χαρτάκια",
"Brand": "@Work"
},
...
]Each product includes the following fields:
Τίτλος (Title)
Τιμή (Price)
Διαθεσιμότητα (Availability)
Κατηγορία (Category) – from breadcrumbs
Brand – parsed as the first word of the title
1️⃣ Install Dependencies
Αντιγραφή Επεξεργασία npm install puppeteer 2️⃣ Make sure inputURLS.txt exists
Add one product URL per line.
3️⃣ Run the scraper
Αντιγραφή Επεξεργασία node scrapePlaisio.js 4️⃣ Output
Results are saved in productData.json.