Scrape all products from the website www.plazavea.com.pe (Libreria y Oficina)
Download and install on your local machine:
- Chrome download page
- ChromeDriver binary: download it and copy it into this folder
Software:
Google Chrome (v92)
ChromeDriver (v92)
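Chrome and ChromeDriver are both listed at v92 because the driver generally needs to match the browser's major version. Below is a minimal sketch of a check for that, not part of this project. The helper names are hypothetical, and the commands in the usage comment (`google-chrome`, `./chromedriver`) are assumptions about how the binaries are named on your machine.

```python
import re
import subprocess


def major_version(version_output: str) -> int:
    """Extract the major version from text like 'Google Chrome 92.0.4515.107'."""
    match = re.search(r"(\d+)\.\d+", version_output)
    if match is None:
        raise ValueError(f"no version number found in {version_output!r}")
    return int(match.group(1))


def versions_match(chrome_output: str, driver_output: str) -> bool:
    """Return True when both version strings share the same major version."""
    return major_version(chrome_output) == major_version(driver_output)


def read_version(command: list) -> str:
    """Run a '--version' style command and return what it printed."""
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return result.stdout.strip()


# Usage (binary names are assumptions, adjust them to your system):
#   chrome = read_version(["google-chrome", "--version"])
#   driver = read_version(["./chromedriver", "--version"])
#   print("OK" if versions_match(chrome, driver) else "Version mismatch")
```

Only the major number is compared here, since the minor and build numbers of the two binaries usually differ.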
Packages:
python 3
pip
virtualenv
Once inside the folder, run the following commands:
virtualenv -p python3 virtualenv
source virtualenv/bin/activate
pip install -r requirements.txt # install all packages
#01. Get all categories (this generates the file: 01.categories)
python3 main5.py
#02. Get category and subcategories
python3 main6.py
#03. Get all products
python3 main7.py
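The three scripts depend on each other's output, so they have to run in this order. As a convenience, they could be chained from one small runner that stops at the first failure. This is only a sketch: `run_steps` is a hypothetical helper, not something shipped with the project, and it assumes the scripts sit in the current folder and that the virtualenv is already active.

```python
import subprocess
import sys

# Script names taken from the steps above, in execution order.
STEPS = ["main5.py", "main6.py", "main7.py"]


def run_steps(scripts, run=subprocess.run):
    """Run each script in order and stop at the first non-zero exit code.

    Returns the list of scripts that finished successfully.
    The `run` parameter exists only so the function can be tested
    without launching real processes.
    """
    completed = []
    for script in scripts:
        # sys.executable is the interpreter of the active virtualenv.
        result = run([sys.executable, script])
        if result.returncode != 0:
            print(f"{script} failed with exit code {result.returncode}")
            break
        completed.append(script)
    return completed


# Usage:
#   finished = run_steps(STEPS)
#   print("Completed:", finished)
```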
The result is saved in the file 03.categories.json, which contains all the products.
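To take a quick look at the result, the file can be loaded with Python's standard `json` module. The sketch below makes no assumption about the layout of 03.categories.json (it is not documented here), so it only reports the top-level shape. The function names are hypothetical.

```python
import json
from pathlib import Path


def load_result(path="03.categories.json"):
    """Read the scraper output and return the parsed JSON value."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def summarize(data) -> str:
    """Describe the top-level shape of the parsed JSON without assuming a schema."""
    if isinstance(data, list):
        return f"list with {len(data)} entries"
    if isinstance(data, dict):
        return f"object with {len(data)} keys"
    return f"single {type(data).__name__} value"


# Usage (after running the three steps):
#   print(summarize(load_result()))
```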
Source: scrapingbee