Web Scraper with Beautiful Soup

This is my first project to portfolio, showing web scraper implemented using Beautiful Soup, a Python library for parsing HTML and XML documents.

Description

This is my first portfolio project showing Python Codings skills. This web scraper is designed to extract recipes from the Bianca Zapatka Blog

The app task is to scrape recipes from Culinary Blog and save them to JSON file.

Used libraries

Beautiful Soup 4 - Well known web scraper which allows parse HTML files and extract desired data.

tqdm - Python library responsible for displaying progress bars

FastAPI - Modern, fast (high-performance), web framework for building APIs based on standard Python type hints.

Features

Fetches recipes content from a Bianca Zapatka blog
Parses the HTML using Beautiful Soup
Extracts data based on specified CSS selectors
Saves extracted data to JSON file for further processing
Serves the extracted data via FastAPI

Planned features

Sending Slack / email notification when the new recipes are found
Adding nutrition information to the recipes using external API

Demo

Make sure you have the following installed on your system:

Docker
Docker compose

Usage

Clone the repository
Build the stack
- Run docker-compose build
Run the stack
- Run docker-compose up

Name		Name	Last commit message	Last commit date
Latest commit History 213 Commits
.github/workflows		.github/workflows
api		api
config		config
json_files		json_files
scraper		scraper
tests		tests
utilities		utilities
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.sourcery.yaml		.sourcery.yaml
LICENCE		LICENCE
README.md		README.md
__init__.py		__init__.py
docker-compose.yml		docker-compose.yml
main.py		main.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Scraper with Beautiful Soup

Description

Used libraries

Features

Planned features

Demo

Prerequisites

Usage

About

Releases

Packages

Contributors 2

Languages

License

bartczak-pa/WebScraper

Folders and files

Latest commit

History

Repository files navigation

Web Scraper with Beautiful Soup

Description

Used libraries

Features

Planned features

Demo

Prerequisites

Usage

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages