Goal: what we will make by the end of the day
- Learn how to use beautifulsoup for web scraping
- Review HTML code to abstract those messages what we interested
- BeautifulSoup is a python module to help us to implement user develop web site
- Target to scarp movie web then we can make a movie list for MUST watch
Parsing HTML and Making Soup
Beautiful Soup Documentation - Beautiful Soup 4.9.0 documentation
- HTML and XML are structure language they are responsible for web data structure
- read data from a file
from bs4 import BeautifulSoup
with open("website.html") as file:
contents = file.read()
soup = BeautifulSoup(contents, "html.parser")
- HTML parser in BeautifulSoup


BeautifulSoup中各種html解析器的比較及使用
- read HTML content from Angela’s material