Scrap

Scrap

Beautiful Soup

Generate bs4

1
soup = bs4.BeautifulSoup(text)

find node

1
2
3
soup.find("div", attrs={"id":"..."})
soup.find_all("...")
soup.div

siblings

1
2
3
4
soup.find("...").next_sibling
soup.find("...").previous_sibling
for elm in soup.div.next_siblings:
...

with Regular Expression

1
2
soup(class_=re.compile('item-'))
soup(attrs={"class":re.compile("star-rating.*")})[0].get("class")[1]
Author

Chen Yulin

Posted on

2023-05-10

Updated on

2024-05-15

Licensed under

Comments