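Workflow:
Iterate: seed URL -> traverse the page's elements to collect hyperlinks -> use each link as a new seed
⚠️ Watch the request frequency and the traversal depth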
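The loop above can be sketched roughly as follows. This is only a minimal illustration, not the exact code from these notes: `crawl`, `LinkParser`, `fetch`, `max_depth`, and `delay` are hypothetical names, and link extraction uses the standard-library `html.parser` instead of BeautifulSoup so the sketch runs without third-party packages.

```python
import time
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkParser(HTMLParser):
    """Collects the href of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def crawl(seed_url, fetch, max_depth=2, delay=1.0):
    """Breadth-first crawl from seed_url; returns URLs in visit order.

    fetch is a hypothetical hook: any callable mapping a URL to HTML text.
    """
    visited = []
    seen = {seed_url}
    queue = deque([(seed_url, 0)])
    while queue:
        url, depth = queue.popleft()
        html = fetch(url)
        visited.append(url)
        time.sleep(delay)  # throttle: pause after every request
        if depth >= max_depth:
            continue  # depth limit reached: do not expand this page
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            link = urljoin(url, href)  # resolve relative links
            # Keep only http(s) links that have not been queued yet.
            if link.startswith(("http://", "https://")) and link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))  # link becomes a new seed
    return visited
```

With the `requests` library, `fetch` could be something like `lambda u: requests.get(u, headers=headers).text`. Passing the fetcher in as a parameter also makes it possible to try the loop on canned pages without touching the network.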
1. Set the URL and request parameters
headers = {"user-agent": "Baiduspider"}
base_url = "https://www.zhihu.com"
suffix = "/explore"
entry_url = base_url + suffix
2. Read the response and parse it
```
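import requests
import bs4

response = requests.get(entry_url, headers=headers).text  # HTML of the entry page
soup = bs4.BeautifulSoup(response, "lxml")  # parse with the lxml parser
page_set = soup.find_all("a")  # every <a> tag, i.e. the candidate links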