Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

prepare_data文件中的max_cut.py完全没用,可以删除! #57

Open
Peace-And-War opened this issue Jul 27, 2022 · 0 comments
Open

Comments

@Peace-And-War
Copy link

max_cut.py主要是为了解决爬取数据中的"basic_info"中的"并发症"数据连在一起无法分离的问题。但问题应该出在data_spider.py中,而不是应该用max_cut.py使问题复杂化。作者使用的max_cut.py增加了操作步骤而且错误率不低。在data_spider.py,加入accompany = selector.xpath('//div[@Class="mt20 articl-know" and position() = 2]/p/span/a/text()')就可以完美解决问题,整个max_cut.py就不需要了。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant