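# Assumes `soup` (a parsed BeautifulSoup document) and the sets `a` and `b`
# are defined earlier; that part is not included in this paste.
for link in soup.find_all('a'):
    if link.has_attr('href'):
        s = link.get('href')
        # Keep site-relative paths starting with '/w' that contain no ':'.
        # The '/w' prefix check already excludes '#' fragment links.
        if s.startswith('/w') and ':' not in s:
            b.add(s)
# Number of links present in both sets.
print(len(a & b))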