I'm running this spider on a site. It works fine, but one problem I'm running into is that there are a number of hrefs with "#" as the link.
How can I skip or drop those # links altogether? I'm outputting the links to a file currently, and using the lstrip
dumps "" to the file. I've also tried i.replace
, but it still drops a blank line in the file.