How would you scrape a sitemap URL with a LinkExtractor?
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<url>
<loc>http://www.example.com/</loc>
<lastmod>2005-01-01</lastmod>
<changefreq>monthly</changefreq>
<priority>0.8</priority>
</url>
</urlset>
Linkextractor will target the href attribute of an a tag.
<a href="http://mylink.com">MyLink</a>
How would you use LxmlLinkExtractor to target <url>
/<loc>
elements instead ?