The Blueprint Training

Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Extracting Links that are not links
#1
I have a source that I need to crawl where the websites are listed like this "website.com" There is no www, no http:// etc. And, they are not actually links, they are just text.  I would like to crawl the site and capture all of these website addresses, even though they are plain text.  Is there a way to do that?
Reply
#2
Sure, use the link extractor to crawl the sites

https://www.youtube.com/watch?v=Ed3SGP_ch3Q

and then use the custom data scraper to get the links

https://www.youtube.com/watch?v=X3Ep-NXg4kY
Reply




Users browsing this thread: 1 Guest(s)

Looplines Scrapebox List