**ambromfg** · 02-03-2020, 04:34 PM

I have a source that I need to crawl where the websites are listed like this: "website.com". There is no www, no http://, etc., and they are not actual links, just plain text. I would like to crawl the site and capture all of these website addresses even though they are plain text. Is there a way to do that?

**loopline** · 02-04-2020, 11:00 PM

Sure. First use the link extractor to crawl the site:

https://www.youtube.com/watch?v=Ed3SGP_ch3Q

Then use the custom data scraper to capture the plain-text addresses from the crawled pages:

https://www.youtube.com/watch?v=X3Ep-NXg4kY
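
For anyone who wants to see the general idea behind pulling bare addresses like "website.com" out of plain text, here is a minimal Python sketch. It is not how the tools in the videos work internally; the pattern `DOMAIN_RE` and the helper `extract_domains` are hypothetical names made up for illustration, and the regex is only a rough assumption about what counts as a domain (it will also match things like "file.txt", so filter the results against a list of real TLDs if that matters).

```python
import re

# Hypothetical pattern (an assumption, not taken from any tool):
# one or more dot-separated labels followed by an alphabetic TLD.
# The lookbehind skips email addresses and the middle of longer tokens.
DOMAIN_RE = re.compile(
    r"(?<![\w@.-])"                               # not preceded by word char, @, dot or hyphen
    r"(?:[a-z0-9](?:[a-z0-9-]{0,61}[a-z0-9])?\.)+"  # labels such as "shop." or "example."
    r"[a-z]{2,24}"                                # TLD such as "com" or "uk"
    r"(?![\w-])",                                 # not followed by more word characters
    re.IGNORECASE,
)


def extract_domains(text):
    """Return the unique bare domains found in text, lowercased, in order of appearance."""
    found = []
    for match in DOMAIN_RE.finditer(text):
        domain = match.group(0).lower()
        if domain not in found:
            found.append(domain)
    return found


if __name__ == "__main__":
    sample = "Suppliers: website.com, Example.org and shop.example.co.uk. Email bob@mail.net for info."
    for domain in extract_domains(sample):
        print(domain)
```

You would feed this the text of each page you crawled, then add "http://" in front of each result if you need full URLs.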
