ScrapeBox Forum
Harvesting stops after few keywords


Harvesting stops after few keywords - Charliepug - 05-16-2015

Hi there! I used ScrapeBox a few times some months ago and everything ran smoothly. Now I'm back in business and need to do some scraping, so I got a VPS and 10 private proxies from Squidproxies.

Strangely, even if I load 800 keywords, harvesting stops ("harvesting completed") after scraping just 2 to 12 keywords.

It doesn't even show the usual red color on the failed ones.

[Image: LsoFkES.jpg]

Some hints:

1) I disabled the multithreaded harvester and set a high delay between connections.
2) Squidproxies are being blocked A LOT, but they do manage to harvest something:

[Image: pHBSFqv.jpg]

Sometimes, however, it shows a keyword as "completed" with 0 results even though I know for certain that there are plenty of results for it.
3) I'm using a fairly weak VPS with very little free disk space (around 800 MB).
4) I'm using advanced operators as keywords.
5) I'm using Spanish words with accents (who knows, maybe that affects it).
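
On hints 4 and 5: a minimal sketch of how a keyword that combines an advanced operator with an accented Spanish word would be percent-encoded as UTF-8 for a search query string. This only illustrates standard URL encoding in Python; it is not how ScrapeBox builds its requests, and the example keyword and the `encode_keyword` helper are made up for illustration.

```python
from urllib.parse import quote_plus, unquote_plus


def encode_keyword(keyword: str) -> str:
    """Percent-encode a search keyword as UTF-8 for use in a query string."""
    return quote_plus(keyword, encoding="utf-8")


# Hypothetical keyword: an advanced operator plus an accented Spanish word.
keyword = 'inurl:blog "canción"'
encoded = encode_keyword(keyword)

print(encoded)
# Decoding the encoded form should give back the original keyword unchanged.
print(unquote_plus(encoded) == keyword)
```

If the accented characters survive a round trip like this, the encoding itself is unlikely to be what drops the results.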

I know that most ScrapeBox issues are due to poor-quality proxies, but this time I have private proxies, which is why I don't understand what's going on...

Does anyone know what could be the problem?

Thanks!


RE: Harvesting stops after few keywords - loopline - 05-16-2015

Well, it's almost certainly the proxies. In that version of ScrapeBox I think it removes dead proxies, so when it encounters a 403, 302, etc., it removes the proxy from the grid, and when it runs out of proxies it quits.

Try V2.

http://www.scrapebox.com/v2-beta

You can use the detailed harvester for a delay. All in all, it's a much better harvesting experience: aside from being faster, it's more robust and highly optimized. I have a harvesting video here:
https://www.youtube.com/watch?v=XsHbSIUA4ho&list=PLd2GyqDU6SSwBGVROdFI9jS4WPTODl-vB&index=3
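
The behavior described above (a proxy is dropped when it returns a 403 or 302, and the run ends once no proxies remain) can be sketched as a small simulation. This is only a toy model of that description, not ScrapeBox's actual code: `harvest`, `make_fetch`, the status set, and the "each proxy gets blocked after one request" rule are all assumptions made for illustration.

```python
from collections import deque

# Statuses treated as "proxy is blocked"; assumed from the reply above.
BLOCKED_STATUSES = {302, 403}


def harvest(keywords, proxies, fetch):
    """Return (completed_keywords, remaining_proxies).

    fetch(keyword, proxy) is a hypothetical callback returning an HTTP status.
    """
    pool = deque(proxies)
    completed = []
    for keyword in keywords:
        while pool:
            proxy = pool[0]
            status = fetch(keyword, proxy)
            if status in BLOCKED_STATUSES:
                pool.popleft()  # drop the blocked proxy for the rest of the run
                continue
            completed.append(keyword)
            pool.rotate(-1)  # move on to the next proxy for the next keyword
            break
        if not pool:
            break  # no proxies left, so the run ends early
    return completed, list(pool)


def make_fetch(limit):
    """Fake fetcher: each proxy answers 200 `limit` times, then 403."""
    counts = {}

    def fetch(keyword, proxy):
        counts[proxy] = counts.get(proxy, 0) + 1
        return 200 if counts[proxy] <= limit else 403

    return fetch


keywords = [f"kw{i}" for i in range(800)]
proxies = [f"proxy{i}" for i in range(10)]
done, left = harvest(keywords, proxies, make_fetch(limit=1))
print(len(done), "keywords completed,", len(left), "proxies left")
```

With these made-up numbers the run stops after 10 of the 800 keywords and with no proxies left, which resembles the symptom in the first post: the harvest ends as "completed" after only a handful of keywords.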


RE: Harvesting stops after few keywords - Charliepug - 05-16-2015

Yeah, I'm going to ask for replacement proxies and give that 2.0 version a try.

Thanks a lot, Loopline!


RE: Harvesting stops after few keywords - loopline - 05-18-2015

You're welcome. Good luck.