Load WordPress Sites in as fast as 37ms!

Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Harvester not working
#1
I am getting zero results from Google when I scrape.
I am getting the same url from Bing multiple times. Url = choice.live.com

I harvest proxies from scrapebox resources, and use only anonymous google passed.

I feel like there is some kind of bug with my scrapebox, or for some reason the proxies are not working.

Any suggestions?
Would I see better results with paid proxies?
What sites should I use to buy proxies?

Right now my scrapebox is useless.
Reply
#2
I tried to pm you back and it did not work, says your pms are disabled. Anyway, try this:

Go to settings >> use multi threaded harvester - and uncheck this. Also under that same settings menu uncheck the use custom harvester. Then try and harvest, there will be a status column. What errors are you getting in the status column?
Here is a video that goes over how to do this and other troubleshooting for the harvester that might be helpful. http://youtu.be/2QaLWgTXsRo

You should also use the custom harvester for bing/yahoo. Go go to settings >> custom harvester settings >> import/export >> update the latest engine file. Then you will have a working engine file. Then just use the custom harvester:
https://www.youtube.com/watch?v=SwaLo10ck9U

Your queries are probably banned by google, even though they are passing the google test. I explain here:
https://www.youtube.com/watch?v=P9CbGhfc1aY
Reply
#3
Hi,

Thank you very much for helping me out!
I am getting 302 errors (ip blocked), 404 errors...etc. Only two proxies worked out of my list. I'm leaning towards proxy issues.

I've attached a screen shot below.

I updated my engines, and will test Bing/Yahoo after I watch the videos you shared.

Where should I go to buy proxy's that will help improve my results with scrapebox?


(07-12-2014, 10:45 PM)loopline Wrote: I tried to pm you back and it did not work, says your pms are disabled. Anyway, try this:

Go to settings >> use multi threaded harvester - and uncheck this. Also under that same settings menu uncheck the use custom harvester. Then try and harvest, there will be a status column. What errors are you getting in the status column?
Here is a video that goes over how to do this and other troubleshooting for the harvester that might be helpful. http://youtu.be/2QaLWgTXsRo

You should also use the custom harvester for bing/yahoo. Go go to settings >> custom harvester settings >> import/export >> update the latest engine file. Then you will have a working engine file. Then just use the custom harvester:
https://www.youtube.com/watch?v=SwaLo10ck9U

Your queries are probably banned by google, even though they are passing the google test. I explain here:
https://www.youtube.com/watch?v=P9CbGhfc1aY


Attached Files
.docx   Screenshot.docx (Size: 229.06 KB / Downloads: 0)
Reply
#4
Yes that would be proxy issues. I recommend and use these proxy providers:

http://scrapeboxfaq.com/scrapebox-proxies

Also this might be helpful
https://www.youtube.com/watch?v=Ri6cci288vk
Reply
#5
I've been using Buy Proxies for the last 2 days (in Loopline's link above) and they've been great. I bought 30 shared proxies. There was a little hiccup with my first set and they died after 1 hour and they said it was because I was using too many keywords (which I disagree). I was only using 1 connection with 30 IPs. They supplied me with a fresh list of 30 and I've been scraping today for the past 5 hours (about 78,000 results so far) without interruption.

Their customer service responded in minutes! It was awesome. In case you're curious about the issue I had, then continue reading. The information might be helpful for you in the future. Loopline can correct any assumptions that I might have wrong.

The issue might have been my fault, I was using the single-thread harvester, and I think it might have been cycling through the IPs in sequential order, and Google might have caught on. They were non-sequential IPs, but some were really close in sequence. Pair that up with similar looking searches across close IPs and they can put 2 and 2 together I'd imagine.

Anyways, what I did to fix this was used the multi-threaded harvester with only 1 connection with the new IPs and it is chugging along perfectly. I'm guessing that the multi-threaded chooses the IP to use at random. My first assumption for the issue was that Buy Proxies might have paired me up with a more aggressive user and they got the proxies banned, but they tried to indicate that it was my fault. I'm not so perfect that I can't screw up which is why I came up with my alternate theory above.
Reply
#6
How much did it cost you for the 30 shared proxies?
How long will they last?
Are you able to use custom footprints?



(08-27-2014, 04:23 PM)ExoPress Wrote: I've been using Buy Proxies for the last 2 days (in Loopline's link above) and they've been great. I bought 30 shared proxies. There was a little hiccup with my first set and they died after 1 hour and they said it was because I was using too many keywords (which I disagree). I was only using 1 connection with 30 IPs. They supplied me with a fresh list of 30 and I've been scraping today for the past 5 hours (about 78,000 results so far) without interruption.

Their customer service responded in minutes! It was awesome. In case you're curious about the issue I had, then continue reading. The information might be helpful for you in the future. Loopline can correct any assumptions that I might have wrong.

The issue might have been my fault, I was using the single-thread harvester, and I think it might have been cycling through the IPs in sequential order, and Google might have caught on. They were non-sequential IPs, but some were really close in sequence. Pair that up with similar looking searches across close IPs and they can put 2 and 2 together I'd imagine.

Anyways, what I did to fix this was used the multi-threaded harvester with only 1 connection with the new IPs and it is chugging along perfectly. I'm guessing that the multi-threaded chooses the IP to use at random. My first assumption for the issue was that Buy Proxies might have paired me up with a more aggressive user and they got the proxies banned, but they tried to indicate that it was my fault. I'm not so perfect that I can't screw up which is why I came up with my alternate theory above.
Reply
#7
30 shared proxies from buyproxies.org is pretty cheap, $25. That lasts a month.

Sure you can use custom footprints, but you would only want to use 1 connection if your using advanced operators and 2-3 if your using basic keywords.
Reply
#8
Thank you

How does it work.
I enter a visa number and they email me the proxies?
Or is it a download file?

(08-29-2014, 05:36 PM)loopline Wrote: 30 shared proxies from buyproxies.org is pretty cheap, $25. That lasts a month.

Sure you can use custom footprints, but you would only want to use 1 connection if your using advanced operators and 2-3 if your using basic keywords.
Reply
#9
Yeah, I think they take visa or I know they take paypal. Then you get an account on their site with a control panel. Just login and they will have your proxies listed. I think they also mail them to you.
Reply
#10
@loopline Can you tell me where in version 2.0.0.61 is option with "use multi threaded harvester"?
Reply




Users browsing this thread: 1 Guest(s)

Looplines Scrapebox List