Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Index Checker
#1
I'm using Google index checker and would like to know what it actually checks for.
I tested with a list of 35 of my websites, in all instances the home page and with correct http or https. and www or not www,
I only get 12 indexed, and while some are indeed not indexed, at least 80% are in reality when checking by hand on Google.

What's the search string that's used?

PS I get 18 indexed on Yahoo, and 14 on Bing, and I never tried to index on either one of them.
Reply
#2
If you are using the latest version of scrapebox it is using

site:domain.com

to verified if something is indexed or not.
Reply
#3
Can't be that, can it? at least it would have to be site:domain.com/the_whole_url
Still, why do I get so many NOes.
Anybody willing to check a list on their Scrapebox? Where I constantly get No, whereas a site:... search on Google of course returns results.
Reply
#4
Just tested this with and without trailing slashes, with and without http(s):// and I get different results.
And there are some domains that always get a NO, no matter how I test them, while they get always found when I do a site:xxx on Google, whether with or without /, https etc.
Some other feedback gets interpreted as NO, imho.
Anyway, not reliable at all for me at the moment
Reply
#5
sure post a few urls here and what you get.

According to google, and basically the rest of the world to, all of these are 100% unique as far as being indexed by google, and each one of them can serve up different results, so they are all different urls basically

http://domain.com
http://www.domain.com
https://domain.com
https://www.domain.com
http://domain.com/
http://www.domain.com/
https://domain.com/
https://www.domain.com/

So 8 urls that may say are all the same, because they all resolve to the same content, but to google that is 8 different urls and they will index or not index each one individually.
Reply
#6
i understand that (except http and https are now combined, they changed that about a month ago)
OK, here is an example -- my personal "business card" page:

In google it gives the exact same result for:
site:econsult.pl
site:https://econsult.pl
site:http://econsult.pl/
etc.
and a slightly different one for
site:www.econsult.pl etc.
(but also indexed for all variants)

On Index checker
https://econsult.pl and http://econsult.pl are not found, econsult.pl and https://econsult.pl/ are found. Similar inconsistencies with www

Here's the export:
Url,Indexed
https://econsult.pl,NO
http://econsult.pl,NO
econsult.pl,YES
https://econsult.pl/,YES
www.econsult.pl,YES
http://www.econsult.pl,YES
https://www.econsult.pl,NO
www.econsult.pl/,NO
http://www.econsult.pl/,NO
https://www.econsult.pl/,NO

And I now I've updated to I get more YESs but still a few NOs...
Bing and Yahoo have varied, too.

Let me also add that the YES/NOes are consistent from one run to the next, so it's not likely a proxy issue (as they would be rotated, no? I have plenty)
Reply
#7
What happens when you use no proxy and just your local IP?
Reply
#8
Interesting, then they're all indexed.
Could that be different google versions due to different IP countries?
I'll change to only US proxies and have a look,
(I have to use proxies, or else the VPS IP will burn in no time with "site" queries...)
Reply
#9
Yes, that was indeed the source of the problem. If I only take Tier-1 proxies (US, UK, Germany, France) all indexing shows. If I add Thailand, Korea, etc. I get mixed results.

The consistency of the results, though, seems to indicate that Scrapebox doesn't use proxies randomly, but always in the same sequence?

Anyway, many thanks for you your help. Loopline
Reply
#10
Different features of scrapebox use proxies differently, some random some in sequence and yes I think this one is in sequence.

Also there are different google data centers and yes some fully index things and some only partially will show certain things as indexed, it just depends on their algorithm.

So you just want to be index checking with an ip that is in the region where your traffic would be coming from.
Reply




Users browsing this thread: 1 Guest(s)

Looplines Scrapebox List