07-13-2010, 04:31 PM
(06-07-2010, 03:46 PM)mild7 Wrote:(06-04-2010, 06:37 PM)bigprofits83 Wrote: anyone else having problems with the Inurl: custom foot print? all the other foot prints work fine, but the inurl: doesnt. I can pull results manually from google with this but not with scrape box..
For me it's harvesting blogs but it doesn't look accurate. If I use the custom footprint Inurl:.edu “powered by wordpress” “leave a comment” with some keywords, and it harvests blogs that aren't .edu blogs. Does this happen to everyone?
Use "site:edu" (no period needed before "edu") instead of "inurl:.edu".
The site function can be used to browse an entire domain extension. Google isn't as smart as it looks, it simply looks at the URL and anything before the first "/", it refers to as the website. So you can use it at any level you want, the domain extension (.edu), an entire site (school.edu), or even a sub-domain (calendar.school.com).
Any footprints that use the "inurl" function can be inaccurate. You can't control how sites organize their files, and often times the site will contain "edu" somewhere else in the url. For instance, if someone ran a WHOIS scan of an edu site and the WHOIS program caches the scan, it'l probably store it in a url like who.is/someschoolsite.edu, which would cause Google to show it in an "inurl:.edu" search, but not the "site:edu" since the "edu" portion of the url appears after the first "/" in the url.