ScrapeBox Forum
What am I doing wrong? - Printable Version

+- ScrapeBox Forum (https://www.scrapeboxforum.com)
+-- Forum: ScrapeBox Main Discussion (https://www.scrapeboxforum.com/Forum-scrapebox-main-discussion)
+--- Forum: Scrapebox Footprints (https://www.scrapeboxforum.com/Forum-scrapebox-footprints)
+--- Thread: What am I doing wrong? (/Thread-what-am-i-doing-wrong)



What am I doing wrong? - archkre - 09-09-2010

Hey there: I need to scrape all the pages of a given site.

Then I enter "site:mysite.com" in the harvester browser, > Custom footprint>Yahoo .

Then I enter "mysite.com" in the KWs window>Start Harvesting.

All the pages appear on the URLs Harvested window as required, but a note pops up: "xx domain(s) removed from the list. 97% of the harvested results have been removed. Maybe too many similar keywords have been used"

Any only the main domain remains.

What am I doing wrong?
Thank you


RE: What am I doing wrong? - hoyce - 12-18-2010

Under the options menu you probably have "automatically remove bla bla bla". Uncheck that box and you should be fine.


RE: What am I doing wrong? - s4nt0s - 12-23-2010

(09-09-2010, 02:47 PM)archkre Wrote: Hey there: I need to scrape all the pages of a given site.

Then I enter "site:mysite.com" in the harvester browser, > Custom footprint>Yahoo .

Then I enter "mysite.com" in the KWs window>Start Harvesting.

All the pages appear on the URLs Harvested window as required, but a note pops up: "xx domain(s) removed from the list. 97% of the harvested results have been removed. Maybe too many similar keywords have been used"

Any only the main domain remains.

What am I doing wrong?
Thank you

Yes hoyce is right. Go to options and uncheck, "automatically remove duplicate domains".

Also, the site: search operator won't give you all the websites pages. Only the ones that have been indexed by the search engines.

To find more pages go to the add ons menu at the top and use the, "link extractor". Make sure and check internal links button. That should give you a lot more results.