04-26-2012, 07:26 AM
You must also remember that your footprint plays a massive part in the quality of your results.
Just scraping with "Wordpress" selected and using a footprint like "leave a comment" will NOT get you a good result. Think about what you want to achieve in your scrape session, and build your footprint around that. For example, if I was looking for wordpress blogs to comment on that were targeted around the puppy training niche, I would build a custom footprint like this:
"Powered by wordpress" "puppy training" "leave a comment" -"you must be logged in" -captcha -commentluv
This will bring me back blogs related to puppy training that allow me to leave a comment, that don't require me to be logged in, that don't have captcha enabled, and that aren't protected by the commentluv plugin - so my overall success rate in finding AA blogs will be higher that if I was scraping with just "leave a comment" and having wordpress selected.
Just scraping with "Wordpress" selected and using a footprint like "leave a comment" will NOT get you a good result. Think about what you want to achieve in your scrape session, and build your footprint around that. For example, if I was looking for wordpress blogs to comment on that were targeted around the puppy training niche, I would build a custom footprint like this:
"Powered by wordpress" "puppy training" "leave a comment" -"you must be logged in" -captcha -commentluv
This will bring me back blogs related to puppy training that allow me to leave a comment, that don't require me to be logged in, that don't have captcha enabled, and that aren't protected by the commentluv plugin - so my overall success rate in finding AA blogs will be higher that if I was scraping with just "leave a comment" and having wordpress selected.