09-16-2017, 08:35 AM
(09-15-2017, 11:27 PM)Did it more by accident that design. Used before_after in CDG to get a wider pool of text and then weedled it down and got the URL.Would be great if the CDG recorded the mask name / field name with the DATA grabbed. It would help to stitch it all back together in excel.Happy Scraping Wrote: Thats because there is no external link, here is the code
<a href="/record_click.asp?id=120635&ctype=profile" target="_blank" class="u" title="http://www.daftonline.co.uk">www.daftonline.co.uk</a>
You can see the title and the anchor is the link you want, but the actual ahref= is a internal link. So it probably passes the title/anchor to the internal recording system and counts the click and redirects. But there is no actual valid external link there.
You might get the custom data scraper to work with some regex, matching the title to the anchor or looking for the record click event and getting the anchor or title after that. Not sure, not a regex expert, don't even know if its possible.
Before and After would be very hard as there isn't a lot of unique marker there, but its possibly doable like
before_after=&ctype=profile" target="_blank" class="u" title="|">