01-28-2022, 06:23 PM
I am working on a research project that requires me to access many documents on the web.
I have successfully created the footprints and the one engine I want to use, and I am finding a fair number of documents.
Although I have used the harvester to download many of the documents, Scrapebox is not getting the files (such as PDFs) that open in a new browser window and download automatically when I open them in my browser.
Is there a custom grab idea/strategy to collect these? Alternatively, how might I determine in advance which URLs will auto-download, so that I can remove them from the URL list?
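To make the second question concrete, here is a rough sketch of the kind of pre-check I have in mind, done outside Scrapebox with plain Python. It assumes the servers answer HEAD requests and that a forced download shows up as a Content-Disposition: attachment header or a generic binary Content-Type. Both are guesses on my part, and the function names and the list of content types are made up for illustration.

```python
from urllib.request import Request, urlopen

# Assumption: content types that browsers usually save instead of displaying.
DOWNLOAD_TYPES = {
    "application/octet-stream",
    "application/x-download",
    "application/force-download",
}


def looks_like_auto_download(headers):
    """Guess from response headers whether a browser would download the file."""
    # Header names are case-insensitive, so normalize the keys first.
    lowered = {str(k).lower(): str(v) for k, v in headers.items()}
    disposition = lowered.get("content-disposition", "").strip().lower()
    content_type = lowered.get("content-type", "").split(";")[0].strip().lower()
    if disposition.startswith("attachment"):
        return True
    return content_type in DOWNLOAD_TYPES


def fetch_headers(url, timeout=10):
    """Send a HEAD request and return the response headers as a dict."""
    req = Request(url, method="HEAD", headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(req, timeout=timeout) as resp:
        return dict(resp.headers.items())


def split_urls(urls):
    """Split URLs into (auto_download, other); URLs that fail go into 'other'."""
    auto, other = [], []
    for url in urls:
        try:
            headers = fetch_headers(url)
        except (OSError, ValueError):
            # Network error, HTTP error, or malformed URL: leave it unclassified.
            other.append(url)
            continue
        (auto if looks_like_auto_download(headers) else other).append(url)
    return auto, other
```

The idea would be to run split_urls over the harvested list and either drop the first group or fetch it separately. I do not know whether the files Scrapebox misses actually behave this way, so this may not catch all of them.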
Any thoughts?
Thanks