Is it possible to download a full PDF file via FlareSolverr? I'm currently scraping a site and need to get past its Cloudflare protection to download PDF files. I have no problem scraping the rest of the site with FlareSolverr, but I'm unsure how to grab the contents of a file.
This is how I normally download and store a PDF file in Python without FlareSolverr:
#...
req = requests.get(url, verify=False)
with open(uid, "wb") as outfile:
    outfile.write(req.content)
At the moment, the FlareSolverr response for the protected URL seems to contain the HTML of the page, but there is no equivalent of "req.content" holding the actual file contents.
Really appreciate any pointers anyone could give me on this.
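One direction that might be worth trying, sketched here under assumptions rather than as a confirmed fix: let FlareSolverr solve the challenge, then reuse the cookies and user agent from its `solution` object to fetch the PDF bytes directly. The sketch assumes FlareSolverr is running at its default `http://localhost:8191/v1` endpoint and that the site accepts the reused cookies from the same IP address. The helper names (`headers_from_solution`, `solve`, `download_pdf`) are made up for illustration, and it uses only the standard library; the same headers could be passed to `requests.get` instead.

```python
import json
import urllib.request

# Assumption: FlareSolverr is listening on its default local endpoint.
FLARESOLVERR_URL = "http://localhost:8191/v1"


def headers_from_solution(solution):
    """Build request headers from the `solution` object FlareSolverr returns."""
    # Join every cookie into a single "name=value; name=value" Cookie header.
    cookie_header = "; ".join(
        f"{c['name']}={c['value']}" for c in solution.get("cookies", [])
    )
    # The user agent should match the browser that solved the challenge.
    return {"User-Agent": solution["userAgent"], "Cookie": cookie_header}


def solve(url, max_timeout_ms=60000):
    """Ask FlareSolverr to load `url` and return its `solution` object."""
    payload = json.dumps(
        {"cmd": "request.get", "url": url, "maxTimeout": max_timeout_ms}
    ).encode("utf-8")
    req = urllib.request.Request(
        FLARESOLVERR_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["solution"]


def download_pdf(pdf_url, out_path):
    """Fetch `pdf_url` directly, reusing the cookies FlareSolverr obtained."""
    solution = solve(pdf_url)
    req = urllib.request.Request(pdf_url, headers=headers_from_solution(solution))
    with urllib.request.urlopen(req) as resp, open(out_path, "wb") as outfile:
        outfile.write(resp.read())  # raw bytes, the analogue of req.content
```

Calling `download_pdf(url, uid)` would then stand in for the `requests.get` snippet above. Whether the second request is let through depends on the site, so this may still return a challenge page instead of the file.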