Question: Handling of pdf files #355

eladbitton · 2019-09-21T08:30:19Z

I want to create a general purpose crawler with this project.
By general purpose i mean - if the url leads to pdf i want it to render the pdf, and if its html i want it to render html.

How is this project handle files like pdf?
Is there any example i can take a look at?
Is there a docker example for this project?

kulikalov · 2020-10-17T06:45:12Z

Hey @eladbitton! At the moment this project is not handling pdfs well. Actually, it's simply crashing. So, this is a valid point to improve.
Did you figure how to achieve what you want? If not, pls elaborate more on what is your final goal.

kulikalov added the bug label Oct 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question: Handling of pdf files #355

Question: Handling of pdf files #355

eladbitton commented Sep 21, 2019

kulikalov commented Oct 17, 2020 •

edited

Question: Handling of pdf files #355

Question: Handling of pdf files #355

Comments

eladbitton commented Sep 21, 2019

kulikalov commented Oct 17, 2020 • edited

kulikalov commented Oct 17, 2020 •

edited