Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The error message for host unreachable in older versions was significantly better... #565

Open
elucidsoft opened this issue Nov 24, 2021 · 4 comments

Comments

@elucidsoft
Copy link

I have been banging my head trying to get this to work and kept getting host unreachable error. After 3 hours, I tried the 1.13.0 docker image instead. The error message it gave was SIGNIFICANTLY better and I was immediately able to recognize the issue. I really suggest you put that back to how it was, I just wasted an immense amount of time.

@shortcuts
Copy link
Member

Hey, could you please provide more context? What are the errors/differences?

The scraper uses the Algolia Python client so I don't think the issue is related to this repo

@elucidsoft
Copy link
Author

The error message I was getting was host unreachable on the latest version. On the v1.13.0 message, it told me exactly what was wrong as it showed the entire neterror stack and I could clearly see that I had a malformed credentials.

@Markeli
Copy link

Markeli commented Nov 30, 2021

I had the same behavior. My auto-update script incorrectly added trailing extra whitespace to APPLICATION_ID , because of that docsearch-scrapprer made incorrect hostname. But in the latest version, I got AlgoliaUnreachableHostException: Unreachable hosts without any useful information. After downgrading to v1.13.0 I got some details that allowed me to solve the issue.

@shortcuts
Copy link
Member

Hey @elucidsoft, @Markeli, looking at the past commit, I can only see one change that could cause that: we upgraded the scraper to the latest major version of our Python client, which might handle errors differently.

After downgrading to v1.13.0 I got some details that allowed me to solve the issue.

I believe using an older version won't change the indexing, most updates were to make the scraper more stable and detect the website structure when bootstrapping config.

Note that prior to our new infra, we will only accept community contribution unless there's an urgent fix to do.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants