Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

article.content only contains a part of the content #100

Open
westlinkin opened this issue Oct 24, 2017 · 2 comments
Open

article.content only contains a part of the content #100

westlinkin opened this issue Oct 24, 2017 · 2 comments

Comments

@westlinkin
Copy link

First of all, great great library! You've done a wonderful job here.

When use this url, the result is wrong. The article.content only contains a part of the content, here the value:

<div class="field field-paragraph field-paragraph--full field-type-text-long field-type-text-long--full"><p>“It was the saddest movie I've ever filmed, to be honest with you. I've never had a more difficult film to film,” Olmos lamented. “It was too close to the time when she was actually killed, it was only 13 months after when we were filming. Nobody wanted to film it, the parents didn't, we didn't, nobody wanted to. We'd rather she be alive. But we had to."</p></div>

If you click on the link, you'll see article.content only contains the first paragraph.

@westlinkin westlinkin changed the title article.content only contains a part of the content article.content only contains a part of the content Oct 24, 2017
@wong2
Copy link
Contributor

wong2 commented Nov 1, 2017

yes this is one of the biggest limitations of this lib: it doesn't work well on deeply nested HTML structure:

image

@raju1988
Copy link

raju1988 commented Sep 16, 2019

Yes we are also facing same kind of issue. It won't return full content of html. It returns some random div from the page.
I used this link here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants