Bug: Using wrong chrome binary for singlefile snapshot #763

Closed
LinusCDE opened this issue Jun 5, 2021 · 2 comments
LinusCDE commented Jun 5, 2021

Describe the bug

Creating a singlefile snapshot fails.

Steps to reproduce

  1. Created a snapshot that included "singlefile". Seems to work with any site.

Screenshots or log output

[Screenshot attached to the original issue; not preserved here.]

ArchiveBox version

ArchiveBox v0.6.2
Cpython Linux Linux-4.9.201-tegra-aarch64-with-glibc2.28 aarch64
IN_DOCKER=True DEBUG=False IS_TTY=True TZ=UTC SEARCH_BACKEND_ENGINE=ripgrep

[i] Dependency versions:
 √  ARCHIVEBOX_BINARY     v0.6.2          valid     /usr/local/bin/archivebox                                                   
 √  PYTHON_BINARY         v3.9.5          valid     /usr/local/bin/python3.9                                                    
 √  DJANGO_BINARY         v3.1.10         valid     /usr/local/lib/python3.9/site-packages/django/bin/django-admin.py           
 √  CURL_BINARY           v7.64.0         valid     /usr/bin/curl                                                               
 √  WGET_BINARY           v1.20.1         valid     /usr/bin/wget                                                               
 √  NODE_BINARY           v15.14.0        valid     /usr/bin/node                                                               
 √  SINGLEFILE_BINARY     v0.3.16         valid     /node/node_modules/single-file/cli/single-file                              
 √  READABILITY_BINARY    v0.0.2          valid     /node/node_modules/readability-extractor/readability-extractor              
 √  MERCURY_BINARY        v1.0.0          valid     /node/node_modules/@postlight/mercury-parser/cli.js                         
 √  GIT_BINARY            v2.20.1         valid     /usr/bin/git                                                                
 √  YOUTUBEDL_BINARY      v2021.04.26     valid     /usr/local/bin/youtube-dl                                                   
 √  CHROME_BINARY         v89.0.4389.114  valid     /usr/bin/chromium                                                           
 √  RIPGREP_BINARY        v0.10.0         valid     /usr/bin/rg                                                                 

[i] Source-code locations:
 √  PACKAGE_DIR           23 files        valid     /app/archivebox                                                             
 √  TEMPLATES_DIR         3 files         valid     /app/archivebox/templates                                                   
 -  CUSTOM_TEMPLATES_DIR  -               disabled                                                                              

[i] Secrets locations:
 -  CHROME_USER_DATA_DIR  -               disabled                                                                              
 -  COOKIES_FILE          -               disabled                                                                              

[i] Data locations:
 √  OUTPUT_DIR            6 files         valid     /data                                                                       
 √  SOURCES_DIR           5 files         valid     ./sources                                                                   
 √  LOGS_DIR              1 files         valid     ./logs                                                                      
 √  ARCHIVE_DIR           8 files         valid     ./archive                                                                   
 √  CONFIG_FILE           81.0 Bytes      valid     ./ArchiveBox.conf                                                           
 √  SQL_INDEX             268.0 KB        valid     ./index.sqlite3                                                             

Cause/Solution

I ran the commands recommended on the error page, which reported that the extractor had failed with code 0. Running the command manually for the full output produced a Node error saying that the binary (/usr/bin/chromium-browser) didn't exist. Symlinking that path to ./chromium (that is, to /usr/bin/chromium, the CHROME_BINARY listed above) fixed it for the time being.

After that, running the command again completed the singlefile job successfully.
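For anyone who wants to script the workaround above rather than create the link by hand, here is a minimal sketch in Python. The helper name `ensure_browser_symlink` is hypothetical (it is not part of ArchiveBox or single-file); it only assumes the two paths mentioned in this report and that it runs with permission to write to the directory holding the link.

```python
import os
from pathlib import Path


def ensure_browser_symlink(expected: Path, actual: Path) -> bool:
    """Create `expected` as a relative symlink to `actual` if it is missing.

    Hypothetical helper for illustration only.
    Returns True if a new link was created, False if something already
    exists at `expected` (including a dangling symlink).
    """
    if expected.exists() or expected.is_symlink():
        # Something is already at the expected path; leave it untouched.
        return False
    if not actual.exists():
        raise FileNotFoundError(f"real browser binary not found: {actual}")
    # Build the link target relative to the link's own directory,
    # e.g. chromium-browser -> chromium when both live in /usr/bin.
    target = os.path.relpath(actual, start=expected.parent)
    expected.symlink_to(target)
    return True
```

Inside the container this would be called as `ensure_browser_symlink(Path("/usr/bin/chromium-browser"), Path("/usr/bin/chromium"))`, most likely as root. It only papers over the path mismatch; it does not change which binary the Node tooling looks for.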

So it seems that either some of the Node tooling is using the wrong Chrome binary, or the docker-compose.yml recommended on hub.docker.com is missing some configuration. (I used the latest version of that file, from the link to the master branch.)

@pirate pirate added type: bug report status: wip Work is in-progress / has already been partially completed labels Jun 11, 2021
@pirate pirate added this to the v0.6.3 milestone Jun 11, 2021
@TrAnn3l

TrAnn3l commented Sep 18, 2021

This seems to be a known bug on ARM, see puppeteer/puppeteer#6614

@pirate
Member

pirate commented Jan 19, 2024

Closing this as stale for now. I've tested it on x86, armv7, and armv8 (aka arm64 or aarch64) and singlefile w/ chrome seems to be working in Docker.

Please open a new issue if you're still encountering trouble on ArchiveBox version >= 0.7.2.

@pirate pirate closed this as completed Jan 19, 2024