Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Richer JSON output #89

Open
ocervell opened this issue Jan 28, 2023 · 5 comments
Open

Richer JSON output #89

ocervell opened this issue Jan 28, 2023 · 5 comments

Comments

@ocervell
Copy link

ocervell commented Jan 28, 2023

Would be nice to have some other response data than just the URL in the JSON output, such as :

{
"url": "https://test.domain.synology.me/.htaccess-local",
"status_code": 200,
"words": 1066,
"lines": 100,
"content_length": 4516,
"content_type": "text/html; charset=utf-8",
"duration": 57779116,
"host": "test.domain.synology.me"
}

That would avoid scraping the endpoint again to find those details.

Maybe even consider using httpx as a client instead of fasthttp as it seems to give more info on the response ?

@lc
Copy link
Owner

lc commented Feb 11, 2023

gau is completely passive at the moment. It issues no HTTP requests to URLs that are archived from Wayback, OTX, etc. It can be piped into a tool such as httpx for additional info. Would you prefer that gau had an option for this instead?

@ocervell
Copy link
Author

ocervell commented Apr 3, 2023

Ah, I thought since there is a --mc strings # list of status codes to match option that there was still some crawling happening. What is the --mc flag purpose then ?
Otherwise an option for adding an httpx query could be done, even though we would not really control httpx input options like tech detection and so on ...

@zerodivisi0n
Copy link
Contributor

zerodivisi0n commented Nov 1, 2023

I think it is useful to add provider, timestamp, status_code, mimetype and content_length to the JSON output. In this case it would be possible to filter by this values on later stages.
I checked all providers and all of them return most of this fields.
I am ready to implement this change, if you agree.

@lc
Copy link
Owner

lc commented Nov 1, 2023

Hey @zerodivisi0n, I definitely agree

@zerodivisi0n
Copy link
Contributor

Great! Then I'll do it soon

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants