
Feature: Autodetect noisy files

Josh Soref edited this page Feb 25, 2024 · 6 revisions

Autodetect noisy files

Problem space

There are a couple of categories of files to consider:

  • binaries (e.g. pictures / audio / video)
  • files written in natural languages not recognized by the dictionary

Possible strategies

Consider keeping stats on a per-file / per-line basis...
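As a rough illustration of what per-file / per-line stats could look like, here is a minimal sketch. It is not how the feature is implemented: the names `FileStats`, `collect_stats`, and `is_noisy`, the word pattern, and the 0.5 thresholds are all assumptions made for the example.

```python
import re
from dataclasses import dataclass

# Assumption: a "word" is a run of ASCII letters/apostrophes; a real checker
# would use its own tokenization rules.
WORD_RE = re.compile(r"[A-Za-z']+")


@dataclass
class FileStats:
    """Hypothetical per-file counters."""
    total_lines: int = 0    # lines containing at least one word
    noisy_lines: int = 0    # lines where most words are unrecognized
    total_words: int = 0
    unknown_words: int = 0

    @property
    def unknown_ratio(self) -> float:
        # Share of words in the file that the dictionary does not recognize.
        return self.unknown_words / self.total_words if self.total_words else 0.0


def collect_stats(text, dictionary, line_threshold=0.5):
    """Count unrecognized words per line and accumulate them per file."""
    stats = FileStats()
    for line in text.splitlines():
        words = [w.lower() for w in WORD_RE.findall(line)]
        if not words:
            continue  # skip blank or symbol-only lines
        unknown = sum(1 for w in words if w not in dictionary)
        stats.total_lines += 1
        stats.total_words += len(words)
        stats.unknown_words += unknown
        if unknown / len(words) > line_threshold:
            stats.noisy_lines += 1
    return stats


def is_noisy(stats, file_threshold=0.5):
    """Treat a file as noisy when most of its words are unrecognized (assumed cutoff)."""
    return stats.unknown_ratio > file_threshold
```

With a dictionary of `{"the", "quick", "brown", "fox"}`, a file containing only `the quick brown fox` would not be flagged, while a file that is mostly text in another language would be. The thresholds would need tuning against real repositories.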

Look for new filename extensions?
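One way to explore the extension idea is to group incoming paths by extension and report the ones not seen before. This is only a sketch under assumptions: `new_extensions` and the `known_extensions` set are hypothetical names, and where the set of already-seen extensions would come from is left open.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def new_extensions(paths, known_extensions):
    """Group paths by extension, keeping only extensions not already known.

    `known_extensions` is an assumed set of lowercase extensions (with the
    leading dot) that earlier scans have already encountered.
    """
    found = defaultdict(list)
    for path in paths:
        ext = PurePosixPath(path).suffix.lower()
        # Files without an extension (e.g. "Makefile") are ignored here.
        if ext and ext not in known_extensions:
            found[ext].append(path)
    return dict(found)
```

For example, given `["docs/readme.md", "img/logo.png", "img/icon.PNG", "Makefile"]` and a known set of `{".md"}`, the function groups both image paths under `.png`. A newly seen extension does not by itself mean the files are noisy; it is only a hint about which files deserve a closer look.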

See also

Escape path

If this feature trips on files you need scanned, see:

Scan noisy files
