Skip to content

A Python package to calculate the delay between two arrays or two audio files

License

Notifications You must be signed in to change notification settings

RomainPastureau/find_delay

Repository files navigation

find_delay 2.7

Documentation Status

PyPI page

Author: Romain Pastureau

What is find_delay?

find_delay is a Python package that tries to find the delay where a time series appears in another via cross-correlation. It can theoretically work with any time series (see the examples in the demos folder, but was created to try to align audio files. Read the documentation here!

How to

The best way to use this function is to install the find_delay module for Python by running py -m pip install find-delay.

You can then import the function by writing from find_delay import find_delay (or from find_delay import find_delays if you want to locate multiple excerpts in one big time series).

You can also run demos/demo.py to get four examples (in that case, you will need to download the .wav files present in the repository and place them in the same folder for examples 3 and 4).

Quick use for audio files

To find when an excerpt starts in an audio file, use the find_delay function and fill only the first four parameters; leave the other parameters default (just set plot_figure = True if you want to visualize the output of the function).

Specifics

The function accepts two arrays containing time series - the time series can be of different frequency or amplitude.

The function can then calculate the envelope of the time series (recommended for audio files) and apply a band-pass filter to the result.

The function can also resample the arrays (necessary when the two time series do not have the same frequency).

Finally, the function performs the cross-correlation between the two arrays.

The results can be then plotted if the corresponding parameters are activated, and the function returns the delay at which to find the second array in the first by selecting the delay with the maximum correlation value (optionally, the function can also return this correlation value).

Dependencies

  • Matplotlib for the plots
  • Numpy for handling the numerical arrays
  • Scipy for loading the WAV files, performing the resampling, calculating the envelope, and applying a band-pass filter.

Examples

Delay between two numerical time series

array_1 = [24, 70, 28, 59, 13, 97, 63, 30, 89, 4, 8, 15, 16, 23, 42, 37, 70, 18, 59, 48, 41, 83, 99, 6, 24, 86]
array_2 = [4, 8, 15, 16, 23, 42]

find_delay(array_1, array_2, compute_envelope=False, plot_figure=True, path_figure="figure_1.png")

Delay between two numerical time series

Delay between a sine function and a portion of it, different frequencies

timestamps_1 = np.linspace(0, np.pi * 2, 200001)
array_1 = np.sin(timestamps_1)
timestamps_2 = np.linspace(np.pi * 0.5, np.pi * 0.75, 6001)
array_2 = np.sin(timestamps_2)

find_delay(array_1, array_2, 100000 / np.pi, 6000 / (np.pi / 4),
           compute_envelope=False, resampling_rate=1000, window_size_res=20000, overlap_ratio_res=0.5,
           resampling_mode="cubic", plot_figure=True, path_figure="figure_2.png", plot_intermediate_steps=True,
           verbosity=1)

Delay between a sine function and a portion of it, different frequencies

Delay between an audio file and an excerpt from it

audio_path = "i_have_a_dream_full.wav"
audio_wav = wavfile.read(audio_path)
audio_frequency = audio_wav[0]
audio_array = audio_wav[1][:, 0]  # Turn to mono

excerpt_path = "i_have_a_dream_excerpt.wav"
excerpt_wav = wavfile.read(excerpt_path)
excerpt_frequency = excerpt_wav[0]
excerpt_array = excerpt_wav[1][:, 0]  # Turn to mono

find_delay(audio_array, excerpt_array, audio_frequency, excerpt_frequency,
           compute_envelope=True, window_size_env=1e6, overlap_ratio_env=0.5,
           resampling_rate=1000, window_size_res=1e7, overlap_ratio_res=0.5, return_delay_format="timedelta",
           resampling_mode="cubic", plot_figure=True, path_figure="figure_3.png", plot_intermediate_steps=True,
           verbosity=1)

Delay between an audio file and an excerpt from it

Version history

2.7 (2024-05-09)

  • Simplified from find_delay.find_delay import find_delay to from find_delay import find_delay
  • Corrected scaling (again) on the aligned arrays graph
  • Reestablished audio examples with downloadable WAV files when running the demo
  • Added an example with randomly generated numbers

2.6 (2024-05-08)

  • Removed demo audio files to lighten the Python package; they are still available on the main branch

2.5 (2024-05-08)

  • Turned find_delay into a Python package, install with py -m pip install find_delay

2.4 (2024-05-08)

  • The functions now look for correlation at the edges of the first array, in the case where the second array contains information that starts before the beginning, or ends after the end of the first
  • Example 4 has been updated with one new audio file to demonstrate this change
  • Adding a parameter x_format_figure that allows to display HH:MM:SS time on the x-axis
  • Corrected a bug in the percentage progressions that prevented to display all the steps
  • Added "Quick use for audio files" segment in the README file

2.3 (2024-05-02)

  • Corrected a bug that prevented the figures to be saved as a file
  • Plotting without intermediate steps now plots the graphs on top of each other, not side-by-side

2.2 (2024-05-02)

  • "i_have_a_dream_excerpt2.wav" is now of lower amplitude to test the scaling on the graph overlay
  • Arrays with different amplitudes now appear scaled on the graph overlay
  • Excerpt numbers now start at 1 instead of 0 on the graphs in find_delays

2.1 (2024-04-25)

  • Modified the overall functions so that they take a window size instead of a number of windows

2.0 (2024-04-24)

  • Changed the parameter asking for a number of windows by a parameter asking for a window size instead
  • Clarified the docstrings in the documentation of the functions
  • Modified find_delays so that saving the figures would iterate the filenames instead of overwriting
  • Modified _get_envelope and _resample so that a number of windows inferior to 1 would be set at 1
  • Added documentation for _create_figure and simplified unused parameters
  • Corrected broken figure saving
  • Added figure saving for the 3 first examples

1.3 (2024-04-18)

  • Removed unused function _get_number_of_windows

1.2 (2024-04-17)

  • Added transparency of the second (orange) array on the graph overlay
  • Clarified README.md and added figures

1.1 (2024-04-16)

  • Added find_delays
  • Created _create_figure containing all the plotting-related code
  • Modified the graph plot when the max correlation is below threshold
  • Minor corrections in docstrings

1.0 (2024-04-12)

  • Initial release

If you detect any bug, please open an issue.

Thanks! 🦆

About

A Python package to calculate the delay between two arrays or two audio files

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages