find_delay 2.7

Author: Romain Pastureau

What is find_delay?

find_delay is a Python package that tries to find the delay where a time series appears in another via cross-correlation. It can theoretically work with any time series (see the examples in the demos folder, but was created to try to align audio files. Read the documentation here!

How to

The best way to use this function is to install the find_delay module for Python by running py -m pip install find-delay.

You can then import the function by writing from find_delay import find_delay (or from find_delay import find_delays if you want to locate multiple excerpts in one big time series).

You can also run demos/demo.py to get four examples (in that case, you will need to download the .wav files present in the repository and place them in the same folder for examples 3 and 4).

Quick use for audio files

To find when an excerpt starts in an audio file, use the find_delay function and fill only the first four parameters; leave the other parameters default (just set plot_figure = True if you want to visualize the output of the function).

Specifics

The function accepts two arrays containing time series - the time series can be of different frequency or amplitude.

The function can then calculate the envelope of the time series (recommended for audio files) and apply a band-pass filter to the result.

The function can also resample the arrays (necessary when the two time series do not have the same frequency).

Finally, the function performs the cross-correlation between the two arrays.

The results can be then plotted if the corresponding parameters are activated, and the function returns the delay at which to find the second array in the first by selecting the delay with the maximum correlation value (optionally, the function can also return this correlation value).

Dependencies

Matplotlib for the plots
Numpy for handling the numerical arrays
Scipy for loading the WAV files, performing the resampling, calculating the envelope, and applying a band-pass filter.

Examples

Delay between two numerical time series

array_1 = [24, 70, 28, 59, 13, 97, 63, 30, 89, 4, 8, 15, 16, 23, 42, 37, 70, 18, 59, 48, 41, 83, 99, 6, 24, 86]
array_2 = [4, 8, 15, 16, 23, 42]

find_delay(array_1, array_2, compute_envelope=False, plot_figure=True, path_figure="figure_1.png")

Delay between a sine function and a portion of it, different frequencies

timestamps_1 = np.linspace(0, np.pi * 2, 200001)
array_1 = np.sin(timestamps_1)
timestamps_2 = np.linspace(np.pi * 0.5, np.pi * 0.75, 6001)
array_2 = np.sin(timestamps_2)

find_delay(array_1, array_2, 100000 / np.pi, 6000 / (np.pi / 4),
           compute_envelope=False, resampling_rate=1000, window_size_res=20000, overlap_ratio_res=0.5,
           resampling_mode="cubic", plot_figure=True, path_figure="figure_2.png", plot_intermediate_steps=True,
           verbosity=1)

Delay between an audio file and an excerpt from it

audio_path = "i_have_a_dream_full.wav"
audio_wav = wavfile.read(audio_path)
audio_frequency = audio_wav[0]
audio_array = audio_wav[1][:, 0]  # Turn to mono

excerpt_path = "i_have_a_dream_excerpt.wav"
excerpt_wav = wavfile.read(excerpt_path)
excerpt_frequency = excerpt_wav[0]
excerpt_array = excerpt_wav[1][:, 0]  # Turn to mono

find_delay(audio_array, excerpt_array, audio_frequency, excerpt_frequency,
           compute_envelope=True, window_size_env=1e6, overlap_ratio_env=0.5,
           resampling_rate=1000, window_size_res=1e7, overlap_ratio_res=0.5, return_delay_format="timedelta",
           resampling_mode="cubic", plot_figure=True, path_figure="figure_3.png", plot_intermediate_steps=True,
           verbosity=1)

Version history

2.7 (2024-05-09)

Simplified from find_delay.find_delay import find_delay to from find_delay import find_delay
Corrected scaling (again) on the aligned arrays graph
Reestablished audio examples with downloadable WAV files when running the demo
Added an example with randomly generated numbers

2.6 (2024-05-08)

Removed demo audio files to lighten the Python package; they are still available on the main branch

2.5 (2024-05-08)

Turned find_delay into a Python package, install with py -m pip install find_delay

2.4 (2024-05-08)

The functions now look for correlation at the edges of the first array, in the case where the second array contains information that starts before the beginning, or ends after the end of the first
Example 4 has been updated with one new audio file to demonstrate this change
Adding a parameter x_format_figure that allows to display HH:MM:SS time on the x-axis
Corrected a bug in the percentage progressions that prevented to display all the steps
Added "Quick use for audio files" segment in the README file

2.3 (2024-05-02)

Corrected a bug that prevented the figures to be saved as a file
Plotting without intermediate steps now plots the graphs on top of each other, not side-by-side

2.2 (2024-05-02)

"i_have_a_dream_excerpt2.wav" is now of lower amplitude to test the scaling on the graph overlay
Arrays with different amplitudes now appear scaled on the graph overlay
Excerpt numbers now start at 1 instead of 0 on the graphs in find_delays

2.1 (2024-04-25)

Modified the overall functions so that they take a window size instead of a number of windows

2.0 (2024-04-24)

Changed the parameter asking for a number of windows by a parameter asking for a window size instead
Clarified the docstrings in the documentation of the functions
Modified find_delays so that saving the figures would iterate the filenames instead of overwriting
Modified _get_envelope and _resample so that a number of windows inferior to 1 would be set at 1
Added documentation for _create_figure and simplified unused parameters
Corrected broken figure saving
Added figure saving for the 3 first examples

1.3 (2024-04-18)

Removed unused function _get_number_of_windows

1.2 (2024-04-17)

Added transparency of the second (orange) array on the graph overlay
Clarified README.md and added figures

1.1 (2024-04-16)

Added find_delays
Created _create_figure containing all the plotting-related code
Modified the graph plot when the max correlation is below threshold
Minor corrections in docstrings

1.0 (2024-04-12)

Initial release

If you detect any bug, please open an issue.

Thanks! 🦆

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
.github/workflows		.github/workflows
.idea		.idea
demos		demos
dist		dist
docs		docs
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
LICENSE		LICENSE
README.md		README.md
citation.cff		citation.cff
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

License

RomainPastureau/find_delay

Folders and files

Latest commit

History

Repository files navigation

find_delay 2.7

What is find_delay?

How to

Quick use for audio files

Specifics

Dependencies

Examples

Delay between two numerical time series

Delay between a sine function and a portion of it, different frequencies

Delay between an audio file and an excerpt from it

Version history

About

Topics

Resources

License

Stars

Watchers

Forks

Languages