Other diarization metrics #3

Open · 1 of 6 tasks
desh2608 opened this issue Mar 9, 2021 · 5 comments
Labels: enhancement (New feature or request)

Comments
desh2608 commented Mar 9, 2021

The following metrics (from pyannote and dscore) may be implemented (a rough sketch of the per-speaker JER term follows the list):

  • Diarization error rate (DER)
  • Jaccard error rate (JER)
  • Purity and coverage
  • Bcubed precision/recall
  • Goodman-Kruskal Tau
  • Mutual information
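
For reference, the per-speaker quantity behind JER (as defined in dscore) is the Jaccard distance between a reference speaker's speech regions and those of the system speaker optimally mapped to it. Below is a minimal, hypothetical sketch of just that term; the helper names are made up for illustration, and the optimal speaker mapping (e.g. via the Hungarian algorithm) and the averaging over reference speakers are omitted.

```python
# Hypothetical helpers, not spyder/dscore code: the per-speaker Jaccard error
# that JER averages over reference speakers. Assumes the intervals within each
# list are non-overlapping (true for a single speaker in a typical RTTM).

def total_duration(intervals):
    """Total duration covered by (start, end) intervals, merging overlaps."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return sum(end - start for start, end in merged)


def intersection_duration(ref_intervals, sys_intervals):
    """Total duration shared by two lists of non-overlapping intervals."""
    shared = 0.0
    for r_start, r_end in ref_intervals:
        for s_start, s_end in sys_intervals:
            shared += max(0.0, min(r_end, s_end) - max(r_start, s_start))
    return shared


def speaker_jaccard_error(ref_intervals, sys_intervals):
    """1 - |intersection| / |union| for one reference/system speaker pair."""
    union = total_duration(list(ref_intervals) + list(sys_intervals))
    if union == 0.0:
        return 0.0
    return 1.0 - intersection_duration(ref_intervals, sys_intervals) / union


# Reference speaker speaks 0-10 s, mapped system speaker speaks 2-12 s:
# intersection = 8 s, union = 12 s, Jaccard error = 1 - 8/12 ≈ 0.333.
print(speaker_jaccard_error([(0.0, 10.0)], [(2.0, 12.0)]))
```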

nryant commented Apr 5, 2023

Are JER/clustering metrics still of interest? I'd be up for adding them if I knew the PRs would get accepted.


desh2608 commented Apr 5, 2023

Hi Neville! Yeah, that would be awesome. JER is at the top of the list, but I can imagine people would be interested in the other metrics as well.

(@popcornell and I want to switch from dscore to spyder in CHiME-7 DASR, but it is blocked by JER not being implemented yet.)


nryant commented Apr 5, 2023

OK, I can add this to the TODO list. I'm in the process of rewriting dscore to eliminate the md-eval dependency and produce more detailed reports. The initial version is based on pyannote.metrics, but between the penalty of Python being an interpreted language and the repeated calls to uemify, it's not particularly quick. So it's in my interest to get faster implementations of the various metrics, and I'd rather contribute to an existing project if possible.


desh2608 commented Apr 6, 2023

Cool! Your contributions would be very welcome. In my benchmarking, I found pyannote.metrics to be an order of magnitude slower than md-eval.pl; pyannote is a great tool overall, just not suitable for fast DER evaluation :)

I'm sure spyder would benefit immensely from your expertise. Please use this thread for any questions/discussions once you get around to implementing the metrics.

nryant commented Apr 6, 2023

That sounds about right. When I benchmarked on the DIHARD III eval (full) condition, just the DER computation (with IO excluded and the Annotation/Timeline instances already built in memory) averaged over 13 seconds, compared to 3.5 seconds for running md-eval. Most of this comes from the call to IdentificationErrorRate.uemify, which constructs the equivalent of your get_eval_regions. Specifically, this block, which accounts for 10 seconds of that run time.
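
For illustration, a rough harness in the spirit of that benchmark could look like the following, with synthetic in-memory annotations standing in for the DIHARD data and a made-up make_annotation helper; only the DER call itself is timed.

```python
# Hypothetical timing sketch: build reference/hypothesis Annotations in memory
# first, then time only the DER computation so that IO is excluded.
import time

from pyannote.core import Annotation, Segment
from pyannote.metrics.diarization import DiarizationErrorRate


def make_annotation(num_segments=1000, speakers=("A", "B", "C"), seg_dur=2.0):
    """Synthetic single-file annotation with back-to-back 2-second turns."""
    ann = Annotation()
    for i in range(num_segments):
        start = i * seg_dur
        ann[Segment(start, start + seg_dur)] = speakers[i % len(speakers)]
    return ann


reference = make_annotation(speakers=("A", "B", "C"))
hypothesis = make_annotation(speakers=("spk0", "spk1", "spk2"))

metric = DiarizationErrorRate()
tic = time.perf_counter()
der = metric(reference, hypothesis)
print(f"DER = {der:.4f}, computed in {time.perf_counter() - tic:.2f} s")
```

Calling the same DiarizationErrorRate instance once per recording accumulates the per-file components, and abs(metric) then gives the aggregate DER over the whole set, so in a real evaluation the repeated per-file uemify cost is where the time goes.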

I've been updating dscore off and on for the past week for an LDC-internal project and want to finish that work first, but I will look into implementing JER in spyder after that. I think it should be relatively straightforward.
