0

I am trying to understand the file result recout.mlf, so I have the following lines in that file:

enter image description here

Which of 'as' was well prononced: the one with -524.427185 or -1054.774536

Dmytro Prylipko
  • 4,762
  • 2
  • 25
  • 44
Mehdi Souregi
  • 3,153
  • 5
  • 36
  • 53

1 Answers1

1

The acoustic scores obtained during decoding are usually very tiny. To prevent underflow, log likelihoods are used instead of likelihoods: 1.5 Recognition and Viterbi Decoding.

Smaller argument values correspond to larger negative values of logarithms:

Natural log

Thus, the first 'as' obtained a higher (-524.427185) acoustic score. Logarithm is a monotonic function (the larger is argument - the larger is the value), so you can compare the log-likelihoods directly: -524 > -1054.

BTW, it does not necessarily mean the first 'as' was better pronounced. The acoustic score depends on many factors, including model topology and the data the model was trained on.

Dmytro Prylipko
  • 4,762
  • 2
  • 25
  • 44