Alignment performs worse in v3.3.3 (zero scores/wrong timestamps) vs older versions

WhisperX's transcription quality is good, but forced alignment in the newest version (v3.3.3) sometimes produces incorrect results for Japanese audio that older versions handled correctly. When a segment is aligned incorrectly:

    Alignment returns a score of 0

    Older versions aligned the same audio as expected
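
To make the regression easier to compare across versions, here is a minimal sketch that lists the words whose alignment score is 0 or missing. It assumes the aligned result is a dict with a `segments` list, where each segment has a `words` list of dicts carrying `word`, `start`, `end`, and `score` keys; that shape is my understanding of the alignment output, not something confirmed above. The helper name `find_zero_score_words` and the `threshold` parameter are hypothetical.

```python
from typing import Any


def find_zero_score_words(
    aligned: dict[str, Any], threshold: float = 0.0
) -> list[dict[str, Any]]:
    """Return words whose alignment score is missing or <= threshold.

    `aligned` is assumed (not confirmed) to look like:
    {"segments": [{"words": [{"word": ..., "start": ..., "end": ..., "score": ...}]}]}
    """
    flagged = []
    for seg_idx, segment in enumerate(aligned.get("segments", [])):
        for word in segment.get("words", []):
            score = word.get("score")
            # A missing score is treated as a failed alignment as well.
            if score is None or score <= threshold:
                flagged.append(
                    {
                        "segment": seg_idx,
                        "word": word.get("word"),
                        "start": word.get("start"),
                        "end": word.get("end"),
                        "score": score,
                    }
                )
    return flagged
```

Running this on the aligned output of the same audio file under v3.3.3 and under an older version should show which segments regress.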