Running in Docker container results in the process being killed even for <30s audio #1266

catileptic · 2025-03-19T16:38:47Z

I am running faster-whisper in a Docker container.

The code I am using is this one (simplified for clarity):

import gc

model_size = "large-v3"

file_path = "..."

model = WhisperModel(model_size, device="cpu", compute_type="int8", cpu_threads=1, num_workers=1)

segments, _ = model.transcribe(file_path, vad_filter=True, beam_size=5, no_speech_threshold=0.6, condition_on_previous_text=False)

for segment in segments:
    print(f"[{segment.start:.2f}s -> {segment.end:.2f}s] {segment.text}")

if hasattr(model, 'model'):
    del model.model
if hasattr(model, 'feature_extractor'):
    del model.feature_extractor
if hasattr(model, 'hf_tokenizer'):
    del model.hf_tokenizer

del model

gc.collect()

When I run this on my own machine (Mac M2), it runs to the end for several small audio files that I tested (under 30s of audio and also over 30s of audio).

However, when I run it in the Docker container, the process will be killed (printing only the word Killed in the logs) when processing some short audio files (<30s), and certainly on audio files that are longer (for example, 2min). In Docker, i loop through several audio files to transcribe them, which is why I have followed the advice of threads describing OOM issues and I run del model and gc.collect() explicitly.

Even though I am trying to "clean up" after every transcription attempt, the process is still killed. Sometimes it fails on the first short audio file. Other times, the first short audio file (<30s) works just fine, but the following short audio file (<30s) always fails, the process is killed.

In Docker, I could never manage to transcribe the 2min audio file, the process is always killed.

I understand there is a memory leak in ctranslate2 (as per the thread linked above), but I'm surprised to see that this fails on Docker, even on very short audio files.

According to docker stats, the container usually uses around 3.9GiB memory out of around 7.8Gib available, so around 50%. Even after running del model and gc.collect(), the memory use stays at 50%.

What am I doing wrong here, or missing?

The text was updated successfully, but these errors were encountered:

catileptic mentioned this issue Apr 18, 2025

Make audio and video searchable dataresearchcenter/ingest-file#5

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Running in Docker container results in the process being killed even for <30s audio #1266

Running in Docker container results in the process being killed even for <30s audio #1266

catileptic commented Mar 19, 2025

Running in Docker container results in the process being killed even for <30s audio #1266

Running in Docker container results in the process being killed even for <30s audio #1266

Comments

catileptic commented Mar 19, 2025