Can pyramid-flow utilize kv-cache to reduce computation? #227
It seems that the history condition latents are recomputed for every frame prediction, but I think a KV cache could be used to eliminate this redundant computation. Is that true?

Comments
Yes, but due to the complex temporal compression design, I'm afraid a KV cache wouldn't save much computation here.
Thanks. So the model will run slower and slower for later frames, since more and more condition frames are used. Is that true?
Yes, unless you apply some method to truncate the history condition, such as a sliding window.
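A minimal sketch of the sliding-window idea mentioned above, assuming the history is kept as a list of per-frame latent tensors; the names `history_latents`, `window_size`, and `model.predict_next_frame` are illustrative placeholders, not Pyramid-Flow's actual API:

```python
import torch

def truncate_history(history_latents: list[torch.Tensor], window_size: int) -> list[torch.Tensor]:
    """Keep only the most recent `window_size` frames of history latents.

    Bounding the number of condition tokens per step keeps the per-frame
    cost roughly constant instead of growing with video length.
    """
    return history_latents[-window_size:]

# Illustrative use inside an autoregressive generation loop:
# history = []
# for step in range(num_frames):
#     cond = truncate_history(history, window_size=4)
#     new_latent = model.predict_next_frame(cond, ...)  # hypothetical call
#     history.append(new_latent)
```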
Could you please explain in detail why a KV cache wouldn't save much computation? I am confused about that.
If you compress the history context aggressively, then most of the compute is spent on self-attention among the new frame tokens rather than on cross-attention between the new tokens and the history tokens. A KV cache can only reduce the latter part of the compute.
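To make this concrete, here is a back-of-the-envelope sketch of how the per-layer attention cost splits between the self part and the cross part that a KV cache could help with. The token counts are made-up assumptions for illustration, not Pyramid-Flow's actual dimensions:

```python
def attention_cost(n_new: int, n_hist: int) -> dict:
    """Rough per-layer attention score cost (constant factors ignored).

    New-frame queries attend to themselves (self part) and to the
    compressed history tokens (cross part). A KV cache avoids
    re-encoding the history, so it can only shrink the cross part.
    """
    self_part = n_new * n_new      # new tokens attending to new tokens
    cross_part = n_new * n_hist    # new tokens attending to history tokens
    return {
        "self": self_part,
        "cross": cross_part,
        "cross_fraction": cross_part / (self_part + cross_part),
    }

# Example: 4096 tokens for the new frame vs. 1024 tokens of aggressively
# compressed history.
print(attention_cost(4096, 1024))
# cross attention is only ~20% of the attention cost here, which bounds
# how much a KV cache could save.
```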