🏠
Working from home
Pinned Loading
-
pytorch/pytorch
pytorch/pytorch PublicTensors and Dynamic neural networks in Python with strong GPU acceleration
-
tpa_pytorch
tpa_pytorch PublicSimple (slightly optimized) implementation of Tensor Product Attention from the T6 paper with a KV cache
Python 1
-
mla_pytorch
mla_pytorch PublicSimple implementation of Multi Latent Attention from the Deepseek V2 paper https://arxiv.org/abs/2405.04434
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.