Retentive Network – notes

Useful links

Retentive Network: A Successor to Transformer for Large Language Models
Y. Sun, L. Dong, S. Huang, S. Ma, Y. Xia, J. Xue, J. Wang, F. Wei
arXiv:2307.08621 [cs.CL] (2023)

Official implementation on GitHub (link)

PyTorch implementation of RetNet by Jamie Stirling (link)

xPos paper (link)
