Paper
Document
Submit new version
Download
Flag content
Preprint
5

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Published
Apr 2, 2024
Peer Review
Show more
Save
TipTip
Document
Submit new version
Download
Flag content
5
TipTip
Save
Document
Submit new version
Download
Flag content