Preprint
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
Machine Learning
Computer Science
Engineering
Authors
David Raposo, Sam Ritter, Blake Richards, Timothy Lillicrap, Peter Humphreys, A. Santoro, Alberto Santoro (+4 authors)
Journal
arXiv (Cornell University)
Published
Apr 2, 2024
Peer Review
(1)