Paper

Paper

Changelog Terms Privacy Issues Docs Support Foundation About

Mixture-of-Depths: Dynamically allocating compute in tran... | ResearchHub

Paper

Paper

Changelog Terms Privacy Issues Docs Support Foundation About

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

7

Authors

David Raposo

David Raposo•Sam Ritter•Alberto Santoro

Published

April 2, 2024

Sign in to comment

Add a comment...

Supporters

Support the authors with ResearchCoin

Journal

arXiv (Cornell University)

Topics

Computer Science

Machine Learning

Artificial Intelligence

DOI

10.48550/arxiv.2404.02258

Other Formats

Supporters

Support the authors with ResearchCoin

Journal

arXiv (Cornell University)

Topics

Computer Science

Machine Learning

Artificial Intelligence

DOI

10.48550/arxiv.2404.02258

Other Formats