Publish
Home
Live
new
RH Journal
ResearchCoin
Grants
Funding
Browse
Journals
Hubs
Tools
Lab Notebook
Beta
Reference Manager
Resources
Verify Identity
Community
Support
About
Terms
Privacy
Issues
Docs
Paper
Log in
Sign up
Paper
15
Paper
Conversation
4
Grants
Reviews
Document
Submit new version
Download
Flag content
15
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Artificial Intelligence
Paleontology
Law
Show More
Authors
Keivan Alizadeh
,
Iman Mirzadeh
Dmitry Belenko
,
Karen Khatamifard
,
Minsik Cho
,
Carlo Mundo
,
Mohammad Rastegari
+5 authors
,
Mehrdad Farajtabar
Published
Dec 12, 2023
DOI
10.48550/arXiv.2312.11514
Posted by
Fettah Kiran
Save
Tip
Document
Submit new version
Download
Flag content
15
Tip
Save
Document
Submit new version
Download
Flag content
Paper
Conversation
4
Grants
Reviews
100%