Logo image
Open Research University homepage
Surrey researchers Sign in
Compressing Context to Enhance Inference Efficiency of Large Language Models
Other

Compressing Context to Enhance Inference Efficiency of Large Language Models

Yucheng Li, Bo Dong, Chenghua Lin and Frank Guerin
arXiv.org
Cornell University Library, arXiv.org
09/10/2023

Abstract

Context Efficiency Inference Large language models Redundancy

Metrics

2 Record Views

Details

Logo image

Usage Policy