Surrey researchers Sign in
Compressing Context to Enhance Inference Efficiency of Large Language Models
Preprint

Compressing Context to Enhance Inference Efficiency of Large Language Models

Yucheng Li, Bo Dong, Chenghua Lin and Frank Guerin
09/10/2023

Abstract

Computer Science - Computation and Language

Metrics

18 Record Views

Details

Usage Policy