EAGLE (Extrapolation Algorithm for Greater Language-model Efficiency) is a new baseline for fast decoding of Large Language Models (LLMs) with provable performance maintenance. This approach involves ...