In-context Learning

Vector-ICL: In-context Learning with Continuous Vector Representations

In standard autoregressive generation, an LLM predicts the next-token distribution, samples a discrete token, and then discards the distribution, passing only the sampled token as new input. To preserve this distribution's rich information, we …