This is the code repository for "Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell" [EMNLP 2024 Findings].
Large Language Models (LLMs) exhibit positional bias: they struggle to use information from the middle or end of long contexts. Our study examines LLMs' long-context reasoning by probing their hidden representations. We find that while LLMs encode the position of target information, they often fail to leverage it when generating accurate responses. This reveals a disconnect between information retrieval and utilization, a "know but don't tell" phenomenon. We further analyze the relationship between extraction time and final accuracy, offering insights into the underlying mechanics of transformer models.
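As a rough illustration of the probing setup, the sketch below trains a linear probe on a model's hidden states to test whether the position of a target fact is linearly decodable. The model choice (`gpt2` as a small stand-in), the toy needle-in-filler prompts, the probed layer, and the logistic-regression probe are all illustrative assumptions, not the paper's exact pipeline.

```python
# A minimal probing sketch, assuming a HuggingFace causal LM.
# Model, prompts, layer choice, and probe are illustrative assumptions,
# not the exact experimental pipeline of the paper.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from sklearn.linear_model import LogisticRegression

model_name = "gpt2"  # small stand-in; the paper studies larger LLMs
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

# Build toy long-context prompts with a "needle" fact at a known position.
prompts, position_labels = [], []
for label, pos in enumerate((0, 20, 39)):  # needle at start / middle / end
    for code in ("4271", "8135", "9056", "1742"):
        filler = [f"Sentence {i} is about nothing in particular." for i in range(40)]
        filler.insert(pos, f"The secret code is {code}.")
        prompts.append(" ".join(filler) + " Question: What is the secret code? Answer:")
        position_labels.append(label)

@torch.no_grad()
def last_token_hidden(prompt: str, layer: int):
    """Hidden state of the final prompt token at a given layer."""
    inputs = tokenizer(prompt, return_tensors="pt")
    out = model(**inputs, output_hidden_states=True)
    # out.hidden_states: tuple of (num_layers + 1) tensors [batch, seq, dim]
    return out.hidden_states[layer][0, -1].numpy()

# Fit a linear probe: if it separates the classes, the hidden states
# encode where the target information sits in the context.
feats = [last_token_hidden(p, layer=8) for p in prompts]
probe = LogisticRegression(max_iter=1000).fit(feats, position_labels)
print("probe accuracy:", probe.score(feats, position_labels))
```

In the paper's framing, probe accuracy of this kind is contrasted with the model's final answer accuracy: when the probe succeeds but generation fails, the model "knows but doesn't tell."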
If you find this work helpful, please consider citing the following:
```bibtex
@misc{lu2024insightsllmlongcontextfailures,
      title={Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell},
      author={Taiming Lu and Muhan Gao and Kuai Yu and Adam Byerly and Daniel Khashabi},
      year={2024},
      eprint={2406.14673},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2406.14673}
}
```