AI tricks
Prompt Engineering
- 2025-03: Prompting Science Report 1: Prompt Engineering is Complicated and Contingent
- 2024-06: The Prompt Report: A Systematic Survey of Prompting Techniques
In-Context Learning
- 2020-05: Language Models are Few-Shot Learners
- 2025-03: Learning to Search Effective Example Sequences for In-Context Learning
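To make the few-shot idea above concrete, here is a minimal sketch of assembling an in-context prompt from worked examples. The example data and the completion call are illustrative assumptions, not drawn from the papers listed.

```python
# Minimal few-shot prompt construction. `complete` (commented out below)
# is a stand-in for whatever LLM completion function you actually use.
from typing import List, Tuple

def build_few_shot_prompt(examples: List[Tuple[str, str]], query: str) -> str:
    """Concatenate worked input/output pairs, then the new query."""
    parts = [f"Input: {x}\nOutput: {y}" for x, y in examples]
    parts.append(f"Input: {query}\nOutput:")
    return "\n\n".join(parts)

examples = [
    ("cheerful", "positive"),
    ("dreadful", "negative"),
]
prompt = build_few_shot_prompt(examples, "delightful")
print(prompt)
# A real run would pass the prompt to a completion function:
# answer = complete(prompt)
```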
Chain of Thought (CoT)
"Let's think step-by-step"
Multi-step
Tool-use, feedback, agentic
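One way to picture the tool-use pattern: the model can request a tool call in its reply, the caller executes the tool, and the result is fed back for another model turn. The sketch below uses a stubbed model and a toy calculator; the "CALC:" protocol is an illustrative assumption, not a standard.

```python
# Minimal tool-use loop: the model (stubbed here) may request a calculator
# call via a "CALC: <expression>" line; the result is fed back as context.
import re

def calculator(expression: str) -> str:
    # Toy tool; a real deployment would sandbox evaluation.
    return str(eval(expression, {"__builtins__": {}}))

def model_stub(prompt: str) -> str:
    """Stand-in for an LLM: asks for the tool once, then answers."""
    if "Tool result" not in prompt:
        return "CALC: 37 * 91"
    return "Final answer: 3367"

def tool_loop(question: str, llm=model_stub, max_steps: int = 3) -> str:
    prompt = question
    for _ in range(max_steps):
        reply = llm(prompt)
        match = re.match(r"CALC:\s*(.+)", reply)
        if not match:
            return reply  # no tool request: treat as the final answer
        prompt += f"\nTool result: {calculator(match.group(1))}"
    return reply

print(tool_loop("What is 37 * 91?"))
```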
Retrieval-Augmented Generation (RAG)
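A minimal retrieve-then-generate sketch, assuming a toy keyword-overlap retriever; real systems use embedding search, and the corpus here is invented for illustration.

```python
# Toy RAG pipeline: rank documents by keyword overlap with the query,
# then stuff the top hits into the prompt. Embedding-based retrieval
# would replace `score` in practice.
import re
from typing import List

def tokens(text: str) -> set:
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def score(query: str, doc: str) -> int:
    return len(tokens(query) & tokens(doc))

def rag_prompt(query: str, corpus: List[str], k: int = 2) -> str:
    top = sorted(corpus, key=lambda d: score(query, d), reverse=True)[:k]
    context = "\n".join(f"- {d}" for d in top)
    return (f"Answer using only the context below.\n\n"
            f"Context:\n{context}\n\nQuestion: {query}\nAnswer:")

corpus = [
    "The Eiffel Tower is in Paris.",
    "Python was created by Guido van Rossum.",
    "The capital of Japan is Tokyo.",
]
print(rag_prompt("Who created Python?", corpus))
```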
Input/Output Formats
- 2024-08: LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
- 2024-11: Does Prompt Formatting Have Any Impact on LLM Performance?
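The two papers above report that accuracy can shift with the requested output format alone. A minimal sketch of how one might probe this: render the same question under several answer formats and compare results. The format list and the `llm`/`evaluate` stand-ins are assumptions for illustration.

```python
# Probe output-format sensitivity: same question, different answer formats.
FORMATS = {
    "plain":  "Answer with a single word.",
    "json":   'Answer as JSON: {"answer": "<word>"}.',
    "letter": "Answer with the letter of the correct option only.",
}

def format_variants(question: str) -> dict:
    return {name: f"{question}\n\n{instr}" for name, instr in FORMATS.items()}

for name, prompt in format_variants("Is water wet? Options: (A) yes (B) no").items():
    print(f"--- {name} ---\n{prompt}\n")
    # accuracy[name] = evaluate(llm(prompt))  # `llm`/`evaluate` are assumed stand-ins
```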
Position Bias
- 2023-07: Lost in the Middle: How Language Models Use Long Contexts
- 2024-11: Self-Consistency Falls Short! The Adverse Effects of Positional Bias on Long-Context Problems (https://arxiv.org/abs/2411.01101)
- 2025-02: On the Emergence of Position Bias in Transformers (https://arxiv.org/abs/2502.01951)
- Testing models (a minimal probe sketch follows this list):
  - Needle-in-a-Haystack tests (https://github.com/gkamradt/LLMTest_NeedleInAHaystack)
  - 2023-08: LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding (https://arxiv.org/abs/2308.14508)
  - 2024-02: ∞Bench: Extending Long Context Evaluation Beyond 100K Tokens
  - 2024-06: Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models (https://arxiv.org/abs/2406.11230)
  - 2024-07: Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack (https://arxiv.org/abs/2407.16695)
  - 2025-04: Reasoning on Multiple Needles In A Haystack (https://arxiv.org/abs/2504.04150)
- Mitigation:
  - 2023-10: Attention Sorting Combats Recency Bias In Long Context Language Models (https://arxiv.org/abs/2310.01427)
  - 2024-07: Eliminating Position Bias of Language Models: A Mechanistic Approach (https://arxiv.org/abs/2407.01100)
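As referenced in the Testing models list above, needle-in-a-haystack tests embed a known fact at a controlled depth inside filler text and check whether the model can retrieve it. A minimal sketch of constructing such a probe; the filler text, needle, and `llm` callable are illustrative assumptions.

```python
# Minimal needle-in-a-haystack probe: place a known fact ("needle") at a
# chosen relative depth inside filler text, then ask the model for it.
FILLER = "The sky was clear and the grass was green. " * 200
NEEDLE = "The secret passcode is 7-alpha-9."
QUESTION = "What is the secret passcode?"

def haystack_prompt(depth: float) -> str:
    """depth=0.0 puts the needle at the start of the context, 1.0 at the end."""
    cut = int(len(FILLER) * depth)
    context = FILLER[:cut] + NEEDLE + " " + FILLER[cut:]
    return f"{context}\n\n{QUESTION}"

for depth in (0.0, 0.5, 1.0):
    prompt = haystack_prompt(depth)
    print(f"depth={depth}: prompt length {len(prompt)} chars")
    # correct = "7-alpha-9" in llm(prompt)  # `llm` is an assumed completion function
```

Sweeping `depth` across the context is what surfaces position bias: retrieval accuracy often dips when the needle sits in the middle, per "Lost in the Middle" above.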