“…LLMs have already reached state-of-the-art (SoA) performance in various tasks, and the choice of prompt has a significant impact on performance. As a result, many related works investigate techniques to further improve prompt construction and effectiveness [54,81,88], and explore the robustness of prompts to permutations of the demonstrative examples [107], their sensitivity to negations [37], and their ability to generalize across different LLMs [79].…”