V Vajrobol, BB Gupta, A Gaurav - Computers and Electrical Engineering, 2024 - Elsevier
Instruction attack is a malicious attempt to manipulate a chatbot by providing misleading or
harmful prompts to achieve unintended outcomes. Detecting instruction attacks is crucial to …