EmoRAG: The proposed attack method that uses emoticons as triggers to hijack RAG retrieval
RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
ASR: Attack Success Rate—the percentage of times the system retrieves the attacker's target document instead of relevant information
F1 score: A metric balancing precision and recall, used here to measure how effectively the attacker's documents are retrieved
symbolic perturbation: Adding non-semantic symbols (like emoticons) to text to alter its processing by the model
MTEB: Massive Text Embedding Benchmark—a standard leaderboard for evaluating text embedding models
Garbled text: Random or nonsensical character sequences, often used in traditional adversarial attacks but easily detectable
NQ: Natural Questions—a standard question-answering dataset used for evaluation
MS-MARCO: A large-scale information retrieval dataset
dense passage retrieval: Retrieving documents by comparing vector embeddings of queries and passages
positional embeddings: Vectors added to token embeddings to indicate their order in the sequence; EmoRAG exploits these to shift query representation