Advanced RAG-Tool Fusion: The paper's proposed ensemble method applying pre-, intra-, and post-retrieval optimization techniques to tool selection
Toolshed Knowledge Base: A vector database storing 'enhanced' tool documents (concatenating name, description, schema, synthetic questions) for retrieval
tool-M: The total number of tools available in the knowledge base (the search space size)
top-k: The number of tools retrieved and presented to the LLM agent's context window
Recall@k: A metric measuring the proportion of relevant tools found within the top-k retrieved results
HyDE: Hypothetical Document Embeddings—a RAG technique where the model generates a fake 'ideal' document (or question) to embed for better retrieval matching
BM25: Best Matching 25—a standard probabilistic information retrieval function that ranks documents based on term frequency and inverse document frequency
Zero-shot: Using a model to perform a task without any specific training examples or fine-tuning for that task