← Back to Paper List

Geography-Aware Large Language Models for Next POI Recommendation

Zhao Liu, Wei Liu, Huajie Zhu, Jianxing Yu, Jian Yin, Wang-Chien Lee, Shun Wang
Sun Yat-sen University, The Pennsylvania State University
arXiv (2025)
Recommendation P13N

📝 Paper Summary

Next Point-of-Interest (POI) Recommendation Spatial Representation Learning
GA-LLM improves next POI recommendation by transforming GPS coordinates into hierarchical quadkey embeddings and aligning external graph-based transition knowledge with the LLM's semantic space.
Core Problem
LLMs struggle with spatial tasks because they tokenize high-precision GPS coordinates inefficiently (leading to hallucinations) and lack global knowledge of POI transition patterns due to limited context windows.
Why it matters:
  • Text-only LLMs often predict hallucinated POIs that are semantically plausible but geographically impossible (far from user location)
  • High-precision GPS coordinates (e.g., >10 decimal places) generate excessive tokens, increasing computational cost and confusing semantic modeling
  • Limited prompts cannot fit entire user histories, causing LLMs to miss implicit POI-POI transition rules evident in long-term trajectories
Concrete Example: When a user visits 'AirTrain JFK station', a text-only LLM might predict a previously mentioned POI again. However, the ground truth is 'Kennedy Airport' (POI 404), which is not in the recent context but is a logical transition. GA-LLM captures this via transition patterns where text-only models fail.
Key Novelty
Geography-Aware Large Language Model (GA-LLM)
  • Geographic Coordinate Injection Module (GCIM): Discretizes GPS into hierarchical quadkeys and applies Fourier positional encoding to capture multi-scale spatial dependencies within the LLM.
  • POI Alignment Module (PAM): Projects pre-trained embeddings from external graph-based models (which capture global transition patterns) directly into the LLM's high-dimensional semantic space.
Architecture
Architecture Figure Figure 2
The GA-LLM framework architecture, detailing how user trajectories are processed into prompts and how the two specialized modules (GCIM and PAM) inject information.
Breakthrough Assessment
7/10
Addresses a critical weakness of LLMs (spatial numeracy) with a principled encoding scheme (Quadkeys + Fourier). The alignment of graph embeddings is a logical step for hybrid recommendation.
×