Blocks duplicate pages during crawler-sync using BigQuery vector search (text ≥ 0.95, multimodal ≥ 0.92). Embeddings are always generated regardless of this setting.
Estimates lat/lng for pages without coordinates during crawler-sync. Uses tiered approach: JSON-LD → structured address → city centroid → LLM (OpenRouter).