Earth Embeddings Reveal Diverse Urban Signals from Space
arXiv:2604.03456v1 Announce Type: new Abstract: Conventional urban indicators derived from censuses, surveys, and administrative records are often costly, spatially inconsistent, and slow to update. Recent geospatial foundation models enable Earth embeddings, compact satellite image representations transferable across downstream tasks, but their utility for neighborhood-scale urban monitoring remains unclear. Here, we benchmark three Earth embedding families, AlphaEarth, Prithvi, and Clay, for urban signal prediction across six U.S. metropolitan areas from 2020 to 2023. Using a unified supervised-learning framework, we predict 14 neighborhood-level indicators spanning crime, income, health, and travel behavior, and evaluate performance under four settings: global, city-wise, year-wise, and city-year. Results show that Earth embeddings capture substantial urban variation, with the highest predictive skill for outcomes more directly tied to built-environment structure, including chronic
arXiv:2604.03456v1 Announce Type: new Abstract: Conventional urban indicators derived from censuses, surveys, and administrative records are often costly, spatially inconsistent, and slow to update. Recent geospatial foundation models enable Earth embeddings, compact satellite image representations transferable across downstream tasks, but their utility for neighborhood-scale urban monitoring remains unclear. Here, we benchmark three Earth embedding families, AlphaEarth, Prithvi, and Clay, for urban signal prediction across six U.S. metropolitan areas from 2020 to 2023. Using a unified supervised-learning framework, we predict 14 neighborhood-level indicators spanning crime, income, health, and travel behavior, and evaluate performance under four settings: global, city-wise, year-wise, and city-year. Results show that Earth embeddings capture substantial urban variation, with the highest predictive skill for outcomes more directly tied to built-environment structure, including chronic health burdens and dominant commuting modes. By contrast, indicators shaped more strongly by fine-scale behavior and local policy, such as cycling, remain difficult to infer. Predictive performance varies markedly across cities but remains comparatively stable across years, indicating strong spatial heterogeneity alongside temporal robustness. Exploratory analysis suggests that cross-city variation in predictive performance is associated with urban form in task-specific ways. Controlled dimensionality experiments show that representation efficiency is critical: compact 64-dimensional AlphaEarth embeddings remain more informative than 64-dimensional reductions of Prithvi and Clay. This study establishes a benchmark for evaluating Earth embeddings in urban remote sensing and demonstrates their potential as scalable, low-cost features for SDG-aligned neighborhood-scale urban monitoring.
Executive Summary
This study investigates the potential of Earth embeddings in predicting urban signals, such as crime rates, income levels, and health outcomes, from satellite images. By benchmarking three Earth embedding families (AlphaEarth, Prithvi, and Clay) across six U.S. metropolitan areas, the authors demonstrate that these embeddings can capture substantial urban variation, particularly in built-environment structure. However, predictive performance varies significantly across cities and to a lesser extent across years, underscoring the importance of spatial heterogeneity. The study also highlights the importance of representation efficiency, with compact AlphaEarth embeddings proving more informative than reduced Prithvi and Clay embeddings. This research has significant implications for scalable, low-cost urban monitoring and offers a promising approach for achieving the Sustainable Development Goals.
Key Points
- ▸ Earth embeddings can capture substantial urban variation in built-environment structure.
- ▸ Predictive performance varies significantly across cities and to a lesser extent across years.
- ▸ Compact AlphaEarth embeddings prove more informative than reduced Prithvi and Clay embeddings.
Merits
Strength in Novel Application
This study introduces a novel application of Earth embeddings in urban remote sensing, expanding their potential uses beyond geospatial foundation models.
Demerits
Limited Generalizability
The study's results may not be directly generalizable to non-U.S. metropolitan areas or cities with distinct urban forms.
Methodological Limitations
The authors' reliance on a single unified supervised-learning framework may introduce bias and limit the interpretability of their results.
Expert Commentary
The study's findings are significant, as they demonstrate the potential of Earth embeddings in capturing urban variation and predicting outcomes. However, the authors' reliance on a single unified supervised-learning framework limits the interpretability of their results. Furthermore, the study's focus on U.S. metropolitan areas may introduce limitations in generalizability. To further solidify the findings, future research should explore the application of Earth embeddings in diverse urban contexts and investigate the role of contextual factors in shaping predictive performance. The study's results also highlight the importance of representation efficiency in Earth embeddings, underscoring the need for further research on this topic.
Recommendations
- ✓ Future research should investigate the application of Earth embeddings in non-U.S. metropolitan areas and cities with distinct urban forms.
- ✓ Researchers should explore the role of contextual factors in shaping predictive performance and develop more interpretable and robust methods for evaluating Earth embeddings.
Sources
Original: arXiv - cs.LG