Document Type : Full length article
Authors
Department of RS and GIS, Faculty of Geography, University of Tehran, Tehran, Iran
Abstract
ABSTRACT
The protection and identification of archaeological sites have become increasingly important due to threats posed by urbanization, agricultural expansion, and infrastructure development. While traditional field surveys are effective, they are often time-consuming and costly. Spatial predictive modeling using environmental variables and known site data provides an efficient tool for identifying areas with high archaeological potential. This study compares two predictive modeling methods—Weights of Evidence (WoE) and Maximum Entropy (MaxEnt)—for identifying archaeological sites in North Khorasan Province, Iran. This region holds archaeological significance following the recent discovery of an Achaemenid palace near Ashkhaneh. A dataset of 980 archaeological sites was randomly divided into training (70%) and validation (30%) sets. Six environmental variables—elevation, slope, distance from rivers, vegetation cover, proximity to settlements, and geology—were processed in a GIS environment at a 30-meter resolution. The WoE model was implemented using a Bayesian approach and conditional independence testing (Chi-square), while MaxEnt was applied without requiring variable independence. Results showed that MaxEnt outperformed WoE, with training and validation success rates of 88.65% and 88.20%, respectively, compared to 83.26% and 83.00% for WoE. Proximity to settlements and rivers were the most influential predictive factors. Due to its flexibility and independence from variable assumptions, MaxEnt is more suitable for complex environments, whereas WoE remains effective for smaller-scale projects due to its simplicity. Future research is recommended to incorporate hybrid models and high-resolution datasets to enhance prediction accuracy.
Extended Abstract
Introduction
In recent decades, the protection and identification of archaeological sites have become increasingly critical due to the growing threats posed by urbanization, agricultural expansion, and infrastructure development. Conventional archaeological field surveys, while effective, are typically time-consuming, labor-intensive, and costly. Additionally, unintentional damage to undiscovered sites during construction or land conversion is a growing concern. To address these challenges, predictive spatial modeling has emerged as a powerful tool in archaeological research. By integrating environmental factors and the known locations of archaeological sites into statistical and geospatial models, researchers can generate potential maps that estimate the likelihood of undiscovered settlements. These maps are invaluable for guiding targeted surveys, preserving cultural heritage, and allocating research resources more effectively.
This study aims to compare two prominent predictive modeling approaches, Weights of Evidence (WoE) and Maximum Entropy (MaxEnt), in their ability to identify areas with high potential for archaeological settlements. The research focuses on North Khorasan Province in northeastern Iran, a region of emerging archaeological significance following the discovery of a large Achaemenid structure near the city of Ashkhaneh. Given the limited number of previously documented sites in the area, especially from the Achaemenid period, the region offers a compelling test case for evaluating the utility of predictive modeling in uncovering hidden archaeological landscapes.
Methodology
The research utilized a dataset comprising 980 known archaeological sites in North Khorasan. These were randomly divided into training (70%) and validation (30%) subsets. Six environmental variables, elevation, slope, distance from rivers, distance from vegetation, distance from villages, and geology, were selected based on their hypothesized influence on ancient human settlement patterns. All environmental data were resampled to a 30-meter spatial resolution and processed using GIS tools. For the WoE model, a Bayesian approach was employed, requiring an initial independence test among variable pairs using the chi-square method. Based on the test results, six combinations of conditionally independent variables were selected to create potential maps. Each map was generated by overlaying weighted raster layers corresponding to the chosen variables. In contrast, the MaxEnt model did not require conditional independence among variables. It utilized all selected environmental factors simultaneously, applying the principle of maximum entropy to estimate the most uniform probability distribution consistent with the input data. MaxEnt’s flexibility makes it particularly suitable for complex ecological or archaeological contexts where variable interdependence is likely to occur. To ensure consistency between the two modeling approaches, the same variable combinations used in the WoE model were applied in MaxEnt. For both models, success rates were calculated by measuring the proportion of known site locations falling within high-probability areas on the generated maps.
Results and discussion
The analysis revealed that MaxEnt consistently outperformed WoE in terms of predictive accuracy. In the best-performing scenario, MaxEnt achieved a training success rate of 88.65% and a validation rate of 88.20%, while WoE’s top result was 83.26% for training and 83.00% for validation. This difference, although not drastic, suggests that MaxEnt may be more reliable in identifying areas with high archaeological potential, particularly in data-rich or complex environments. Variable contribution analysis in MaxEnt showed that proximity to villages and rivers was the most significant predictor of site locations. In some combinations, these two variables accounted for over 90% of the model's predictive power. This result aligns with archaeological theory, which emphasizes access to water, arable land, and inter-settlement connectivity as key factors influencing ancient settlement decisions. Slope and elevation were moderately influential, while geological factors played a relatively minor role.
Visual inspection of the predictive maps further demonstrated the superiority of MaxEnt. The maps it produced featured clearer transitions between high- and low-potential areas, as well as more precise delineation of low-probability zones. This level of detail is essential for guiding field surveys and minimizing effort in areas with low archaeological potential. While WoE also produced reasonable results, its performance was limited by the requirement for variable independence and its comparatively lower spatial resolution in modeling subtle environmental gradients. From a practical standpoint, WoE has the advantage of simplicity. It requires less computational power and is easier to interpret, making it suitable for smaller-scale projects or regions with limited data availability. However, the methodological constraints inherent to WoE, particularly the assumption of conditional independence, can reduce model flexibility and accuracy when dealing with complex real-world environments.
Conclusion
This comparative study demonstrates that MaxEnt offers a more robust and flexible framework for predictive archaeological modeling in North Khorasan. Its ability to incorporate multiple environmental variables without imposing statistical independence assumptions, along with its transparent variable contribution metrics, makes it especially advantageous for exploratory research and regional planning. Moreover, the superior predictive accuracy of MaxEnt suggests that it can play a critical role in heritage management, conservation planning, and the efficient allocation of archaeological survey resources. Despite these advantages, WoE should not be overlooked. Its lower computational requirements and intuitive Bayesian foundation still make it a valuable tool, particularly in resource-constrained settings. Future research should consider hybrid approaches that combine the strengths of both methods, for instance, using WoE to narrow down variable sets before applying MaxEnt for final predictions. Further studies should also incorporate high-resolution satellite imagery, socio-cultural and historical datasets, and alternative modeling techniques such as machine learning and fuzzy logic. Validation of models in different regions and landscape types will be essential for assessing the transferability and robustness of the predictive frameworks established in this study.
Funding
There is no funding support.
Authors’ Contribution
All of the authors approved the content of the manuscript and agreed on all aspects of the work.
Conflict of Interest
Authors declared no conflict of interest.
Acknowledgments
We are grateful to all the scientific consultants of this paper.
Keywords
Main Subjects