Balancing Exploration and Exploitation in Robotics: Path Optimization and Uncertainty Management in Complex Environments

dc.contributor.authorKarakose, Perihan
dc.contributor.authorBal, Cafer
dc.contributor.authorYetkin, Harun
dc.date.accessioned2026-02-22T11:43:55Z
dc.date.created2026
dc.date.issued2026
dc.departmentBartın Üniversitesi
dc.description.abstractBalancing exploration and exploitation is a fundamental challenge in informative path planning for environmental monitoring. Although numerous reward functions have been proposed in the literature, most have been evaluated under different datasets and experimental conditions, making direct comparison difficult. The novelty of this study lies in its development of four synthetic datasets, experimentally validated and designed with increasing spatial complexity (1 to 4 Regions of Interest, ROIs), to enable a fair and systematic comparison of three widely used Gaussian Process-based reward functions: Entropy, Upper Confidence Bound (UCB), and Level Set. The proposed framework integrates a greedy local path optimization algorithm that maximizes expected reward and incorporates a cross-validation strategy to reduce initial model variance and mitigate overfitting. Importantly, this study not only compares the individual performances of these reward functions but also analyzes how each one contributes to the trade-off between exploration and exploitation under varying environmental conditions. Experimental results show that Level Set performs best in high-variance environments (favoring exploration), UCB excels in low-variance settings with fast convergence (favoring exploitation), and Entropy provides stable long-term uncertainty reduction (balancing both aspects). With the inclusion of cross-validation, the model achieves up to 60% reduction in RMSE and 50% reduction in variance across all scenarios. These findings highlight the practical value of reward-aware path planning in robotic exploration tasks, particularly when aligned with the spatial complexity of the monitoring environment.
dc.identifier.doi10.1007/s10846-025-02339-9
dc.identifier.issn0921-0296
dc.identifier.issn1573-0409
dc.identifier.issue1
dc.identifier.scopus2-s2.0-105027942914
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org/10.1007/s10846-025-02339-9
dc.identifier.urihttps://hdl.handle.net/11772/26848
dc.identifier.volume112
dc.identifier.wosWOS:001665994100001
dc.identifier.wosqualityQ3
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherSpringer
dc.relation.ispartofJournal of Intelligent & Robotic Systems
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_WoS_20260218
dc.subjectEnvironmental monitoring
dc.subjectGaussian regression
dc.subjectMachine learning
dc.subjectInformative path planning
dc.titleBalancing Exploration and Exploitation in Robotics: Path Optimization and Uncertainty Management in Complex Environments
dc.typeArticle
dspace.entity.typePublication

Dosyalar