Balancing Exploration and Exploitation in Robotics: Path Optimization and Uncertainty Management in Complex Environments

Karakose, Perihan; Bal, Cafer; Yetkin, Harun

doi:10.1007/s10846-025-02339-9

Balancing Exploration and Exploitation in Robotics: Path Optimization and Uncertainty Management in Complex Environments

dc.contributor.author	Karakose, Perihan
dc.contributor.author	Bal, Cafer
dc.contributor.author	Yetkin, Harun
dc.date.accessioned	2026-02-22T11:43:55Z
dc.date.created	2026
dc.date.issued	2026
dc.department	Bartın Üniversitesi
dc.description.abstract	Balancing exploration and exploitation is a fundamental challenge in informative path planning for environmental monitoring. Although numerous reward functions have been proposed in the literature, most have been evaluated under different datasets and experimental conditions, making direct comparison difficult. The novelty of this study lies in its development of four synthetic datasets, experimentally validated and designed with increasing spatial complexity (1 to 4 Regions of Interest, ROIs), to enable a fair and systematic comparison of three widely used Gaussian Process-based reward functions: Entropy, Upper Confidence Bound (UCB), and Level Set. The proposed framework integrates a greedy local path optimization algorithm that maximizes expected reward and incorporates a cross-validation strategy to reduce initial model variance and mitigate overfitting. Importantly, this study not only compares the individual performances of these reward functions but also analyzes how each one contributes to the trade-off between exploration and exploitation under varying environmental conditions. Experimental results show that Level Set performs best in high-variance environments (favoring exploration), UCB excels in low-variance settings with fast convergence (favoring exploitation), and Entropy provides stable long-term uncertainty reduction (balancing both aspects). With the inclusion of cross-validation, the model achieves up to 60% reduction in RMSE and 50% reduction in variance across all scenarios. These findings highlight the practical value of reward-aware path planning in robotic exploration tasks, particularly when aligned with the spatial complexity of the monitoring environment.
dc.identifier.doi	10.1007/s10846-025-02339-9
dc.identifier.issn	0921-0296
dc.identifier.issn	1573-0409
dc.identifier.issue	1
dc.identifier.scopus	2-s2.0-105027942914
dc.identifier.scopusquality	Q1
dc.identifier.uri	https://doi.org/10.1007/s10846-025-02339-9
dc.identifier.uri	https://hdl.handle.net/11772/26848
dc.identifier.volume	112
dc.identifier.wos	WOS:001665994100001
dc.identifier.wosquality	Q3
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Springer
dc.relation.ispartof	Journal of Intelligent & Robotic Systems
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/openAccess
dc.snmz	KA_WoS_20260218
dc.subject	Environmental monitoring
dc.subject	Gaussian regression
dc.subject	Machine learning
dc.subject	Informative path planning
dc.title	Balancing Exploration and Exploitation in Robotics: Path Optimization and Uncertainty Management in Complex Environments
dc.type	Article
dspace.entity.type	Publication

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

Balancing Exploration and Exploitation in Robotics: Path Optimization and Uncertainty Management in Complex Environments

Dosyalar

Koleksiyon