AutoDiagn: An Automated Real-Time Diagnosis Framework for Big Data Systems

dc.contributor.authorDemirbaga, Ümit
dc.contributor.authorWen, Zhenyu
dc.contributor.authorNoor, Ayman
dc.contributor.authorMitra, Karan
dc.contributor.authorAlwasel, Khaled
dc.contributor.authorGarg, Saurabh
dc.contributor.authorZomaya, Albert Y.
dc.contributor.authorDemirbaga, Ümit
dc.date.accessioned2025-10-18T10:10:27Z
dc.date.created2022
dc.date.issued2022
dc.departmentFakülteler, Mühendislik Mimarlık ve Tasarım Fakültesi, Bilgisayar Mühendisliği Bölümü
dc.description.abstractBig data processing systems, such as Hadoop and Spark, usually work in large-scale, highly-concurrent, and multi-tenant environments that can easily cause hardware and software malfunctions or failures, thereby leading to performance degradation. Several systems and methods exist to detect big data processing systems' performance degradation, perform root-cause analysis, and even overcome the issues causing such degradation. However, these solutions focus on specific problems such as stragglers and inefficient resource utilization. There is a lack of a generic and extensible framework to support the real-time diagnosis of big data systems. In this article, we propose, develop and validate AutoDiagn. This generic and flexible framework provides holistic monitoring of a big data system while detecting performance degradation and enabling root-cause analysis. We present an implementation and evaluation of AutoDiagn that interacts with a Hadoop cluster deployed on a public cloud and tested with real-world benchmark applications. Experimental results show that AutoDiagn can offer a high accuracy root-cause analysis framework, at the same time as offering a small resource footprint, high throughput, and low latency.
dc.description.sponsorshipTurkish Ministry of National Education [EP/T021985/1, EP/R033293/1, EP/T022582/1]; National Natural Science Foundation of China [62072408]; Zhejiang Provincial Natural Science Foundation of China [LY20F020030]
dc.description.sponsorshipThis work was supported in part by the Turkish Ministry of National Education, in part by the following UKRI projects through SUPER under Grant EP/T021985/1, through PACE under Grant EP/R033293/1, and through Centre for Digital Citizens under Grant EP/T022582/1, in part by the National Natural Science Foundation of China under Grant 62072408, and in part by the Zhejiang Provincial Natural Science Foundation of China under Grant LY20F020030.
dc.identifier.doi10.1109/TC.2021.3070639
dc.identifier.endpage1048
dc.identifier.issn0018-9340
dc.identifier.issn1557-9956
dc.identifier.issue5
dc.identifier.orcidZomaya, Albert/0000-0002-3090-1059
dc.identifier.orcidNoor, Ayman/0000-0002-3344-2847
dc.identifier.orcidGarg, Saurabh Kumar/0000-0001-8719-284X
dc.identifier.orcidMitra, Karan/0000-0003-3489-7429
dc.identifier.orcidDemirbaga, Umit/0000-0001-5159-0723;
dc.identifier.scopus2-s2.0-85103783969
dc.identifier.scopusqualityQ1
dc.identifier.startpage1035
dc.identifier.urihttps://doi.org/10.1109/TC.2021.3070639
dc.identifier.urihttps://hdl.handle.net/11772/21875
dc.identifier.volume71
dc.identifier.wosWOS:000778905700004
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherIEEE Computer Soc
dc.relation.ispartofIeee Transactions on Computers
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzWoS_20251016
dc.subjectBig Data
dc.subjectMonitoring
dc.subjectTask Analysis
dc.subjectReal-Time Systems
dc.subjectDegradation
dc.subjectMeasurement
dc.subjectData Visualization
dc.subjectRoot-Cause Analysis
dc.subjectBig Data Systems
dc.subjectQos
dc.subjectHadoop
dc.subjectPerformance
dc.titleAutoDiagn: An Automated Real-Time Diagnosis Framework for Big Data Systems
dc.typeArticle
dspace.entity.typePublication
relation.isAuthorOfPublication6197518d-2220-4e55-aa0a-5fc7d5c6606d
relation.isAuthorOfPublication.latestForDiscovery6197518d-2220-4e55-aa0a-5fc7d5c6606d

Dosyalar