Scalability Versus Accuracy Trade-offs in Distributed Big Data Processing Frameworks: A Comparative Evaluation of Apache Spark, Flink, and Dask Using Benchmark Datasets
- Authors
- Isiaq O. ALABI, Hassan T. ABDULAZEEZ, Sulaiman AHMAD, Yahaya M. SANI
- Keywords:
- Big data processing; Distributed computing; Apache Spark; Apache Flink; Dask; Performance benchmarking; Fault tolerance.
- Abstract
The exponential growth in data volume, velocity, and variety has intensified demand for distributed processing frameworks that balance computational scalability with analytical accuracy. Apache Spark, Apache Flink, and Dask are three dominant open-source ecosystems, yet selecting an appropriate framework requires a nuanced understanding of their performance characteristics under diverse workloads. This study presents a systematic comparative evaluation of these frameworks using two standardized benchmarks, the Transaction Processing Performance Council Decision Support benchmark (TPC-DS) at the 100 GB scale factor and HiBench version 7.1, across four dimensions: execution time, memory consumption, fault tolerance, and result consistency. Experiments were conducted on Amazon Web Services EC2 infrastructure using identical c5.4xlarge instances (16 vCPUs, 32 GB RAM) configured in standalone cluster mode. Results show that Spark performed best on batch-oriented SQL workloads, completing 92 of 99 TPC-DS queries with the lowest average runtime (18% faster than Flink and 32% faster than Dask). Flink exhibited superior latency characteristics and exactly-once processing semantics, recovering from simulated node failures within 12 seconds compared with 45 seconds for Spark. Dask was competitive on iterative machine learning tasks but showed higher memory volatility and occasional floating-point inconsistencies during fault recovery. These findings provide empirical guidance for practitioners designing analytics pipelines in domains that require both timeliness and computational precision, including cybersecurity threat detection and financial analytics.
- Published
- 25-04-2026
- Section
- Articles
- License
- Copyright (c) 2026 FUDMA Journal of Engineering and Technology. This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
