Graph processing frameworks
URL: https://doi.org/10.1007/978-3-319-77525-8_283
This chapter provides a comprehensive overview of Graph Processing Frameworks (GPFs), which are software systems designed to efficiently process large-scale graphs across distributed and parallel computing environments. It begins by defining GPFs as toolsets for representing data as vertices and edges, emphasizing their role in managing the three challenges of big data—volume, velocity, and variety. The chapter classifies GPFs along four major dimensions: target platform, computation model, graph processing approach, and communication model. It reviews key frameworks—such as Surfer, Pegasus, GBase, Pregel, GraphX, Giraph, Gelly, GraphLab, GraphChi, and Totem—covering their underlying architectures, data abstractions, and parallelization strategies, including MapReduce, Bulk Synchronous Parallel (BSP), and Gather-Apply-Scatter (GAS) models. A practical example using Apache Flink’s Gelly demonstrates how GPF APIs can be applied to compute the degree distribution of a large social network, highlighting performance scalability with multicore systems. The chapter concludes by discussing future research directions, including graph partitioning in heterogeneous (CPU–GPU) environments, the need for standardized benchmarks, and challenges in performance evaluation and algorithmic efficiency. Overall, it situates GPFs as essential infrastructures for large-scale graph analytics in domains such as social networks, bioinformatics, and cybersecurity.
Información adicional
Campo | Valor |
---|---|
Última actualización de los datos | 10 de octubre de 2025 |
Última actualización de los metadatos | 10 de octubre de 2025 |
Creado | 10 de octubre de 2025 |
Formato | HTML |
Licencia | No se ha provisto de una licencia |
Id | 74ad34b0-6ae8-4c70-98cb-6c8dcafbde68 |
Package id | 7d26b887-9381-4f45-829b-4fb6ac60a43c |
Position | 1 |
State | active |