A Dockerized Big Data Architecture for Sports Analytics

doi:10.21203/rs.3.rs-524005/v1

Download PDF

Research Article

A Dockerized Big Data Architecture for Sports Analytics

https://doi.org/10.21203/rs.3.rs-524005/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 01 Jan, 2022

Read the published version in Computer Science and Information Systems →

Version 1

posted

You are reading this latest preprint version

The revolution of big data has also affected the area of sports analytics. Many big companies have started to see the benefits of combining sports analytics and big data to make a profit. Aggregating and processing big sport data from different sources becomes challenging if we rely on central processing techniques, which hurts the accuracy and the timeliness of the information. Distributed systems come to the rescue as a solution to these problems and the MapReduce paradigm is promising for large-scale data analytics. In this study, we present a big data architecture based on Docker containers in Apache Spark. We demonstrate the architecture on four data-intensive case studies including structured analysis, streaming, machine learning methods, and graph-based analysis in sport analytics, showing ease of use.

Theoretical Computer Science

Big data

sports analytics

Apache Spark

containers

wearable devices

IoT

Download PDF

Journal Publication

published 01 Jan, 2022

Read the published version in Computer Science and Information Systems →

Version 1

posted

You are reading this latest preprint version

A Dockerized Big Data Architecture for Sports Analytics

Status:

Journal Publication

Version 1

Abstract

Full Text

Status:

Journal Publication

Version 1