Leveraging Cloudera Big Data Platform with Spark ETL and Kafka for Data Processing in the Travel Industry with GDS Integration
Author(s): Syed Ziaurrahman Ashraf
Publication #: 2410015
Date of Publication: 07.12.2019
Country: USA
Pages: 1-6
Published In: Volume 5 Issue 6 December-2019
DOI: https://doi.org/10.5281/zenodo.13949982
Abstract
Integrating Cloudera’s Big Data platform with Apache Kafka and Apache Spark creates a powerful architecture for real-time and batch data processing across industries, particularly in travel. This paper explores how Global Distribution Systems (GDS) in the travel industry can leverage these technologies to optimize data processing, enhance customer experiences, and improve operational efficiencies. We delve into the architecture, use cases, and benefits of this stack within the travel sector. The paper includes technical diagrams, pseudocode, and visual aids to provide an in-depth understanding of the implementation and its impact on GDS.
Keywords: Cloudera, Apache Spark, Apache Kafka, ETL, Real-time Processing, GDS, Travel Industry, Big Data, Data Pipeline, Streaming Data
Download/View Count: 135
Share this Article