Oct 02, 2024  
2024 2025 Academic Catalog 
    
2024 2025 Academic Catalog
Add to Portfolio (opens a new window)

CPSC 651 - Big Data Systems & Analysis


3 Credit(s)

Program or Course Description; This course will introduce the state-of-arts computing platforms with the focus on how to utilize them in processing (managing and analyzing) massive datasets. Specifically, we will discuss the MapReduce (Hadoop) framework, which provides the most accessible and practical means of computing in the Cloud. We will also introduce the emerging distributed database and services, such as HBase, Pig/Hive for large scale data analysis. Finally, we will utilize several key data processing tasks, including simple statistics, data aggregation, join processing, frequent pattern mining, data clustering, information retrieval, and other machine learning analytics as the case study for large scale data processing.



Add to Portfolio (opens a new window)