May 09, 2025  
2024-2025 Academic Catalog
    

CPSC 652 - Hadoop and NoSQL DB


3 Credit(s)

This course traces data storage techniques from relational database management systems (RDBMS) through NoSQL databases to big data on cloud and Hadoop platforms. It covers the main classes of distributed databases, including when and how to use key-value stores such as Redis and document-oriented databases such as MongoDB. The course shows how to work with HBase as a wide-column store and with InfluxDB as a time series database. It also covers Elasticsearch as a search engine and Neo4j as a graph database management system. Students will learn how large-scale distributed data storage and processing work in Hadoop, and when and how to build a streaming architecture with Apache Kafka. The course covers Apache Hive and where it fits within big data platforms, and it surveys several SQL-on-Hadoop engines and how they work. Students will come away understanding how to use data engineering technology to enable a data-driven organization.


