Nov 08, 2024  
Graduate Catalog 2020-2022 
    
Graduate Catalog 2020-2022 [ARCHIVED CATALOG]

CS 6030 - Data Warehousing


Credits: 3

Description This course introduces the student to the major activities involved in data warehousing application design and implementation. The course starts with an in-depth discussion of the basic concepts and principles of data warehousing, then studies the changes dictated by big data analytics, We discuss the MapReduce framework and its implementation Hadoop and the higher level language HiveQL. We discuss the two popular database architectures, column store databases and in-memoryDBS. We also discuss real-time data warehousing and extract, transform and load (ETL) paradigms used in data warehousing and business intelligence. The students will carry out a simple warehousing application in groups.

Prerequisites: CS 6020 .