Skip to content

j-balkovec/CPSC4330

Repository files navigation

CPSC 4330: Big Data Analytics

Course Overview

This repository contains coursework and projects for the Big Data Analytics class at Seattle University. The course focuses on big data processing using Hadoop, MapReduce, Hive, and Spark.

Topics Covered

  • Hadoop Ecosystem

    • Hadoop Architecture
    • Hadoop Distributed File System (HDFS)
  • MapReduce

    • Programming Model
    • Common Algorithms
    • Implementing MapReduce in Java
  • Apache Spark

    • Spark Basics
    • Spark SQL
  • Hive

    • Hive Basics
    • Hive Optimization

About

Big Data Analytics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published