Robert C. Green II, Ph.D.
  • Teaching
  • Research
  • Service
  • Software
  • Students
  • Projects
  • Videos

CS/DATA 6500: Big Data Analytics

🗄

HDFS

Distributed Storage Foundations

Week 2
  • HDFS: Fundamentals
⚙️

MapReduce

Parallel Computation Model

Weeks 3–4
  • MapReduce: Fundamentals
  • MapReduce: mrjob
  • MapReduce: Practice
🔥

Spark

Modern Distributed Analytics

Weeks 5–6
  • Spark: Fundamentals
  • Spark: RDDs & PySpark
  • Spark: DataFrames
  • Spark: SQL & Advanced DataFrame Ops