Detailed Course Outline
Day 1
- Overview of Big Data
- Ingestion, Transfer, and Compression
- Storage Solutions
- Storing and Querying Data on DynamoDB
- Big Data Processing and Amazon Kinesis
- Introduction to Apache Hadoop and Amazon EMR
- Using Amazon Elastic MapReduce
Day 2
- Hadoop Programming Frameworks
- Processing Server Logs with Hive on Amazon EMR
- Processing Chemistry Data Using Hadoop Streaming on Amazon EMR
- Streamlining Your Amazon EMR Experience with Hue
- Running Pig Scripts in Hue on Amazon EMR
- Spark on Amazon EMR
- Interactively Creating and Querying Tables with Spark and Spark SQL on Amazon EMR
- Managing Amazon EMR Costs
- Securing your Amazon EMR Deployments
Day 3
- Data Warehouses and Columnar Datastores
- Amazon Redshift and Big Data
- Optimizing Your Amazon Redshift Environment
- Big Data Design Patterns
- Visualizing and Orchestrating Big Data
- Using Tibco Spotfire to Visualize Big Data