Booking options
£68.99
£68.99
On-Demand course
16 hours 33 minutes
All levels
This course covers the important topics needed to pass the AWS Certified Data Analytics-Specialty exam (AWS DAS-C01). You will learn about Kinesis, EMR, DynamoDB, and Redshift, and get ready for the exam by working through quizzes, exercises, and practice exams, along with exploring essential tips and techniques.
In this course, you will learn streaming massive data with AWS Kinesis; queuing messages with Simple Queue Service (SQS); wrangling the explosion data from the Internet of Things (IOT); transitioning from small to big data with the AWS Database Migration Service (DMS); storing massive data lakes with the Simple Storage Service (S3); optimizing transactional queries with DynamoDB; tying your big data systems together with AWS Lambda; making unstructured data query-able with AWS Glue, Glue ETL, Glue DataBrew, Glue Studio, and Lake Formation; processing data at an unlimited scale with Elastic MapReduce; applying neural networks at massive scale with deep learning, MXNet, and TensorFlow; applying advanced machine learning algorithms at scale with Amazon SageMaker; analyzing streaming data in real time with Kinesis Analytics; searching and analyzing petabyte-scale data with Amazon OpenSearch (formerly Elasticsearch) Service; querying S3 data lakes with Amazon Athena; hosting massive-scale data warehouses with Redshift and Redshift Spectrum; integrating smaller data with your big data using the Relational Database Service (RDS) and Aurora; visualizing your data interactively with QuickSight; and finally, keeping your data secure with encryption, KMS, HSM, IAM, Cognito, STS, and more. By the end of this course, you will be well-versed in the essential concepts and major domains necessary to pass the AWS DAS-C01 exam.
Store big data with S3 and DynamoDB in a scalable, secure manner
Move and transform massive data streams with Amazon Kinesis
Use the Hadoop ecosystem with AWS using Elastic MapReduce
Discover various methods to analyze big data
Visualize big data in the cloud using AWS QuickSight
Keep your data secure with encryption, KMS, HSM, IAM, Cognito, and STS
This course is for experienced technologists seeking certification in big data technologies through Amazon Web Services. If you are looking to achieve this certification, it is recommended to have associate-level certification first.
In this course, you will have lots of opportunities to reinforce your learning with hands-on exercises and quizzes. When you are done, this course includes a practice exam that's very similar to the real exam in difficulty, length, and style-so you will know if you are ready before you invest in taking it. We will also arm you with some valuable test-taking tips and strategies along the way.
Master the domains needed to pass the AWS Certified Data Analytics-Specialty exam (AWS DAS-C01) * Apply machine learning to massive datasets with Amazon ML, SageMaker, and deep learning * Find tips and techniques to analyze, visualize, and process big data
https://github.com/PacktPublishing/AWS-Certified-Data-Analytics-Specialty-2023-Hands-on
Frank Kane has spent nine years at Amazon and IMDb, developing and managing the technology that automatically delivers product and movie recommendations to hundreds of millions of customers all the time. He holds 17 issued patents in the fields of distributed computing, data mining, and machine learning. In 2012, Frank left to start his own successful company, Sundog Software, which focuses on virtual reality environment technology and teaches others about big data analysis.
Stéphane Maarek is a solutions architect, consultant, and software developer who has a particular interest in all things related to big data and analytics. He is also a bestseller instructor on Udemy for his courses on Apache Kafka, Apache NiFi, and AWS Lambda. He loves Apache Kafka and regularly contributes to the Apache Kafka project. Stéphane has also written a guest blog post that was featured on the Confluent website, the company behind Apache Kafka. He is also an AWS Certified Solutions Architect and has many years of experience with technologies such as Apache Kafka, Apache NiFi, Apache Spark, Hadoop, PostgreSQL, Tableau, Spotfire, Docker, Ansible, and more.
1. Introduction
1. Course Overview This video provides an overview of the course. |
2. Introducing our Hands-On Case Study: Cadabra.com This video presents the case study on Cadabra.com. |
3. Cost of the Course + AWS (Amazon Web Services) Budget Setup This video highlights the cost of the course and demonstrates how to set up your AWS budget. |
2. Domain 1: Collection
1. Collection Section Introduction This video introduces the collection section. |
2. Kinesis Data Streams Overview This video provides an overview of Amazon Kinesis data streams. |
3. Kinesis Producers This video explains the concept of Kinesis producers. |
4. Kinesis Consumers This video explains the concept of Kinesis consumers. |
5. Kinesis Data Streams - Hands On This video is a hands-on video for Kinesis data streams |
6. Kinesis Enhanced Fan Out This video focuses on the Kinesis enhanced fan-out feature. |
7. Kinesis Scaling This video explains how to scale the Amazon Kinesis data streams. |
8. Kinesis - Handling Duplicate Records This video focuses on handling duplicate records. |
9. Kinesis Security This video explains the concept of security in Amazon Kinesis data streams. |
10. Kinesis Data Firehose This video explains the concept of Amazon Kinesis data firehose. |
11. CloudWatch Subscription Filters with Kinesis This video focuses on the CloudWatch subscription filter with Kinesis. |
12. (Exercise) Kinesis Firehose, Part 1 This video is the first part of the three-part video that explains how to build a system to populate an S3 data lake from the Amazon EC2 server data using Kinesis Firehose. |
13. (Exercise) Kinesis Firehose, Part 2 This video is the second part of the three-part video that explains how to build a system to populate an S3 data lake from the Amazon EC2 server data using Kinesis Firehose. |
14. (Exercise) Kinesis Firehose, Part 3 This video is the third part of the three-part video that explains how to build a system to populate an S3 data lake from the Amazon EC2 server data using Kinesis Firehose. |
15. (Exercise) Kinesis Data Streams This video explains how to create the Amazon Kinesis data streams, use the Kinesis agent to send data from Amazon EC2 into it, and confirm if the data is successfully sent and received. |
16. SQS Overview This video provides an overview of Amazon SQS. |
17. Kinesis Data Streams Versus SQS This video illustrates the difference between Amazon Kinesis data streams and Amazon SQS. |
18. IoT Overview This video provides an overview of IoT. |
19. IoT Components Deep Dive This video explains the various IoT components in detail. |
20. Database Migration Service (DMS) This video explains the concept of AWS DMS (Database Migration Service). |
21. Direct Connect This video focuses on AWS Direct Connect. |
22. Snow Family This video focuses on the AWS Snow family. |
23. MSK: Managed Streaming for Apache Kafka This video explains the concept of Amazon MSK. |
24. MSK Connect This video focuses on the MSK Connect feature. |
25. MSK Serverless This video focuses on the MSK Serverless feature. |
26. Kinesis vs MSK This video explains the difference between Kinesis and MSK. |
3. Domain 2: Storage
1. S3 Overview This video provides an overview of Amazon S3. |
2. S3 Hands-On This video is a practical video where we will move into AWS S3 bucket. |
3. S3 Security: Bucket Policy This video explains the various Amazon S3 security features and bucket policies. |
4. S3 Security: Bucket Policy Hands-On This video is a practical video where we will demonstrate S3 security and bucket policies. |
5. S3 Versioning This video explains the versioning of Amazon S3. |
6. S3 Versioning - Hands On This video is a practical video where we will demonstrate S3 versioning. |
7. S3 Replication This video explains the concept of Amazon S3 replication (CRR and SRR). |
8. S3 Replication Notes This video shared some notes about Amazon S3 Replication. |
9. S3 Replication - Hands-On This video is a practical video where we will demonstrate S3 replication. |
10. S3 Storage Classes Overview This video explains the various Amazon S3 storage classes. |
11. S3 Storage Classes Hands-On This video is a practical video where we will demonstrate storage classes. |
12. S3 Lifecycle Rules (with S3 Analytics) This video explains the Amazon S3 lifecycle rules. |
13. S3 Lifecycle Rules - Hands-On This video is a practical video where we will demonstrate S3 lifecycle rules. |
14. S3 Event Notifications This video focuses on Amazon S3 event notifications. |
15. S3 Event Notifications - Hands-On This video is a practical video where we will demonstrate S3 event notifications. |
16. S3 Performance This video explains the performance of Amazon S3. |
17. S3 Select and Glacier Select This video focuses on Amazon S3 select and Glacier select. |
18. S3 Encryption This video explains the concept of Amazon S3 encryption. |
19. S3 Encryption - Hands-On This video is a practical video where we will demonstrate S3 encryption. |
20. S3 Default Encryption Versus and Bucket Policies This video explains the difference between default encryption and bucket policies. |
21. S3 Access Points and Object Lambda This video explains S3 access points and object lambda. |
22. DynamoDB Overview This video provides an overview of Amazon DynamoDB. |
23. DynamoDB Basics - Hands-On This video explores DynamoDB services. |
24. DynamoDB in Big data This video explains how DynamoDB relates to the big data world. |
25. DynamoDB RCU and WCU - Throughput This video explains the Amazon DynamoDB provisioned throughput capacity for read and write. |
26. DynamoDB RCU and WCU - Hands-On This hands-on video shows how to define RCU and WCU of our tables. |
27. DynamoDB Basic APIs This video explains the concept of Amazon DynamoDB APIs. |
28. DynamoDB Basic APIs - Hands-On This hands-on video explores DynamoDB Basic API. |
29. DynamoDB Indexes (GSI + LSI) This video explains the LSI and GSI Amazon DynamoDB indexes. |
30. DynamoDB Indexes (GSI + LSI) - Hands-On This hands-on video demonstrates how to use the LSI and GSI Amazon DynamoDB indexes. |
31. DynamoDB PartiQL This video covers DynamoDB PartiQL. |
32. DynamoDB DAX This video explains the concept of Amazon DAX. |
33. DynamoDB DAX - Hands-On This hands-on video covers DynamoDB DAX. |
34. DynamoDB Streams This video explains the Amazon DynamoDB streams service. |
35. DynamoDB Streams - Hands-On This hands-On video cover DynamoDB Streams. |
36. DynamoDB TTL This video explains the concept of Amazon DynamoDB TTL. |
37. DynamoDB Patterns with S3 This video covers DynamoDB Patterns with S3. |
38. DynamoDB Security This video focuses on Amazon DynamoDB security. |
39. (Exercise) DynamoDB This video explains how to write the order data from a Kinesis stream into a DynamoDB table using a Kinesis consumer app on Amazon EC2. |
40. ElastiCache Overview This video provides an overview of Amazon ElastiCache. |
4. Domain 3: Processing
1. Section Introduction: Processing This video introduces processing. |
2. What Is AWS Lambda? This video explains what AWS Lambda is. |
3. Lambda Integration - Part 1 This video is the first part of the two-part video that explains how to integrate a Lambda function. |
4. Lambda Integration - Part 2 This video is the second part of the two-part video that explains how to integrate a Lambda function. |
5. Lambda Costs, Promises, and Anti-Patterns This video focuses on Lambda costs, its features, and anti-patterns. |
6. (Exercise) AWS Lambda This video explains how to complete the "order history app" example by replacing the Kinesis consumer app with a Lambda function, which is serverless and more scalable. |
7. What Is Glue? + Partitioning Your Data Lake This video focuses on AWS Glue and explains how to partition the data lake. |
8. Glue, Hive, and ETL This video focuses on Glue, Hive, and ETL services. |
9. Modifying the Glue Data Catalog from ETL Scripts This video explains how to modify the Glue Data Catalog from ETL Scripts. |
10. Glue ETL: Developer Endpoints, Running ETL Jobs with Bookmarks This video focuses on Glue ETL. |
11. Glue Costs and Anti-Patterns This video focuses on Glue costs and anti-patterns. |
12. AWS Glue Studio This video focuses on AWS Glue studio. |
13. AWS Glue DataBrew This video focuses on AWS Glue DataBrew. |
14. AWS Glue Elastic Views (Coming Soon...) This video focuses on AWS Glue Elastic Views. |
15. AWS Lake Formation This video focuses on AWS Lake formation. |
16. Elastic MapReduce (EMR) Architecture and Usage This video focuses on the architecture of Amazon EMR and its usage. |
17. EMR, AWS integration, and Storage This video focuses on Amazon EMR, AWS integration, and storage. |
18. EMR Promises; Introduction to Hadoop This video focuses on the features of EMR and introduces Hadoop. |
19. EMR Serverless This video explains the new features of EMR Serverless. |
20. Introduction to Apache Spark This video introduces Apache Spark. |
21. Spark Integration with Kinesis and Redshift This video explains how to integrate Apache Spark with Amazon Kinesis and Redshift. |
22. Hive on EMR This video demonstrates how to integrate Hive with Amazon EMR. |
23. Pig on EMR This video demonstrates how to integrate Apache Pig with Amazon EMR. |
24. HBase on EMR This video demonstrates how to integrate Apache HBase with Amazon EMR. |
25. Presto on EMR This video demonstrates how to integrate Presto with Amazon EMR. |
26. Zeppelin and EMR Notebooks This video explains how to use Apache Zeppelin with Amazon EMR notebooks. |
27. Hue, Splunk, and Flume This video explains the concept of Hue, Splunk, and Flume in Apache Spark. |
28. S3DistCP and Other Services This video explains how to use S3DistCp and other services in Apache Spark. |
29. EMR Security and Instance Types This video explains Amazon EMR security and its instance types. |
30. (Exercise) Elastic MapReduce, Part 1 This video explains how to use Apache Spark and MLLib (its machine learning library) on an Amazon EMR cluster to consume the order data in an Amazon S3 data lake and produce product recommendations for the customers. |
31. (Exercise) Elastic MapReduce, Part 2 This video explains how to use Apache Spark and MLLib (its machine learning library) on an Amazon EMR cluster to consume the order data in an Amazon S3 data lake and produce product recommendations for the customers. |
32. AWS Data Pipeline This video focuses on the AWS data pipeline service. |
33. AWS Step Functions This video focuses on AWS step functions. |
5. Domain 4: Analysis
1. Section Introduction: Analysis This video introduces the section. |
2. Introduction to Kinesis Analytics This video introduces Kinesis analytics. |
3. Kinesis Analytics Costs; RANDOM_CUT_FOREST This video focuses on the cost of Kinesis analytics and explains how to use the RANDOM_CUT_FOREST function. |
4. (Exercise) Kinesis Analytics, Part 1 This video is the first part of the four-part video that explains how to build a more complex application that monitors the incoming order data using Kinesis analytics. If an anomalous order rate is detected, an alarm will be sent via text message to your cell phone using Lambda and SNS. |
5. (Exercise) Kinesis Analytics, Part 2 This video is the second part of the four-part video that explains how to build a more complex application that monitors the incoming order data using Kinesis analytics. If an anomalous order rate is detected, an alarm will be sent via text message to your cell phone using Lambda and SNS. |
6. (Exercise) Kinesis Analytics, Part 3 This video is the third part of the four-part video that shows to use Kinesis analytics to monitor our incoming data. |
7. (Exercise) Kinesis Analytics, Part 4 This video is the fourth part of the four-part video that shows Kinesis analytics in the studio environment. |
8. Introduction to OpenSearch (formerly Elasticsearch) This video gives a quick introduction to the Amazon OpenSearch services. |
9. Amazon OpenSearch Service This video explains what Amazon OpenSearch services is. |
10. OpenSearch Index Management and Designing for Stability This video explains the OpenSearch index state management and designing for stability. |
11. Amazon OpenSearch Service Performance This video covers Amazon OpenSearch service performance. |
12. (Exercise) Amazon OpenSearch Service This video demonstrates an exercise for Amazon OpenSearch service. |
13. Introduction to Athena This video introduces Amazon Athena. |
14. Athena and Glue, Costs, and Security This video explains the cost and security of Amazon Athena and Glue. |
15. Athena Performance This video focuses on Athena performance. |
16. Athena ACID Transactions This video covers Athena ACID Transactions. |
17. (Exercise) AWS Glue and Athena This video explains how to use a Glue crawler to set up a Glue Data catalog for the Amazon S3 order data and then query it directly using Amazon Athena. |
18. Redshift Introduction and Architecture This video introduces the Amazon Redshift architecture. |
19. Redshift Spectrum and Performance Tuning This video explains how to improve Amazon Redshift spectrum and tune its performance. |
20. Redshift Durability and Scaling This video focuses on Amazon Redshift durability and scaling. |
21. Redshift Distribution Styles This video shows the various Amazon Redshift distribution styles. |
22. Redshift Sort Keys This video explains how to select the Amazon Redshift sort keys. |
23. Redshift Data Flows and the COPY command This video explains the Amazon Redshift data flows and how to use the COPY command. |
24. Redshift Integration / WLM / Vacuum / Anti-Patterns This video explains how to integrate data using Amazon Redshift WLM, Vacuum, and Anti-Patterns. |
25. Redshift Resizing (Elastic Versus Classic) and New Redshift Features in 2020 This video has the new features in the 2020 version. |
26. Newer Redshift Features, AQUA This video covers the Newer Features for Redshift and AQUA. |
27. Redshift Security Concerns This video is about Redshift security concerns. |
28. Redshift Serverless This video covers Redshift Serverless. |
29. (Exercise) Redshift Spectrum, Part 1 This video is the first part of the two-part video that explains how to launch a Redshift cluster and use AWS Glue to provide a schema to query the Amazon S3 order data through Redshift spectrum. |
30. (Exercise) Redshift Spectrum, Part 2 This video is the second part of the two-part video that explains how to launch a Redshift cluster and use AWS Glue to provide a schema to query the Amazon S3 order data through Redshift spectrum. |
31. Amazon Relational Database Service (RDS) and Aurora This video focuses on Amazon RDS and Aurora. |
6. Domain 5: Visualization
1. Section Introduction: Visualization This video explains the concept of visualization in Amazon QuickSight. |
2. Introduction to Amazon QuickSight This video introduces Amazon QuickSight. |
3. QuickSight Pricing and Dashboards; ML Insights This video provides an overview of Amazon QuickSight's pricing and dashboards. |
4. QuickSight Q This video covers QuickSight Q, which is the most recent feature of QuickSight. |
5. Choosing Visualization Types This video explains how to select the visualization types in Amazon QuickSight. |
6. (Exercise) Amazon QuickSight This video explains how to set up QuickSight on top of the Redshift data warehouse and focuses on the visualizations it can produce. |
7. Other Visualization Tools (HighCharts, D3, and More) This video explains the other visualization tools in Amazon QuickSight. |
7. Domain 6: Security
1. Encryption 101 This video explains the concept of encryption in Amazon Web Services (AWS). |
2. S3 Encryption (Reminder) This video explains the concept of encryption in Amazon Web Services (AWS). |
3. KMS Overview This video provides an overview of AWS KMS. |
4. KMS Key Rotation This video focuses on AWS KMS Key rotation. |
5. Cloud HSM Overview This video provides an overview of AWS CloudHSM. |
6. AWS Services Security Deep Dive (1/3) This video is the first part of the three-part video that explains the AWS security features in detail. |
7. AWS Services Security Deep Dive (2/3) This video is the second part of the three-part video that explains the AWS security features in detail. |
8. AWS Services Security Deep Dive (3/3) This video is the third part of the three-part video that explains the AWS security features in detail. |
9. STS and Cross Account Access This video focuses on AWS STS and cross-account access. |
10. Identity Federation This video focuses on AWS identity federation. |
11. Policies - Advanced This video explains the various advanced policies in Amazon Web Services (AWS). |
12. CloudTrail This video focuses on the AWS CloudTrail service. |
13. VPC Endpoints This video explains how to use VPC endpoints to support the Amazon Web Services (AWS) service. |
8. Everything Else
1. AWS Services Integrations This video focuses on AWS service integrations. |
2. Instance Types for Big Data This video explains the instance types for big data. |
3. EC2 for Big Data This video focuses on Amazon EC2 for big data. |
4. Interacting with Data with AWS AppSync and Amazon Kendra This video demonstrates how to interact with data with AWS AppSync and Amazon Kendra. |
5. AWS Data Exchange This video covers AWS data exchange. |
6. Amazon AppFlow This video covers Amazon AppFlow. |
9. Preparing for the Exam
1. Exam Tips This video provides some tips to clear the exam. |
2. State of Learning Checkpoint This video focuses on the learning checkpoint. |
3. Exam Walkthrough and Signup This video explains how to sign up for the exam. |
4. Save 50% on Your AWS Exam Cost! This video guides you on how to save money on your next AWS exam. |
5. Get an Extra 30 Minutes in Your AWS Exam - Non-Native English Speakers Only This video will guide non-native speakers and help them with an extra 30 minutes in the exam. |
10. Appendix - Machine Learning Topics for the Legacy AWS Certified Big Data Exam
1. Machine Learning 101 This video focuses on machine learning. |
2. Classification Models This video explains the various classification models in machine learning. |
3. Amazon ML Service This video focuses on Amazon's machine learning service. |
4. SageMaker This video focuses on the Amazon SageMaker service. |
5. Deep Learning 101 This video explains the concept of machine learning in detail. |
6. (Exercise) Amazon Machine Learning, Part 1 This video is the first part of the two-part video that explains how to use Amazon's machine learning service to predict quantities for any given order and learn the importance of cleaning the data of outliers along the way. |
7. (Exercise) Amazon Machine Learning, Part 2 This video is the second part of the two-part video that explains how to predict order quantities with a linear regression from Amazon machine learning. |
11. Wrapping Up
1. Congratulations! Now, Make Sure You Are Ready This video provides some concluding notes to ensure that you are ready for the exam. |