
285 Big Data courses

Building Big Data Pipelines with PySpark MongoDB and Bokeh

5.0(10)

By Apex Learning

Overview

This comprehensive course on Building Big Data Pipelines with PySpark MongoDB and Bokeh will deepen your understanding of the topic. After successfully completing it, you will have the skills required to work in this sector. The course comes with accredited certification from CPD, which will enhance your CV and improve your standing in the job market. Enrol today to fast-track your career.

How will I get my certificate?

You may have to take a quiz or a written test online during or after the course. After successfully completing the course, you will be eligible for the certificate.

Who is this course for?

No experience or previous qualifications are required for enrolment on Building Big Data Pipelines with PySpark MongoDB and Bokeh. It is available to all students, of all academic backgrounds.

Requirements

Building Big Data Pipelines with PySpark MongoDB and Bokeh is fully compatible with PCs, Macs, laptops, tablets and smartphones, so you can access your course on Wi-Fi, 3G or 4G. There is no time limit for completing this course; it can be studied in your own time at your own pace.

Career Path

Learning this new skill will help you to advance in your career. It will diversify your job options and help you develop new techniques to keep up with the fast-changing world. This skillset will help you to:
• Open doors of opportunity
• Increase your adaptability
• Stay relevant
• Boost your confidence
• And much more!

Course Curriculum

7 sections • 25 lectures • 05:04:00 total length

• Introduction: 00:10:00
• Python Installation: 00:03:00
• Installing Third Party Libraries: 00:03:00
• Installing Apache Spark: 00:12:00
• Installing Java (Optional): 00:05:00
• Testing Apache Spark Installation: 00:06:00
• Installing MongoDB: 00:04:00
• Installing NoSQL Booster for MongoDB: 00:07:00
• Integrating PySpark with Jupyter Notebook: 00:05:00
• Data Extraction: 00:19:00
• Data Transformation: 00:15:00
• Loading Data into MongoDB: 00:13:00
• Data Pre-processing: 00:19:00
• Building the Predictive Model: 00:12:00
• Creating the Prediction Dataset: 00:08:00
• Loading the Data Sources from MongoDB: 00:17:00
• Creating a Map Plot: 00:33:00
• Creating a Bar Chart: 00:09:00
• Creating a Magnitude Plot: 00:15:00
• Creating a Grid Plot: 00:09:00
• Installing Visual Studio Code: 00:05:00
• Creating the PySpark ETL Script: 00:24:00
• Creating the Machine Learning Script: 00:30:00
• Creating the Dashboard Server: 00:21:00
• Source Code and Notebook: 00:00:00

Building Big Data Pipelines with PySpark MongoDB and Bokeh
Delivered Online On Demand • 5 hours 4 minutes
£12

Google Cloud Platform Big Data and Machine Learning Fundamentals

By Nexus Human

Duration: 1 day (6 CPD hours)

This course is intended for

This class is intended for the following:
• Data analysts, data scientists and business analysts getting started with Google Cloud Platform.
• Individuals responsible for designing pipelines and architectures for data processing, creating and maintaining machine learning and statistical models, querying datasets, visualizing query results and creating reports.
• Executives and IT decision makers evaluating Google Cloud Platform for use by data scientists.

Overview

This course teaches students the following skills:
• Identify the purpose and value of the key Big Data and Machine Learning products in the Google Cloud Platform.
• Use Cloud SQL and Cloud Dataproc to migrate existing MySQL and Hadoop/Pig/Spark/Hive workloads to Google Cloud Platform.
• Employ BigQuery and Cloud Datalab to carry out interactive data analysis.
• Train and use a neural network using TensorFlow.
• Employ ML APIs.
• Choose between different data processing products on the Google Cloud Platform.

This course introduces participants to the Big Data and Machine Learning capabilities of Google Cloud Platform (GCP). It provides a quick overview of the Google Cloud Platform and a deeper dive into its data processing capabilities.

Introducing Google Cloud Platform
• Google Platform Fundamentals Overview.
• Google Cloud Platform Big Data Products.

Compute and Storage Fundamentals
• CPUs on demand (Compute Engine).
• A global filesystem (Cloud Storage).
• CloudShell.
• Lab: Set up an Ingest-Transform-Publish data processing pipeline.

Data Analytics on the Cloud
• Stepping-stones to the cloud.
• Cloud SQL: your SQL database on the cloud.
• Lab: Importing data into CloudSQL and running queries.
• Spark on Dataproc.
• Lab: Machine Learning Recommendations with Spark on Dataproc.

Scaling Data Analysis
• Fast random access.
• Datalab.
• BigQuery.
• Lab: Build machine learning dataset.

Machine Learning
• Machine Learning with TensorFlow.
• Lab: Carry out ML with TensorFlow.
• Pre-built models for common needs.
• Lab: Employ ML APIs.

Data Processing Architectures
• Message-oriented architectures with Pub/Sub.
• Creating pipelines with Dataflow.
• Reference architecture for real-time and batch data processing.

Summary
• Why GCP?
• Where to go from here
• Additional Resources

Additional course details:

Nexus Human's Google Cloud Platform Big Data and Machine Learning Fundamentals training program is a workshop that presents an invigorating mix of sessions, lessons, and masterclasses meticulously crafted to propel your learning expedition forward. This immersive bootcamp-style experience boasts interactive lectures, hands-on labs, and collaborative hackathons, all strategically designed to fortify fundamental concepts. Guided by seasoned coaches, each session offers priceless insights and practical skills crucial for honing your expertise. Whether you're new to professional skills or a seasoned professional, this comprehensive course ensures you're equipped with the knowledge and prowess necessary for success.

While we feel this is the best course for Google Cloud Platform Big Data and Machine Learning Fundamentals, and one of our Top 10, we encourage you to read the course outline to make sure it is the right content for you. Additionally, private sessions, closed classes or dedicated events are available both live online and at our training centres in Dublin and London, as well as at your offices anywhere in the UK, Ireland or across EMEA.

Google Cloud Platform Big Data and Machine Learning Fundamentals
Delivered Online • Flexible Dates
Price on Enquiry

Apache Spark 3 Advance Skills for Cracking Job Interviews

By Packt

A carefully structured advanced-level course on Apache Spark 3 to help you clear your job interviews. This course covers advanced topics and concepts that are part of the Databricks Spark certification exam. Boost your skills in Spark 3 architecture and memory management.

Apache Spark 3 Advance Skills for Cracking Job Interviews
Delivered Online On Demand • 3 hours 47 minutes
£67.99

Data Bundle Course

By iStudy UK

Are you interested in understanding and protecting big data? Do you want to use any data presented to you to its full extent? This Bundle Course will solve all of your problems!

Interpreting data is of immense importance when you are handling data. Only then can you fully use it for your business. At the same time, when you use data online, there is always a chance that it can be stolen. Learn how to protect your information in this data bundle course.

Make your own database and reap its benefits. In this course, you will learn how to create your own database and database user. You will then master importing database tables: most new scripts come with a built-in installer that creates the database for you, but when no installer is provided you have to create it manually. Upon completion of the course, students will possess a solid knowledge of data protection law, as well as an understanding of the practical implications for different organisations.

Course Highlights

Data Bundle Course is an award-winning, best-selling course that holds CPD Certification and IAO accreditation. It is the most suitable course for anyone looking to work in this or a related sector. It is considered one of the best courses in the UK for helping learners get familiar with the topic and gain the skills necessary to perform well in this field. We have packed Data Bundle Course into several modules that teach you everything you need to become successful in this profession. For ease of access, this course is designed for both part-time and full-time students. You can become accredited in just 20 to 30 hours, and it is also possible to study at your own pace. Our experienced tutors will help you throughout the comprehensive syllabus and answer all your queries by email. You will also be able to verify your qualification on our dedicated website.

Why You Should Choose Data Bundle Course
• Lifetime access to the course
• No hidden fees or exam charges
• CPD Accredited certification on successful completion
• Full tutor support on weekdays (Monday - Friday)
• Efficient exam system, assessment and instant results
• Download a printable PDF certificate immediately after completion
• Obtain the original print copy of your certificate, dispatched the next working day, for as little as £9
• Improve your chance of gaining professional skills and better earning potential

Who is this Course for?

Data Bundle Course is CPD certified and IAO accredited, which makes it a good fit for anyone looking to build professional skills. No experience or qualifications are required, so it is open to students from any academic background.

Requirements

Data Bundle Course is fully compatible with any kind of device. Whether you are using a Windows computer, Mac, smartphone or tablet, you will get the same learning experience. You can also access the course over any kind of internet connection, from anywhere and at any time, without limitation.

Career Path

You will be ready to enter the relevant job market after completing this course, having gained the knowledge and skills required to succeed in this sector. All our Diplomas are CPD and IAO accredited, so you will be able to stand out from the crowd by adding our qualifications to your CV.

Module 1: Database Basics
• What is a Database (FREE): 01:00:00
• Definition of Terms (FREE): 01:00:00
• Database Users and Languages: 01:00:00
• Components of a Database System: 01:00:00
• Basic Set Concepts: 01:00:00
• Relations as a Database: 01:00:00
• Relational Database Operators: 01:00:00
• Database keys: 01:00:00

Module 2: Understanding Big Data
• What Is Big Data? Hint: You're a Part of It Every Day (FREE): 01:00:00
• Why Is Big Data Important? (FREE): 01:00:00
• Why IBM for Big Data?: 01:00:00
• All About Hadoop: The Big Data Lingo Chapter: 01:00:00
• InfoSphere BigInsights: Analytics for Big Data at Rest: 01:00:00
• IBM InfoSphere Streams: Analytics for Big Data in Motion: 01:00:00

Module 3: Data Protection
• Why Internet Marketers Need More Protection (FREE): 01:00:00
• You Can't Be Anonymous (FREE): 00:15:00
• Start With Basic Security Measures: 01:00:00
• Using Disclaimers: 01:00:00
• Proper Use Of Email: 01:00:00
• Protecting Product Rights: 01:00:00
• Protecting Your Website: 01:00:00
• Terms Of Use Or Service: 00:30:00
• Privacy Policy: 01:00:00

Data Bundle Course
Delivered Online On Demand • 21 hours 45 minutes
£25

DP-601T00 Implementing a Lakehouse with Microsoft Fabric

By Nexus Human

Duration: 1 day (6 CPD hours)

This course is intended for

The primary audience for this course is data professionals who are familiar with data modeling, extraction, and analytics. It is designed for professionals who are interested in gaining knowledge about Lakehouse architecture, the Microsoft Fabric platform, and how to enable end-to-end analytics using these technologies.

Job role: Data Analyst, Data Engineer, Data Scientist

Overview
• Describe end-to-end analytics in Microsoft Fabric
• Describe core features and capabilities of lakehouses in Microsoft Fabric
• Create a lakehouse
• Ingest data into files and tables in a lakehouse
• Query lakehouse tables with SQL
• Configure Spark in a Microsoft Fabric workspace
• Identify suitable scenarios for Spark notebooks and Spark jobs
• Use Spark dataframes to analyze and transform data
• Use Spark SQL to query data in tables and views
• Visualize data in a Spark notebook
• Understand Delta Lake and delta tables in Microsoft Fabric
• Create and manage delta tables using Spark
• Use Spark to query and transform data in delta tables
• Use delta tables with Spark structured streaming
• Describe Dataflow (Gen2) capabilities in Microsoft Fabric
• Create Dataflow (Gen2) solutions to ingest and transform data
• Include a Dataflow (Gen2) in a pipeline

This course is designed to build your foundational skills in data engineering on Microsoft Fabric, focusing on the Lakehouse concept. It explores the capabilities of Apache Spark for distributed data processing and the essential techniques for efficient data management, versioning, and reliability by working with Delta Lake tables. It also explores data ingestion and orchestration using Dataflows Gen2 and Data Factory pipelines. The course combines lectures and hands-on exercises that will prepare you to work with lakehouses in Microsoft Fabric.

Introduction to end-to-end analytics using Microsoft Fabric
• Explore end-to-end analytics with Microsoft Fabric
• Data teams and Microsoft Fabric
• Enable and use Microsoft Fabric
• Knowledge Check

Get started with lakehouses in Microsoft Fabric
• Explore the Microsoft Fabric Lakehouse
• Work with Microsoft Fabric Lakehouses
• Exercise - Create and ingest data with a Microsoft Fabric Lakehouse

Use Apache Spark in Microsoft Fabric
• Prepare to use Apache Spark
• Run Spark code
• Work with data in a Spark dataframe
• Work with data using Spark SQL
• Visualize data in a Spark notebook
• Exercise - Analyze data with Apache Spark

Work with Delta Lake Tables in Microsoft Fabric
• Understand Delta Lake
• Create delta tables
• Work with delta tables in Spark
• Use delta tables with streaming data
• Exercise - Use delta tables in Apache Spark

Ingest Data with DataFlows Gen2 in Microsoft Fabric
• Understand Dataflows (Gen2) in Microsoft Fabric
• Explore Dataflows (Gen2) in Microsoft Fabric
• Integrate Dataflows (Gen2) and Pipelines in Microsoft Fabric
• Exercise - Create and use a Dataflow (Gen2) in Microsoft Fabric

DP-601T00 Implementing a Lakehouse with Microsoft Fabric
Delivered Online
£595

DP-203T00 Data Engineering on Microsoft Azure

By Nexus Human

Duration: 4 days (24 CPD hours)

This course is intended for

The primary audience for this course is data professionals, data architects, and business intelligence professionals who want to learn about data engineering and building analytical solutions using data platform technologies that exist on Microsoft Azure. The secondary audience for this course includes data analysts and data scientists who work with analytical solutions built on Microsoft Azure.

In this course, the student will learn how to implement and manage data engineering workloads on Microsoft Azure, using Azure services such as Azure Synapse Analytics, Azure Data Lake Storage Gen2, Azure Stream Analytics, Azure Databricks, and others. The course focuses on common data engineering tasks such as orchestrating data transfer and transformation pipelines, working with data files in a data lake, creating and loading relational data warehouses, capturing and aggregating streams of real-time data, and tracking data assets and lineage.

Prerequisites

Successful students start this course with knowledge of cloud computing and core data concepts and professional experience with data solutions.
• AZ-900T00 Microsoft Azure Fundamentals
• DP-900T00 Microsoft Azure Data Fundamentals

1 - Introduction to data engineering on Azure
• What is data engineering
• Important data engineering concepts
• Data engineering in Microsoft Azure

2 - Introduction to Azure Data Lake Storage Gen2
• Understand Azure Data Lake Storage Gen2
• Enable Azure Data Lake Storage Gen2 in Azure Storage
• Compare Azure Data Lake Store to Azure Blob storage
• Understand the stages for processing big data
• Use Azure Data Lake Storage Gen2 in data analytics workloads

3 - Introduction to Azure Synapse Analytics
• What is Azure Synapse Analytics
• How Azure Synapse Analytics works
• When to use Azure Synapse Analytics

4 - Use Azure Synapse serverless SQL pool to query files in a data lake
• Understand Azure Synapse serverless SQL pool capabilities and use cases
• Query files using a serverless SQL pool
• Create external database objects

5 - Use Azure Synapse serverless SQL pools to transform data in a data lake
• Transform data files with the CREATE EXTERNAL TABLE AS SELECT statement
• Encapsulate data transformations in a stored procedure
• Include a data transformation stored procedure in a pipeline

6 - Create a lake database in Azure Synapse Analytics
• Understand lake database concepts
• Explore database templates
• Create a lake database
• Use a lake database

7 - Analyze data with Apache Spark in Azure Synapse Analytics
• Get to know Apache Spark
• Use Spark in Azure Synapse Analytics
• Analyze data with Spark
• Visualize data with Spark

8 - Transform data with Spark in Azure Synapse Analytics
• Modify and save dataframes
• Partition data files
• Transform data with SQL

9 - Use Delta Lake in Azure Synapse Analytics
• Understand Delta Lake
• Create Delta Lake tables
• Create catalog tables
• Use Delta Lake with streaming data
• Use Delta Lake in a SQL pool

10 - Analyze data in a relational data warehouse
• Design a data warehouse schema
• Create data warehouse tables
• Load data warehouse tables
• Query a data warehouse

11 - Load data into a relational data warehouse
• Load staging tables
• Load dimension tables
• Load time dimension tables
• Load slowly changing dimensions
• Load fact tables
• Perform post load optimization

12 - Build a data pipeline in Azure Synapse Analytics
• Understand pipelines in Azure Synapse Analytics
• Create a pipeline in Azure Synapse Studio
• Define data flows
• Run a pipeline

13 - Use Spark Notebooks in an Azure Synapse Pipeline
• Understand Synapse Notebooks and Pipelines
• Use a Synapse notebook activity in a pipeline
• Use parameters in a notebook

14 - Plan hybrid transactional and analytical processing using Azure Synapse Analytics
• Understand hybrid transactional and analytical processing patterns
• Describe Azure Synapse Link

15 - Implement Azure Synapse Link with Azure Cosmos DB
• Enable Cosmos DB account to use Azure Synapse Link
• Create an analytical store enabled container
• Create a linked service for Cosmos DB
• Query Cosmos DB data with Spark
• Query Cosmos DB with Synapse SQL

16 - Implement Azure Synapse Link for SQL
• What is Azure Synapse Link for SQL?
• Configure Azure Synapse Link for Azure SQL Database
• Configure Azure Synapse Link for SQL Server 2022

17 - Get started with Azure Stream Analytics
• Understand data streams
• Understand event processing
• Understand window functions

18 - Ingest streaming data using Azure Stream Analytics and Azure Synapse Analytics
• Stream ingestion scenarios
• Configure inputs and outputs
• Define a query to select, filter, and aggregate data
• Run a job to ingest data

19 - Visualize real-time data with Azure Stream Analytics and Power BI
• Use a Power BI output in Azure Stream Analytics
• Create a query for real-time visualization
• Create real-time data visualizations in Power BI

20 - Introduction to Microsoft Purview
• What is Microsoft Purview?
• How Microsoft Purview works
• When to use Microsoft Purview

21 - Integrate Microsoft Purview and Azure Synapse Analytics
• Catalog Azure Synapse Analytics data assets in Microsoft Purview
• Connect Microsoft Purview to an Azure Synapse Analytics workspace
• Search a Purview catalog in Synapse Studio
• Track data lineage in pipelines

22 - Explore Azure Databricks
• Get started with Azure Databricks
• Identify Azure Databricks workloads
• Understand key concepts

23 - Use Apache Spark in Azure Databricks
• Get to know Spark
• Create a Spark cluster
• Use Spark in notebooks
• Use Spark to work with data files
• Visualize data

24 - Run Azure Databricks Notebooks with Azure Data Factory
• Understand Azure Databricks notebooks and pipelines
• Create a linked service for Azure Databricks
• Use a Notebook activity in a pipeline
• Use parameters in a notebook

Additional course details:

Nexus Human's DP-203T00 Data Engineering on Microsoft Azure training program is a workshop that presents an invigorating mix of sessions, lessons, and masterclasses meticulously crafted to propel your learning expedition forward. This immersive bootcamp-style experience boasts interactive lectures, hands-on labs, and collaborative hackathons, all strategically designed to fortify fundamental concepts. Guided by seasoned coaches, each session offers priceless insights and practical skills crucial for honing your expertise. Whether you're new to professional skills or a seasoned professional, this comprehensive course ensures you're equipped with the knowledge and prowess necessary for success.

While we feel this is the best course for DP-203T00 Data Engineering on Microsoft Azure, and one of our Top 10, we encourage you to read the course outline to make sure it is the right content for you. Additionally, private sessions, closed classes or dedicated events are available both live online and at our training centres in Dublin and London, as well as at your offices anywhere in the UK, Ireland or across EMEA.

DP-203T00 Data Engineering on Microsoft Azure
Delivered Online
£2,380

Apache Kafka A-Z with Hands-On Learning

By Packt

Through this course, you will learn how to set up a Kafka producer and consumer, as well as Kafka Streams and Connectors, correctly. You will also gain the skills needed to integrate Kafka with existing applications and to pass the Apache Kafka certification exam.

Apache Kafka A-Z with Hands-On Learning
Delivered Online On Demand • 9 hours 36 minutes
£35.99

Real-Time Data Stream Processing in Azure

By Packt

Delve into big data streaming with Azure using Event Hubs, Data Lake, and Azure Stream Analytics

Real-Time Data Stream Processing in Azure
Delivered Online On Demand • 1 hour 14 minutes
£22.99

Preparing for Digital Transformation: In-House Training

By IIL Europe Ltd

Preparing for Digital Transformation

Organizations today must keep pace with changing technology to stay abreast of market demand. Keeping pace means transforming the entire organization for the digital age. This workshop presents the challenges, benefits, and pitfalls of digital transformation, how it will affect you, and how you can be better prepared and positioned for it.

The goal of this course is to make you knowledgeable about the technologies behind a digital transformation in your organization and the forces driving such a transformation. You will learn how to engage in the process of transforming your organization digitally to meet the growing demands of customers and clients.

What you will Learn
• Describe the impact that digital is making on the economy and on customer expectations
• Examine the nature and drivers of the digital transformation
• Evaluate new technologies such as Blockchain, Big Data, Artificial Intelligence, and other technologies and see how they work to bring about digital transformation
• Assess the impact of digital technologies on the current roles and positions in the organization
• Discuss both the demand that customers have for digital technologies and the impact the digital technologies have on the customer and its relationship with the organization
• Recognize the new technology trends in the digital transformation and what they mean to the future of the organization
• Identify how digital transformation will affect all roles and how to be prepared for the upcoming and continuing digital transformation

Foundation Concepts
• Digital transformation versus automation
• Driving forces behind the digital transformation
• Learning from digital transformation successes

Digital Transformation and Customer Orientation
• The Digital Customer
• Customer touch points and the customer journey
• Omnichannel concept
• Transform to the customer

Digital Technologies and the Organization
• Relationship Management
• Big Data: The Basis for it All
• The Human Factor in Digital Transformations
• Risks of Digital Transformation

Technology Trends
• Data and Business Analytics
• Other Major Trends
• Preview of Coming Attractions

Pathway to Digital Transformation

Preparing for Digital Transformation: In-House Training
Delivered in London or UK Wide or Online • Flexible Dates
£1,495

Preparing for Digital Transformation

By IIL Europe Ltd

Preparing for Digital Transformation

Organizations today must keep pace with changing technology to stay abreast of market demand. Keeping pace means transforming the entire organization for the digital age. This workshop presents the challenges, benefits, and pitfalls of digital transformation, how it will affect you, and how you can be better prepared and positioned for it.

The goal of this course is to make you knowledgeable about the technologies behind a digital transformation in your organization and the forces driving such a transformation. You will learn how to engage in the process of transforming your organization digitally to meet the growing demands of customers and clients.

What you will Learn
• Describe the impact that digital is making on the economy and on customer expectations
• Examine the nature and drivers of the digital transformation
• Evaluate new technologies such as Blockchain, Big Data, Artificial Intelligence, and other technologies and see how they work to bring about digital transformation
• Assess the impact of digital technologies on the current roles and positions in the organization
• Discuss both the demand that customers have for digital technologies and the impact the digital technologies have on the customer and its relationship with the organization
• Recognize the new technology trends in the digital transformation and what they mean to the future of the organization
• Identify how digital transformation will affect all roles and how to be prepared for the upcoming and continuing digital transformation

Foundation Concepts
• Digital transformation versus automation
• Driving forces behind the digital transformation
• Learning from digital transformation successes

Digital Transformation and Customer Orientation
• The Digital Customer
• Customer touch points and the customer journey
• Omnichannel concept
• Transform to the customer

Digital Technologies and the Organization
• Relationship Management
• Big Data: The Basis for it All
• The Human Factor in Digital Transformations
• Risks of Digital Transformation

Technology Trends
• Data and Business Analytics
• Other Major Trends
• Preview of Coming Attractions

Pathway to Digital Transformation

Preparing for Digital Transformation
Delivered In-Person in London • Flexible Dates
£1,495