-
Course Code
DSBA-003
Big Data Solutions Development – Technologies and Applications
- In this course, trainees are introduced to mainstream Big Data technologies implemented in the practical industry solutions. Big Data ecosystem consists of wide range of technologies that are used for different parts of a Big Data solution. After introducing the common patterns and use cases of Big Data, those technologies are mapped to specific functionalities to architect end-to-end data-driven solutions. Then each main technology is discussed in more details with examples and linking back to typical use cases.
- The course approach is by-example where tutorials are used for each topic followed by hands-on practice exercise on that topic to reinforce Knowledge and understanding. At the end, a practical guide is discussed to enable selecting the most appropriate Big Data technologies for a given use case.
Learning Outcomes
Outcomes
Course Contents
- Overview of Data Processing and NoSQL Solutions
- Big Data Use Cases and Common Patterns
- Introduction to Hadoop and the Hadoop Ecosystem
- Hadoop Architecture and HDFS
- Importing Relational Data with Apache Sqoop
- Modeling and Managing Data with Impala and Hive
- Data Formats, Avro, Parquet and Data File Partitioning
- Capturing Data with Apache Flume
- Using Apache HBase for Massive Tables
- Apache Spark Distributed Processing
- Working with RDDS, Aggregating Data with Pair RDDs
- Writing and Deploying Spark Applications
- Parallel Processing and Spark RDD Persistence
- Common Patterns in Spark, Graph Analysis and Machine Learning
- Spark SQL and DataFrames
- Working with Greenplum MPP Platform
- Implementing Analytics using MADLib on Greenplum
- Streaming Applications and Data Flow Engines
- Apache Spark Streaming, Kafka and Flink
- Streaming using Spring CDF and Apache Airflow
- Big Data Platforms and Solutions by Industry Vendors
- Practical Guide on Selecting Big Data Technologies
- Course Conclusion
Our Methodology
- Make coaching and monitoring innovative and using modern
- Media training also using on the go training by using interactive means and focusing on
- The exercises, practical applications and real situations study
- Live delivery method, instructor-led training
- Experienced consultant, trainers, and professional
- Qualified trainer with high-level experience
Attendance Reports
- Send daily attendance reports to training departments
- Send full attendance report to training dep. by the end of the course
- Attend 100 % from the course days also provide daily
- Issue attendance certificate for participant who attend minimum 80% from the course duration
Pre/Post Reports
- Pre- assessment before starting training
- Post assessment after finish training
- Full report for the deferent between Pre-& Post assessment
Who Should Attend
- Managers, Heads of Departments, and Directors
- Data Center
- Individuals responsible for designing and implementing big data solutions,
- Data Scientists and Data Analysts