Farin Company Specialist Website

Master Big Data – Apache Spark/Hadoop/Sqoop/Hive/Flume/Mongo

Course Outline

In-depth course on Big Data: Apache Spark, Hadoop, Sqoop, Flume, Apache Hive, MongoDB, and Big Data cluster setup


01 - Big Data Introduction
  • 001 Course Intro
  • 002 Big Data Intro
  • 003 Understanding Big Data Ecosystem

02 - Environment Setup
  • 001 GCP Cluster Fixes.html
  • 002 Cluster Setup on Google Cloud
  • 002 gcp-cluster.txt
  • 002 resources.zip
  • 002 retail-db.zip
  • 003 Environment Update

03 - Hadoop And Yarn
  • 001 HDFS and Hadoop Commands
  • 002 Yarn Cluster Overview

04 - Sqoop Import
  • 001 Sqoop Introduction
  • 001 Sqoop-Commands.docx
  • 001 Sqoop-Import-pdf.pdf
  • 002 Managing Target Directories
  • 002 Sqoop-Import.docx
  • 003 Working with Parquet File Format
  • 003 parquet-tools-1.10.0.zip
  • 004 Working with Avro File Format
  • 004 avro-tools-1.8.2.zip
  • 005 Working with Different Compressions
  • 006 Conditional Imports
  • 007 Split-by and Boundary Queries
  • 008 Field Delimiters
  • 009 Incremental Appends
  • 010 Sqoop-Hive Cluster Fix.html
  • 011 Sqoop Hive Import
  • 012 Sqoop List Tables/Database
  • 013 Sqoop Import Practice 1
  • 013 Sqoop-Import-Practice.docx
  • 013 Sqoop-Import-Practice-Sol.docx
  • 014 Sqoop Import Practice 2

05 - Sqoop Export
  • 001 Export from HDFS to MySQL
  • 001 sqoop-export.zip
  • 002 Export from Hive to MySQL
  • 003 Export Avro Compressed to MySQL
  • 004 Bonus Lecture Sqoop with Airflow

06 - Apache Flume
  • 001 Flume Introduction And Architecture
  • 001 flume-config-resources-20200505T024933Z-001.zip
  • 002 Exec Source and Logger Sink
  • 003 Moving data from Twitter to HDFS
  • 004 Moving data from NetCat to HDFS
  • 005 Flume Interceptors
  • 006 Flume Interceptor Example
  • 007 Flume Multi-Agent Flow
  • 008 Flume Consolidation

07 - Apache Hive
  • 001 Hive Introduction
  • 001 Hive-pdf.pdf
  • 002 Hive Database
  • 002 Hive-Practice-pdf.pdf
  • 003 Hive Managed Tables
  • 004 Hive External Tables
  • 005 Hive Inserts
  • 006 Hive Analytics
  • 007 Working with Parquet
  • 008 Compressing Parquet
  • 009 Working with Fixed File Format
  • 010 Alter Command
  • 011 Hive String Functions
  • 012 Hive Date Functions
  • 013 Hive Partitioning
  • 014 Hive Bucketing

08 - Spark with Yarn And HDFS
  • 001 What is Apache Spark
  • 002 Understanding Cluster Manager (Yarn)
  • 003 Understanding Distributed Storage (HDFS)
  • 004 Running Spark on Yarn/HDFS
  • 005 Understanding Deploy Modes

09 - GCS Cluster
  • 001 Spark on GCS Cluster

10 - Spark Internals
  • 001 Drivers And Executors
  • 002 RDDs And Dataframes
  • 003 Transformation And Actions
  • 004 Wide And Narrow Transformations
  • 005 Understanding Execution Plan
  • 006 Different Plans by Driver

11 - Spark RDD Transformation And Actions
  • 001 Map/FlatMap Transformation
  • 002 Filter/Intersection
  • 003 Union/Distinct Transformation
  • 004 GroupByKey: Group people based on birthday months
  • 005 ReduceByKey: Total number of students in each subject
  • 006 SortByKey: Sort students based on their roll number
  • 007 MapPartition/MapPartitionWithIndex
  • 008 Change number of Partitions
  • 009 Join: Join email addresses based on customer name
  • 010 Spark Actions

12 - Spark RDD Practice
  • 001 Scala Tuples
  • 001 spark-dataset-20200505T025156Z-001.zip
  • 002 Filter Error Logs
  • 003 Frequency of Words in a Text File
  • 004 Population of Each City
  • 005 Orders Placed by Customers
  • 006 Average Rating of Movie

13 - Spark Dataframes And Spark SQL
  • 001 Dataframe Intro
  • 001 Spark-Dataframe.pdf
  • 001 dataframe-dataset-20200505T025651Z-001.zip
  • 002 Dataframe from JSON Files
  • 003 Dataframe from Parquet Files
  • 004 Dataframe from CSV Files
  • 005 Dataframe from Avro File
  • 006 Working with XML
  • 007 Working with Columns
  • 008 Working with String
  • 009 Working with Dates
  • 010 Dataframe Filter API
  • 011 DataFrame API Part 1
  • 012 DataFrame API Part 2
  • 013 Spark SQL
  • 014 Working with Hive Tables in Spark
  • 015 Datasets versus Dataframe
  • 016 User Defined Functions (UDFs)

14 - Using IntelliJ IDE
  • 001 IntelliJ Setup
  • 002 Project Setup
  • 003 Writing the First Spark Program in the IDE
  • 004 Understanding Spark Configuration
  • 005 Adding Actions/Transformations
  • 006 Understanding Execution Plan

15 - Running Spark on EMR (AWS Cloud)
  • 001 EMR Cluster Overview
  • 002 Cluster Setup
  • 003 Setting Spark Code for EMR
  • 004 Using Spark-submit
  • 005 Running Spark on EMR Cluster

16 - Spark with Cassandra
  • 001 Cassandra Course.html
  • 002 Creating Spark RDD from Cassandra Table
  • 003 Processing Cassandra data in Spark
  • 004 Cassandra Rows to Case Class
  • 005 Saving Spark RDD to Cassandra

17 - Getting Started with MongoDB
  • 001 MongoDB Intro
  • 001 mongo-commads3.docx
  • 001 mongo-commands1.zip
  • 001 movies.zip
  • 002 MongoDB Use Case And Limitations
  • 003 MongoDB Installation

18 - CRUD Operations
  • 001 Find
  • 002 Find With Filter
  • 003 Insert
  • 004 Update
  • 005 Update Continued
  • 006 Projections
  • 007 Delete

19 - Working with Operators
  • 001 In / Not In Operators
  • 002 gte/lte Operators
  • 003 and/or Operators
  • 004 Regex Operator

20 - MongoDB Compass
  • 001 Working with GUI

21 - Advanced Mongo
  • 001 Validation/Schema
  • 002 Working with Indexes

22 - Spark with Mongo
  • 001 Spark Mongo Integration.html

Price: 139,000 Toman
Producer:
Instructor:
ID: 18341
Size: 3260 MB
Duration: 658 minutes
Publication date: 3 Shahrivar 1402 (25 August 2023)