وب سایت تخصصی شرکت فرین
دسته بندی دوره ها

Master Big Data – Apache Spark/Hadoop/Sqoop/Hive/Flume/Mongo

سرفصل های دوره

In-depth course on Big Data - Apache Spark , Hadoop , Sqoop , Flume & Apache Hive, MongoDB & Big Data Cluster setup


01 - Big Data Introduction
  • 001 Course Intro
  • 002 Big Data Intro
  • 003 Understanding Big Data Ecosystem

  • 02 - Environment Setup
  • 001 GCP Cluster Fixes.html
  • 002 Cluster Setup on Google Cloud
  • 002 gcp-cluster.txt
  • 002 resources.zip
  • 002 retail-db.zip
  • 003 Environment Update

  • 03 - Hadoop And Yarn
  • 001 HDFS and Hadoop Commands
  • 002 Yarn Cluster Overview

  • 04 - Sqoop Import
  • 001 Sqoop Introduction
  • 001 Sqoop-Commands.docx
  • 001 Sqoop-Import-pdf.pdf
  • 002 Managing Target Directories
  • 002 Sqoop-Import.docx
  • 003 Working with Parquet File Format
  • 003 parquet-tools-1.10.0.zip
  • 004 Working with Avro File Format
  • 004 avro-tools-1.8.2.zip
  • 005 Working with Different Compressions
  • 006 Conditional Imports
  • 007 Split-by and Boundary Queries
  • 008 Field delimeters
  • 009 Incremental Appends
  • 010 Sqoop-Hive Cluster Fix.html
  • 011 Sqoop Hive Import
  • 012 Sqoop List TablesDatabase
  • 013 Sqoop Import Practice1
  • 013 Sqoop-Import-Practice.docx
  • 013 Sqoop-Import-Practice-Sol.docx
  • 014 Sqoop Import Practice2

  • 05 - Sqoop Export
  • 001 Export from Hdfs to Mysql
  • 001 sqoop-export.zip
  • 002 Export from Hive to Mysql
  • 003 Export Avro Compressed to Mysql
  • 004 Bonus Lecture Sqoop with Airflow

  • 06 - Apache Flume
  • 001 Flume Introduction And Architecture
  • 001 flume-config-resources-20200505T024933Z-001.zip
  • 002 Exec Source and Logger Sink
  • 003 Moving data from Twitter to HDFS
  • 004 Moving data from NetCat to HDFS
  • 005 Flume Interceptors
  • 006 Flume Interceptor Example
  • 007 Flume Multi-Agent Flow
  • 008 Flume Consolidation

  • 07 - Apache Hive
  • 001 Hive Introduction
  • 001 Hive-pdf.pdf
  • 002 Hive Database
  • 002 Hive-Practice-pdf.pdf
  • 003 Hive Managed Tables
  • 004 Hive External Tables
  • 005 Hive Inserts
  • 006 Hive Analytics
  • 007 Working with Parquet
  • 008 Compressing Parquet
  • 009 Working with Fixed File Format
  • 010 Alter Command
  • 011 Hive String Functions
  • 012 Hive Date Functions
  • 013 Hive Partitioning
  • 014 Hive Bucketing

  • 08 - Spark with Yarn And HDFS
  • 001 What is Apache Spark
  • 002 Understanding Cluster Manager (Yarn)
  • 003 Understanding Distributed Storage (HDFS)
  • 004 Running Spark on YarnHDFS
  • 005 Understanding Deploy Modes

  • 09 - GCS Cluster
  • 001 Spark on GCS Cluster

  • 10 - Spark Internals
  • 001 Drivers And Executors
  • 002 RDDs And Dataframes
  • 003 Transformation And Actions
  • 004 Wide And Narrow Transformations
  • 005 Understanding Execution Plan
  • 006 Different Plans by Driver

  • 11 - Spark RDD Transformation And Actions
  • 001 MapFlatMap Transformation
  • 002 FilterIntersection
  • 003 UnionDistinct Transformation
  • 004 GroupByKey Group people based on Birthday months
  • 005 ReduceByKey Total Number of students in each Subject
  • 006 SortByKey Sort students based on their rollno
  • 007 MapPartition MapPartitionWithIndex
  • 008 Change number of Partitions
  • 009 Join join email address based on customer name
  • 010 Spark Actions

  • 12 - Spark RDD Practice
  • 001 Scala Tuples
  • 001 spark-dataset-20200505T025156Z-001.zip
  • 002 Filter Error Logs
  • 003 Frequency of word in Text File
  • 004 Population of each city
  • 005 Orders placed by Customers
  • 006 average rating of movie

  • 13 - Spark Dataframes And Spark SQL
  • 001 Dataframe Intro
  • 001 Spark-Dataframe.pdf
  • 001 dataframe-dataset-20200505T025651Z-001.zip
  • 002 Dafaframe from Json Files
  • 003 Dataframe from Parquet Files
  • 004 Dataframe from CSV Files
  • 005 Dataframe from Avro File
  • 006 Working with XML
  • 007 Working with Columns
  • 008 Working with String
  • 009 Working with Dates
  • 010 Dataframe Filter API
  • 011 DataFrame API Part1
  • 012 DataFrame API Part2
  • 013 Spark SQL
  • 014 Working with Hive Tables in Spark
  • 015 Datasets versus Dataframe
  • 016 User Defined Functions (UDFS)

  • 14 - Using Intellij IDE
  • 001 Intellij Setup
  • 002 Project Setup
  • 003 Writing first Spark program on IDE
  • 004 Understanding spark configuration
  • 005 Adding ActionsTransformations
  • 006 Understanding Execution Plan

  • 15 - Running Spark on EMR (AWS Cloud)
  • 001 EMR Cluster Overview
  • 002 Cluster Setup
  • 003 Setting Spark Code for EMR
  • 004 Using Spark-submit
  • 005 Running Spark on EMR Cluster

  • 16 - Spark with Cassandra
  • 001 Cassandra Course.html
  • 002 Creating Spark RDD from Cassandra Table
  • 003 Processing Cassandra data in Spark
  • 004 Cassandra Rows to Case Class
  • 005 Saving Spark RDD to Cassandra

  • 17 - Getting Started with MongoDB
  • 001 MongoDB Intro
  • 001 mongo-commads3.docx
  • 001 mongo-commands1.zip
  • 001 movies.zip
  • 002 MongoDB Usecase And Limitations
  • 003 MongoDB Installation

  • 18 - CRUD Operations
  • 001 Find
  • 002 Find With Filter
  • 003 Insert
  • 004 Update
  • 005 Update Continues
  • 006 Projections
  • 007 Delete

  • 19 - Working with Operators
  • 001 In not in Operators
  • 002 gte lte Operators
  • 003 and or operators
  • 004 regex operator

  • 20 - MongoDB Compass
  • 001 Working with GUI

  • 21 - Advanced Mongo
  • 001 ValidationSchema
  • 002 Working with Indexes

  • 22 - Spark with Mongo
  • 001 Spark Mongo Integration.html
  • 45,900 تومان
    بیش از یک محصول به صورت دانلودی میخواهید؟ محصول را به سبد خرید اضافه کنید.
    خرید دانلودی فوری

    در این روش نیاز به افزودن محصول به سبد خرید و تکمیل اطلاعات نیست و شما پس از وارد کردن ایمیل خود و طی کردن مراحل پرداخت لینک های دریافت محصولات را در ایمیل خود دریافت خواهید کرد.

    ایمیل شما:
    تولید کننده:
    مدرس:
    شناسه: 18341
    حجم: 3260 مگابایت
    مدت زمان: 658 دقیقه
    تاریخ انتشار: 3 شهریور 1402
    طراحی سایت و خدمات سئو

    45,900 تومان
    افزودن به سبد خرید