وب سایت تخصصی شرکت فرین
دسته بندی دوره ها

Introduction to Spark SQL and DataFrames

سرفصل های دوره

Explore DataFrames, a widely used data structure in Apache Spark. DataFrames allow Spark developers to perform common data operations, such as filtering and aggregation, as well as advanced data analysis on large collections of distributed data. With the addition of Spark SQL, developers have access to an even more popular and powerful query language than the built-in DataFrames API. In this course, instructor Dan Sullivan shows how to perform basic operations—loading, filtering, and aggregating data in DataFrames—with the API and SQL, as well as more advanced techniques that are easily performed in SQL. In this section of the course, Dan explains how to join data, eliminate duplicates, and deal with null or NA values. The lessons conclude with three in-depth examples of using DataFrames for data science: exploratory data analysis, time series analysis, and machine learning.


01 - Introduction
  • 01 - Apache Spark SQL and data analysis
  • 02 - What you should know

  • 02 - 1. Introduction to Spark DataFrames
  • 01 - Introduction to DataFrames
  • 02 - SQL for DataFrames

  • 03 - 2. Installing Spark
  • 01 - Install Spark
  • 02 - Install PySpark
  • 03 - Using Jupyter notebooks with PySpark

  • 04 - 3. Getting Started with Spark DataFrames
  • 01 - Set up a Jupyter notebook
  • 02 - Load data into DataFrames CSV Files
  • 03 - Load data into DataFrames JSON Files
  • 04 - Basic DataFrame operations
  • 05 - Filter data with DataFrame API
  • 06 - Aggregate data with DataFrame API
  • 07 - Sample data from DataFrames
  • 08 - Save data from DataFrames

  • 05 - 4. SQL for DataFrames
  • 01 - Querying DataFrames with SQL
  • 02 - Filtering DataFrames with SQL
  • 03 - Aggregating Data with SQL
  • 04 - Joining DataFrames with SQL
  • 05 - Eliminating duplicates in DataFrames
  • 06 - Working with NA values in DataFrames

  • 06 - 5. Data Analysis with Spark
  • 01 - Exploratory data analysis with DataFrames
  • 02 - Exploratory data analysis with Spark SQL
  • 03 - Timeseries analysis with DataFrames
  • 04 - Basic machine learning with DataFrames, part 1
  • 05 - Basic machine learning with DataFrames, part 2

  • 07 - Conclusion
  • 01 - Next steps
  • 45,900 تومان
    بیش از یک محصول به صورت دانلودی میخواهید؟ محصول را به سبد خرید اضافه کنید.
    خرید دانلودی فوری

    در این روش نیاز به افزودن محصول به سبد خرید و تکمیل اطلاعات نیست و شما پس از وارد کردن ایمیل خود و طی کردن مراحل پرداخت لینک های دریافت محصولات را در ایمیل خود دریافت خواهید کرد.

    ایمیل شما:
    تولید کننده:
    مدرس:
    شناسه: 24108
    حجم: 275 مگابایت
    مدت زمان: 114 دقیقه
    تاریخ انتشار: 12 آذر 1402
    طراحی سایت و خدمات سئو

    45,900 تومان
    افزودن به سبد خرید