| by Arround The Web

How to Install Apache Spark on Ubuntu 22.04

Apache Spark is a free, open-source, and general-purpose data processing engine used by data scientists to perform extremely fast data queries on a large amount of data. This guide will show you how to install Apache Spark on Ubuntu 22.04.
The post How…

Share Button
Read More
| by Arround The Web

PySpark radians() and degrees() Functions

Guide on using the radians() function to return the radians and the degrees() function to return the degrees of any given number present in a DataFrame column.

Share Button
Read More
| by Arround The Web

PySpark desc_nulls_first() and desc_nulls_last() Functions

A guide on dealing with null while sorting the values in a Dataframe using the desc_nulls_first() and desc_nulls_last() functions.

Share Button
Read More
| by Arround The Web

Return Top & Last Rows From PySpark Pandas DataFrame

This article explains how to display the top and last rows from the pyspark pandas dataframe using head() and tail() functions.

Share Button
Read More
| by Arround The Web

Different Ways to Create PySpark DataFrame

This discussed methods to create PySpark DataFrame: list of tuples, tuple of lists, nested tuple, nested list use, and columns list to provide column names.

Share Button
Read More
| by Arround The Web

Check the Given Data is PySpark RDD or DataFrame

This is on ways to check if object is an RDD or a DataFrame. If the object type is the same, then it will return True, otherwise False.

Share Button
Read More
| by Arround The Web

PySpark SQL Case When

Tutorial on how to check multiple conditions and return the first element that follows the condition using PySpark and the SQL Case in PySpark When.

Share Button
Read More
| by Arround The Web

PySpark Fillna() Method

This tutorial teaches the PySpark fillna method that is used to fill the null values in the dataframe with custom values to clean the data in a calculation.

Share Button
Read More
| by Arround The Web

How to Install and Setup Apache Spark on Ubuntu/Debian

The post How to Install and Setup Apache Spark on Ubuntu/Debian first appeared on Tecmint: Linux Howtos, Tutorials & Guides .
Apache Spark is an open-source distributed computational framework that is created to provide faster computational results. It…

Share Button
Read More