How to convert a column value to list in PySpark Azure Databricks?

Are you looking to find how to convert a column to a list of PySpark Dataframe in Azure Databricks cloud using RDD or maybe you are looking for a solution, to convert a column to a list of PySpark Databricks in Azure Databricks using DataFrame? If you are looking for any of these problem solutions, […]
How to convert time of StringType into TimestampType in PySpark Azure Databricks?

Are you looking to find out how to convert date time column of string datatype to timestamp format in PySpark using Azure Databricks cloud or maybe you are looking for a solution, to format date time column of StringType to PySpark’s TimestampType format in PySpark Databricks using the to_timestamp() function? If you are looking for […]
How to cache RDD and DataFrame in PySpark Azure Databricks?

Are you looking to find out how to cache RDD in PySpark on the Azure Databricks cloud or maybe you are looking for a solution, to cache DataFrame in PySpark on Azure Databricks? If you are looking for any of these problem solutions, you have landed on the correct page. I will also show you […]
How to use the count() function in PySpark Azure Databricks?

Are you looking to find how to use the count() function of a PySpark Dataframe using Azure Databricks cloud or maybe you are looking for a solution, to count grouped records of a Dataframe in PySpark using Azure Databricks? If you are looking for any of these problem solutions, then you have landed on the […]
Explained: How to convert PySpark RDD to DataFrame in Databricks

Are you looking to find how to convert pyspark rdd’s into the dataframe in Azure Databricks cloud or maybe you are looking for a solution, to creating the Dataframe from pysprk rdd in the Databricks? If you are looking for any of these problem solutions then you have landed on the correct page. I will […]
Create Dataframe in Azure Databricks with Example

Creating dataframe in the Databricks is one of the starting step in your data engineering workload. In this blog post I will explain how you can create the Azure Databricks pyspark based dataframe from multiple source like RDD, list, CSV file, text file, Parquet file or may be ORC or JSON file. Let’s go step […]
WithColumn() Usage in Databricks with Examples

Are you wondering on how to change the column datatype or may be you want to modify the value of the exisiting column of the dataframe in the Azure Databricks. Then you have reached to right blog post. In this aricle I will take you through step by step guide on how you can use […]
How to Collect() – Retrieve data from DataFrame in Databricks

Are you looking or ways to get the data or retrieve the information from the Azure Databricks dataframe or may be you are looking to find the different between the collect and select in the Azure Databricks then you and landed to the right page. In this post I will explain you step by step […]
How to Select Columns From DataFrame in Databricks

In this article I will explain how to use the select function in the pyspark Azure Databricks. I will also show you how to select the specific column from dataframe in Databricks. Let’s get into the step by step details of using the select function in Databricks Azure Databricks Spark Tutorial for beginner to advance […]
Lesson 7: Azure Databricks Spark Tutorial – Spark SQL

In this lesson 7 of our Azure Spark tutorial series I will take you through Spark SQL detailed understanding of concepts with practical examples. I will also take you through how you can leverage your SQL knowledge and power of spark spark sql to solve complex business problem statement. You can use spark SQL both […]