skip to Main Content

Azure – DataBricks Pyspark DATEADD

I'm trying to filter out data from the current date to last 3 years and trying to use this for Pyspark dataframe. below sql query needs to convert into Pyspark dataframe format date >= dateadd(month,-4,current_date) How to write above sql…

VIEW QUESTION

Pyspark – JSON string column explode into multiple without mentioning schema

I have below JSON string as a column in a pyspark dataframe. { "result":{ "version":"1.2", "timeStamp":"2023-08-14 14:00:12", "description":"", "data":{ "DateTime_Received":"2023-08-14T14:01:10.4516457+01:00", "DateTime_Actual":"2023-08-14T14:00:12", "OtherInfo":null, "main":[ { "Status":0, "ID":111, "details":null } ] }, "tn":"aaa" } } I want to explode the above one…

VIEW QUESTION
Back To Top
Search