Json – Snowflake – FLATTEN() – How does it work internally?

Billie_H
December 13, 2023
78 views
0 votes
2 Answers

I am learning Snowflake right now, and the way FLATTEN() works is a bit counterintuitive.

A simple query.

SELECT
    raw:first_name::STRING AS FName,
    raw:last_name::STRING AS LName,
    f.value::STRING AS Skill
FROM tbl,
    TABLE(FLATTEN(raw:Skills)) f
ORDER BY raw:id::INT
LIMIT 5;

Elegantly, it flattens the Skills array and returns this.

FNAME	LNAME	SKILL
Flossy	Fasson	PS3
Flossy	Fasson	Vlookup
Flossy	Fasson	Go
Celeste	Hubert	Tcl-Tk
Celeste	Hubert	Zines

My questions are:

1. Does Snowflake just infer that the source of raw:Skills is table tbl, as table name is not explicitly expressed here?
2. How does Snowflake align the array (as the right table) with the left table? It looks like a cross join with no join keys; if so, every array would be joined to each and every row on the left, which would give incorrect alignment.

Answers

- LukaszSzozda
- December 13, 2023 at 9:39 am
- 0 votes
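> Does Snowflake just infer that the source of raw:Skills is table tbl, as table name is not explicitly expressed here?

Yes. Spelled out with the table reference and the lateral join made explicit, it looks like this: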
```
SELECT
    raw:first_name::STRING AS FName,
    raw:last_name::STRING AS LName,
    f.value::STRING AS Skill
FROM tbl, 
    LATERAL TABLE(FLATTEN(input=> tbl.raw:Skills)) f
```
Related: Lateral Join
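
To make the alignment question concrete, here is a minimal Python sketch of what a lateral flatten does conceptually, next to what a truly uncorrelated cross join would do. It is only a toy model, not Snowflake's actual implementation. The `rows` list, the `id` values and both helper functions are made up for illustration; only the names and skills are taken from the output in the question.

```python
# Toy model of `FROM tbl, LATERAL FLATTEN(input => tbl.raw:Skills) f`.
# `rows` is hypothetical stand-in data for tbl.raw (ids are assumed).
rows = [
    {"id": 1, "first_name": "Flossy", "last_name": "Fasson",
     "Skills": ["PS3", "Vlookup", "Go"]},
    {"id": 2, "first_name": "Celeste", "last_name": "Hubert",
     "Skills": ["Tcl-Tk", "Zines"]},
]


def lateral_flatten(rows):
    """Correlated expansion: each row is paired only with its own array elements."""
    return [
        (r["first_name"], r["last_name"], skill)
        for r in rows
        for skill in r["Skills"]
    ]


def uncorrelated_cross_join(rows):
    """Every row paired with every skill from every row (the feared misalignment)."""
    all_skills = [s for r in rows for s in r["Skills"]]
    return [
        (r["first_name"], r["last_name"], s)
        for r in rows
        for s in all_skills
    ]


lateral = lateral_flatten(rows)
crossed = uncorrelated_cross_join(rows)

for fname, lname, skill in lateral:
    print(fname, lname, skill, sep="\t")
```

In this model `lateral` has one entry per array element, while `crossed` has the number of rows times the total number of skills. The join only looks key-less: the "key" is the correlation to the current row, because the function's input is an expression over that row.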

- SimeonPilgrim
- December 13, 2023 at 9:53 am
- 0 votes
The first question is the same "inference" as when you use a column name without the table name or the alias, so these are all the same:
```
select a 
from table_name
```
```
select a 
from table_name as t
```
```
select t.a 
from table_name as t
```
```
select table_name.a 
from table_name
```
Where this gets tricky is when two tables have the same column name; then you must say which one you want:
```
select id 
from table_a_name
cross join table_b_name
```
If they both have an id column, this will not compile.
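
The same name-resolution rule can be tried locally. The sketch below uses Python's built-in `sqlite3` module, so it shows SQLite's behaviour rather than Snowflake's, and the error text will differ between engines. The extra columns `a` and `b` and the inserted values are invented for the demo.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE table_a_name (id INTEGER, a TEXT);
    CREATE TABLE table_b_name (id INTEGER, b TEXT);
    INSERT INTO table_a_name VALUES (1, 'x');
    INSERT INTO table_b_name VALUES (1, 'y');
""")

# `a` exists in only one of the two tables, so it resolves without a prefix.
unqualified = conn.execute(
    "SELECT a FROM table_a_name CROSS JOIN table_b_name"
).fetchall()

# `id` exists in both tables, so the engine refuses to guess.
try:
    conn.execute("SELECT id FROM table_a_name CROSS JOIN table_b_name").fetchall()
    error_message = None
except sqlite3.OperationalError as exc:
    error_message = str(exc)

# Qualifying the column removes the ambiguity.
qualified = conn.execute(
    "SELECT table_a_name.id FROM table_a_name CROSS JOIN table_b_name"
).fetchall()

print(unqualified, error_message, qualified)
```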

In some databases, if you join two tables on id:
```
select id 
from table_a_name
join table_b_name 
   on table_a_name.id = table_b_name.id
```
the unqualified id is accepted, but not in Snowflake.

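FLATTEN behaves like a lateral join: every input row is matched with each row expanded from its own array, and you get the columns of both the source table and the flattened object. By default it acts like an inner join, so a row whose array is empty or missing is dropped; with OUTER => TRUE it acts like a LEFT JOIN and keeps that row with NULLs. You can also chain many FLATTENs, and the result is just the combinations, aka cross joins all the way down.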
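
A small Python sketch of those join semantics, under stated assumptions: the `flatten` helper and its `outer` flag are a toy stand-in for FLATTEN and its OUTER argument (which, as far as I know, defaults to FALSE, so rows with nothing to expand are omitted). The `Langs` array and all the data are hypothetical.

```python
from itertools import product

# Hypothetical rows; "Langs" is an invented second array to show chaining.
rows = [
    {"name": "Flossy", "Skills": ["PS3", "Go"], "Langs": ["en", "fr"]},
    {"name": "Celeste", "Skills": [], "Langs": ["en"]},
]


def flatten(rows, key, outer=False):
    """Yield (row, value) pairs, expanding each row's own array only."""
    for row in rows:
        values = row.get(key) or []
        if values:
            for value in values:
                yield row, value
        elif outer:
            # Mimics OUTER => TRUE: keep the row once, with a NULL value.
            yield row, None


# Default behaviour: the row with an empty array disappears (inner-join-like).
inner = [(r["name"], v) for r, v in flatten(rows, "Skills")]

# outer=True: the row survives with None (left-join-like).
outer = [(r["name"], v) for r, v in flatten(rows, "Skills", outer=True)]

# Two flattens on the same row: every combination of that row's elements.
chained = [
    (r["name"], skill, lang)
    for r in rows
    for skill, lang in product(r["Skills"], r["Langs"])
]

print(inner)
print(outer)
print(chained)
```

The chained case is where the "cross joins all the way down" shows up: each extra flatten multiplies the rows, but only within a single source row.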

The comma syntax you have shown is the "old SQL style"; the newer style would be an explicit CROSS JOIN, but LATERAL is also valid and is more akin to what is actually happening. Snowflake supports many dialect features from other RDBMSs to make porting SQL easier, but it does not cover all cases.