Storing SHOW/LIST/DESCRIBE Command Results in Snowflake Tables

Data

Storing SHOW/LIST/DESCRIBE Command Results in Snowflake Tables

In Snowflake, there are several commands you can use to retrieve metadata from the Snowflake environment without using a virtual warehouse. The following are all examples that fit this approach:

  • SHOW – List all objects of a certain type within the account or within a given database/schema
  • DESCRIBE –  Retrieve metadata information about a specific object
  • LIST – List files within a Snowflake stage

Since these commands do not fit the general format of SELECT … FROM …  it is harder to store the results somewhere. You cannot simply use INSERT INTO … SELECT … FROM …

Fortunately, there is a way around this. Actually, this blog includes two ways, since the first may not cover your requirements sufficiently. For our simple demonstration, we will leverage the SHOW USERS command.

The Quick Way

The fastest way to store results from one of these commands in a table is to combine the RESULT_SCAN() function with the LAST_QUERY_ID( ) function. This can be achieved using the following steps.

Step 1 – Execute Your SQL Command

Execute your SQL command to retrieve the initial results. You don’t need to worry about actually reading these results, so you can execute this via an integration tool and simply not look at the result set. In other words, don’t worry about being able to visibly see the results of this query:

SHOW USERS;

Step 2 – Create a New Table Containing the Result Set of Your Last Query

As mentioned above, we combine the RESULT_SCAN( ) function with the LAST_QUERY_ID( ) function to retrieve the result set from the last query in a standard SELECT … FROM … format, thus creating a new table that contains our results:

CREATE TABLE MY_TABLE
AS
SELECT *
FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
;

Summary

So that’s it. In only two steps, we can insert the results of a SHOW/DESCRIBE/LIST command into a table in Snowflake. Depending on your use case, you may find this is all you need. As long as the second step is executed directly after the first and you are running in the same Snowflake session, this will pick up the results of that initial command and drop them into the desired table.

The Safer and More Methodical Approach

There are several flaws with the quick method I have outlined above. Executing any other query between step 1 and step 2 will inevitably result in an undesired output from the LAST_QUERY_ID( ) function and break the process. If you are running a series of scripts concurrently that each leverage the same session (as is often the case when your queries are compiled and executed by third-party orchestration tools) then you may find that your LAST_QUERY_ID( ) function points at a query from another thread in the session. You may also find the quick method fails if your session is interrupted.

To mitigate these issues, a more methodical approach can be configured that can maintain idempotency and improve reliability.

Step 1 – Execute Your SQL Command

As before, execute your SQL command to retrieve the initial results. You still don’t need to worry about actually reading these results, so you can execute this via an integration tool and not look at the result set. You don’t have to worry about being able to visibly see the results of this query:

SHOW USERS;

Step 2 – Store the Query ID of Your Command in a Variable

Use a SQL query, such as the following, to safely retrieve and store the query ID from the execution of your SQL command:

SET query_id = (
  SELECT QUERY_ID
  FROM TABLE(INFORMATION_SCHEMA.QUERY_HISTORY())
  WHERE QUERY_TEXT = 'SHOW USERS;'
  ORDER BY START_TIME DESC
  LIMIT 1
)

Note that this example uses the QUERY_HISTORY( ) view; however, you may prefer to use one of the alternatives for your specific use case.

Step 3 – Create a New Table Containing the Result Set of Your Specific Query

Now that we have the query ID for our query, we can combine it with the RESULT_SCAN( ) function as before to retrieve the result set in a standard SELECT … FROM … format and create a new table that contains our results:

CREATE TABLE MY_TABLE
AS
SELECT *
FROM TABLE(RESULT_SCAN($query_id)
;

Summary

It may have taken an additional step, but our process is now more robust whilst still being succinct. We can now insert the results of a SHOW/DESCRIBE/LIST command into a table in Snowflake, easily ensuring that we retrieve the correct result set by specifying the query text that relates to our query ID.

The main beauty of this methodical approach is that it can be integrated into third-party orchestration tools that run multiple flows concurrently, without risking overlap or collision. If you need to improve the specificity of step 2, you can add additional entries to the WHERE clause, such as the session’s database name, schema name, query type, warehouse, start time or even a specific query tag!

More About the Author

Chris Hastie

Data Lead
Configure Azure Private Endpoints for Snowflake Internal Stages Snowflake and Azure are incredibly capable of storing data securely. You can see all reports available to customers through their pages ...
Configure Azure Private Link Connectivity with Snowflake Snowflake and Azure are incredibly capable of storing data securely. You can see all reports available to customers through their pages ...

See more from this author →

InterWorks uses cookies to allow us to better understand how the site is used. By continuing to use this site, you consent to this policy. Review Policy OK

×

Interworks GmbH
Ratinger Straße 9
40213 Düsseldorf
Germany
Geschäftsführer: Mel Stephenson

Kontaktaufnahme: markus@interworks.eu
Telefon: +49 (0)211 5408 5301

Amtsgericht Düsseldorf HRB 79752
UstldNr: DE 313 353 072

×

Love our blog? You should see our emails. Sign up for our newsletter!