Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Applies to:
Databricks Runtime
Overwrites the existing data in the directory with the new values using Hive SerDe.
Hive support must be enabled to use this command. You specify the inserted rows by value expressions or the result of a query.
Syntax
INSERT OVERWRITE [ LOCAL ] DIRECTORY directory_path
[ ROW FORMAT row_format ] [ STORED AS file_format ]
{ VALUES ( { value | NULL } [ , ... ] ) [ , ( ... ) ] | query }
Parameters
directory_path
The destination directory. The
LOCALkeyword specifies that the directory is on the local file system.row_format
The row format for this insert. Valid options are
SERDEclause andDELIMITEDclause.SERDEclause can be used to specify a customSerDefor this insert. Alternatively,DELIMITEDclause can be used to specify the nativeSerDeand state the delimiter, escape character, null character, and so on.file_format
The file format for this insert. Valid options are
TEXTFILE,SEQUENCEFILE,RCFILE,ORC,PARQUET, andAVRO. You can also specify your own input and output format usingINPUTFORMATandOUTPUTFORMAT.ROW FORMAT SERDEcan only be used withTEXTFILE,SEQUENCEFILE, orRCFILE, whileROW FORMAT DELIMITEDcan only be used withTEXTFILE.VALUES ( { value | NULL } [ , … ] ) [ , ( … ) ]
The values to be inserted. Either an explicitly specified value or a NULL can be inserted. A comma must be used to separate each value in the clause. More than one set of values can be specified to insert multiple rows.
query
A query that produces the rows to be inserted. One of following formats:
- A
SELECTstatement - A
TABLEstatement - A
FROMstatement
- A
Examples
INSERT OVERWRITE LOCAL DIRECTORY '/tmp/destination'
STORED AS orc
SELECT * FROM test_table;
INSERT OVERWRITE LOCAL DIRECTORY '/tmp/destination'
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
SELECT * FROM test_table;