Query External Data with ORC, Parquet, or Avro Source Files
Autonomous Database makes it easy to access ORC,
Parquet, or Avro data stored in object store using external tables. ORC, Parquet, and Avro
sources have metadata embedded in them and the DBMS_CLOUD.CREATE_EXTERNAL_TABLE
procedure can utilize this metadata
to simplify the creation of external tables.
You don’t need to know the structure of the data, DBMS_CLOUD
can examine the file and convert ORC, Parquet, or Avro contents into the equivalent Oracle
columns and data types. You only need to know the location of the data in object store,
specify its type, ORC, Parquet, or Avro, and have credentials to access the source file on
your object store.
The steps to use external tables are very similar for ORC, Parquet, and Avro. These steps show working with a Parquet format source file.
The source file in this example, sales_extended.parquet
,
contains Parquet format data. To query this file in Autonomous Database, do the following:
See CREATE_EXTERNAL_TABLE Procedure for Avro, ORC, or Parquet Files and COPY_DATA Procedure for Avro, ORC, or Parquet Files for more information.
See DBMS_CLOUD URI Formats for information on supported cloud object storage services.
- Query External Data with ORC, Parquet, or Avro Source Files and Explicitly Set the Text Column Size
When you access ORC, Parquet, or Avro data stored in object store using external tables on Autonomous Database, you can either automatically or explicitly set the maximum size of text columns.
Parent topic: Query External Data with Autonomous Database
Query External Data with ORC, Parquet, or Avro Source Files and Explicitly Set the Text Column Size
When you access ORC, Parquet, or Avro data stored in object store using external tables on Autonomous Database, you can either automatically or explicitly set the maximum size of text columns.
By default the text column size is set based on the value of
MAX_STRING_SIZE
.
The source file in this example, sales_extended.parquet
,
contains Parquet format data. To query this file in Autonomous Database and set the maximum text column size, do the following:
See CREATE_EXTERNAL_TABLE Procedure for Avro, ORC, or Parquet Files and COPY_DATA Procedure for Avro, ORC, or Parquet Files for more information.
See DBMS_CLOUD URI Formats for information on supported cloud object storage services.