Skip to content

Sources

This page is generated from the PhiSQL spec artifacts for v1.0. Do not edit it by hand; change the artifact and rebuild.

Discovery statements (FIND PII, DISCOVER ENTITIES, SCAN) name their source with an IN '<uri>' clause. These are the recognized URI schemes. A conforming engine that does not support a scheme must reject the statement rather than silently no-op.

Scheme Source URI template Example
s3 Amazon S3 s3://<bucket>/<prefix> s3://prod-data/customers/
gs Google Cloud Storage gs://<bucket>/<prefix> gs://analytics-exports/2026/
azure Azure Blob Storage azure://<account>.<container>/<prefix> azure://acmestorage.uploads/customer-files/
file Local filesystem file://<absolute-path> file:///mnt/exports/intake/
postgres PostgreSQL postgres://<host>[:<port>]/<database>[?schema=<schema>&table=<table>] postgres://warehouse.internal/customer_db?schema=public&table=orders
mysql MySQL mysql://<host>[:<port>]/<database>[?table=<table>] mysql://reporting.internal/analytics?table=events
snowflake Snowflake snowflake://<account>/<database>.<schema>[.<table>] snowflake://xy12345/PROD.RAW.CUSTOMERS
bigquery Google BigQuery bigquery://<project>/<dataset>[.<table>] bigquery://acme-prod/analytics.events

Amazon S3 (s3)

Amazon S3 objects. The bucket is the URI host; the prefix is the URI path. A trailing slash on the prefix scans the entire prefix.

  • URI template: s3://<bucket>/<prefix>
  • Example: s3://prod-data/customers/

Google Cloud Storage (gs)

Google Cloud Storage objects. Mirrors the S3 layout.

  • URI template: gs://<bucket>/<prefix>
  • Example: gs://analytics-exports/2026/

Azure Blob Storage (azure)

Azure Blob Storage objects. The host carries the storage account and container; the path is the blob prefix.

  • URI template: azure://<account>.<container>/<prefix>
  • Example: azure://acmestorage.uploads/customer-files/

Local filesystem (file)

Local filesystem. Absolute paths only; relative paths are rejected by conforming engines.

  • URI template: file://<absolute-path>
  • Example: file:///mnt/exports/intake/

PostgreSQL (postgres)

A PostgreSQL database. The path component selects the database; query parameters narrow the scan to a schema or a specific table.

  • URI template: postgres://<host>[:<port>]/<database>[?schema=<schema>&table=<table>]
  • Example: postgres://warehouse.internal/customer_db?schema=public&table=orders

MySQL (mysql)

A MySQL database. Same pattern as postgres but without a schema layer.

  • URI template: mysql://<host>[:<port>]/<database>[?table=<table>]
  • Example: mysql://reporting.internal/analytics?table=events

Snowflake (snowflake)

A Snowflake table or schema. The account is the URI host.

  • URI template: snowflake://<account>/<database>.<schema>[.<table>]
  • Example: snowflake://xy12345/PROD.RAW.CUSTOMERS

Google BigQuery (bigquery)

A BigQuery dataset or table.

  • URI template: bigquery://<project>/<dataset>[.<table>]
  • Example: bigquery://acme-prod/analytics.events