Author: Matillion
Date Posted: Aug 22, 2024
Last Modified: Mar 28, 2025
Load streaming data from cloud storage
Process the latest files from cloud storage to maintain tables in your cloud data platform.
If you have configured a streaming pipeline with Amazon S3 or Azure Blob Storage as the destination, you can use these pre-built Data Productivity Cloud pipelines to load the resulting Avro files into Snowflake, Databricks, or Amazon Redshift.
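Conceptually, the pre-built pipelines perform an incremental load: on each run they pick up only the storage objects written since the last successful load and apply them in order. The sketch below illustrates that pattern in plain Python under stated assumptions; the object listing, field names, and `files_to_load` helper are hypothetical and not part of the Matillion pipelines themselves.

```python
from datetime import datetime, timezone

def files_to_load(objects, high_water_mark):
    """Return storage objects modified after the last successful load,
    oldest first, so they can be applied in order."""
    new_files = [o for o in objects if o["last_modified"] > high_water_mark]
    return sorted(new_files, key=lambda o: o["last_modified"])

# Hypothetical listing of Avro files in an S3 bucket or Blob container.
listing = [
    {"key": "orders/part-0001.avro",
     "last_modified": datetime(2025, 3, 27, 9, 0, tzinfo=timezone.utc)},
    {"key": "orders/part-0002.avro",
     "last_modified": datetime(2025, 3, 28, 9, 0, tzinfo=timezone.utc)},
]
mark = datetime(2025, 3, 27, 12, 0, tzinfo=timezone.utc)
print([o["key"] for o in files_to_load(listing, mark)])
# → ['orders/part-0002.avro']
```

In practice the pre-built pipelines handle this bookkeeping for you; the sketch only shows why each run loads the latest files rather than re-reading the whole bucket or container.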
Requirements
To load the files into Snowflake, you can use a Data Productivity Cloud project configured with either a Full SaaS or Hybrid SaaS agent.
To load the files into Databricks or Amazon Redshift, you must use a Data Productivity Cloud project configured with a Hybrid SaaS agent.
Installation
- Open a branch on your Data Productivity Cloud project.
- Click “Add > Browse Exchange”.

- Search for “streaming” and select the pipeline to import it into your project.
- You should now have a folder named “Imported from Exchange > Load Streaming Data from Cloud Storage” containing the latest versions of these pipelines.
Usage
Open the orchestration pipeline “Imported from Exchange > Load Streaming Data from Cloud Storage > Example”.
Follow the instructions in the notes in this pipeline to copy the Run Orchestration component into your own orchestration pipeline, then configure it to load your Avro files into your data platform.
See the Matillion documentation for full descriptions of the parameters.

Downloads
Licensed under: Matillion Free Subscription License
- Download pre_built_pipelines_streaming_databricks_20250328.zip
- Target: Databricks
- Download pre_built_pipelines_streaming_redshift_20250328.zip
- Target: Redshift
- Download pre_built_pipelines_streaming_snowflake_20250328.zip
- Target: Snowflake