Data Productivity Cloud Pipeline

Author: Matillion
Date Posted: Aug 22, 2024
Last Modified: Mar 28, 2025

Load streaming data from cloud storage

Process the latest files from cloud storage to maintain tables in your cloud data platform.

If you have configured a streaming pipeline with Amazon S3 or Azure Blob as the destination, these Data Productivity Cloud pre-built pipelines can be used to load the Avro files into Snowflake, Databricks or Amazon Redshift.


Requirements

To load the files into Snowflake, you can use a Data Productivity Cloud project configured with either a Full SaaS or Hybrid SaaS agent.

To load the files into Databricks or Amazon Redshift, you must use a Data Productivity Cloud project configured with a Hybrid SaaS agent.


Installation

  1. Open a branch on your Data Productivity Cloud project.
  2. Click “Add > Browse Exchange”
Image ofBrowse the Matillion Exchange to find pipelines to import
Browse the Matillion Exchange to find pipelines to import
  1. Search for “streaming” and select the pipeline to import it into your project.
Image ofFilter the pipelines on the Matillion Exchange
Filter the pipelines on the Matillion Exchange
  1. You should now have a folder named “Imported from Exchange > Load Streaming Data from Cloud Storage” containing the latest versions of these pipelines.
Image ofImported pipelines in your project
Imported pipelines in your project

Usage

Open the orchestration pipeline “Imported from Exchange > Load Streaming Data from Cloud Storage > Example”.

Follow the instructions on the notes in this pipeline to copy the Run Orchestration component into your own orchestration pipeline, and configure it to load your Avro files into your data platform.

See the Matillion documentation for full descriptions of the parameters.

Image ofConfigure the Run Orchestration component to load the files into your data platform
Configure the Run Orchestration component to load the files into your data platform

Downloads

Licensed under: Matillion Free Subscription License

Installation Instructions

How to Install a Data Productivity Cloud Pipeline