Matillion ETL Shared Job

Author: Matillion
Date Posted: Nov 14, 2023
Last Modified: Nov 14, 2023

Extract Password Protected Zip File

Extract all files from a password protected zip file.

The source zip file must be in S3 cloud storage. The contents of the file are extracted into the target S3 bucket, optionally with a path prefix. Any paths in the zip file are respected during the copy.
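The mapping from a path inside the zip file to an S3 object key can be sketched as follows. This is only an illustration of the behaviour described above, not the shared job's actual implementation; the function name `target_key` and the sample paths are hypothetical.

```python
import posixpath


def target_key(member_path: str, prefix: str = "") -> str:
    """Build the S3 object key for one zip member (illustrative helper)."""
    # Zip member names use forward slashes; tidy the path and drop any
    # leading "/" or "./" so the key is relative to the bucket root.
    member = posixpath.normpath(member_path).lstrip("/")
    # The target prefix is optional; surrounding slashes are ignored.
    prefix = prefix.strip("/")
    return f"{prefix}/{member}" if prefix else member


# Paths inside the zip are preserved beneath the optional prefix.
print(target_key("data/2023/a.csv", "landing/unzipped"))
print(target_key("data/2023/a.csv"))
```

With a blank prefix the key is simply the path stored in the zip file, which matches the "root folder of the target bucket" case.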

Parameters

Parameter: Description
Source Bucket Name: Name of the S3 bucket containing the .zip file. Do not include the s3:// prefix. Do not include the object path.
Source zip file name: The .zip file name, including path if any.
Target Bucket Name: Name of the target S3 bucket. Do not include the s3:// prefix. Do not include a path.
Target prefix: The target path. This parameter is optional. If you leave it blank, the files will be unzipped into the root folder of the target bucket.
Zip password: The password for the .zip file.
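The bucket name rules above can be checked before running the job. The following is a minimal sketch of such a check; `check_bucket_name` is a hypothetical helper and is not part of the shared job.

```python
def check_bucket_name(value: str) -> str:
    """Return the bare bucket name, or raise ValueError if it breaks the rules."""
    name = value.strip()
    if not name:
        raise ValueError("bucket name is empty")
    if name.lower().startswith("s3://"):
        raise ValueError("do not include the s3:// prefix")
    if "/" in name:
        raise ValueError("do not include a path in the bucket name")
    return name


# Hypothetical values: the first is accepted, the second is rejected.
print(check_bucket_name("my-source-bucket"))
try:
    check_bucket_name("s3://my-source-bucket/incoming")
except ValueError as err:
    print("rejected:", err)
```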

Warnings

This shared job temporarily saves the original zip file onto the local disk of your Matillion ETL instance. You must ensure there is enough space in your filesystem for this.
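A rough way to check for free space ahead of time is sketched below. The directory the shared job actually writes to is not stated here, so the `/tmp` default is an assumption, as is the 10% headroom; `has_room_for` is a hypothetical helper.

```python
import shutil


def has_room_for(zip_size_bytes: int, directory: str = "/tmp", headroom: float = 1.1) -> bool:
    """Return True if `directory` has enough free space for the zip file.

    `directory` and `headroom` are assumptions for illustration only.
    """
    free_bytes = shutil.disk_usage(directory).free
    return free_bytes >= zip_size_bytes * headroom


# Example: is there room for a 2 GiB zip file in the current directory?
print(has_room_for(2 * 1024**3, directory="."))
```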

If the password is incorrect, this utility will create zero-sized files in the target S3 bucket.
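Because a wrong password shows up only as zero-sized output, it can help to scan the target location after the job has run. The sketch below assumes an object listing shaped like the `Contents` entries returned by boto3's `list_objects_v2` (each with a `Key` and a `Size`); the sample listing is made up, and `zero_byte_keys` is a hypothetical helper.

```python
def zero_byte_keys(objects):
    """Return keys of zero-sized objects, ignoring folder placeholders ending in '/'."""
    return [
        obj["Key"]
        for obj in objects
        if obj["Size"] == 0 and not obj["Key"].endswith("/")
    ]


# Made-up listing for illustration.
listing = [
    {"Key": "landing/a.csv", "Size": 0},
    {"Key": "landing/b.csv", "Size": 1024},
    {"Key": "landing/sub/", "Size": 0},
]
print(zero_byte_keys(listing))
```

A zip file can legitimately contain empty files, so a non-empty result is a prompt to recheck the password rather than proof that it was wrong.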

Single spaces in filenames or paths are fine, but this shared job cannot handle zip files containing files with multiple spaces in their names.
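A zip file can be screened for problem names before it is uploaded. This sketch assumes that "multiple spaces" means a run of two or more consecutive spaces, which is an interpretation rather than something stated above. Listing member names with Python's `zipfile` module normally does not require the password. `members_with_repeated_spaces` is a hypothetical helper.

```python
import io
import re
import zipfile


def members_with_repeated_spaces(zip_source) -> list:
    """Return member names containing two or more consecutive spaces.

    `zip_source` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(zip_source) as archive:
        return [name for name in archive.namelist() if re.search(r" {2,}", name)]


# Build a small in-memory zip file to demonstrate the check.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("monthly report.csv", "ok")
    archive.writestr("monthly  report.csv", "double space")
buffer.seek(0)
print(members_with_repeated_spaces(buffer))
```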

Prerequisites

The aws command line utility must be installed on your Matillion ETL instance. If the shared job fails with an error such as "line X: aws: command not found", please follow this guide to installing the aws command.
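Whether the aws command is on the search path can be checked with a few lines of Python, for example from a Python Script component. This is a minimal sketch; `command_available` is a hypothetical helper, and the result reflects the PATH of whichever user runs the script.

```python
import shutil


def command_available(name: str = "aws") -> bool:
    """Return True if an executable called `name` is found on the PATH."""
    return shutil.which(name) is not None


if command_available():
    print("aws command found")
else:
    print("aws command not found; install the AWS CLI first")
```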

This shared job reads from and writes to S3. Ensure that the EC2 instance credentials attached to your Matillion ETL instance grant permission to do both. For more information, refer to the “IAM in AWS” section in this article on RBAC in the Cloud.
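As a starting point, an IAM policy granting read access to the source object and write access to the target location might look like the output of the sketch below. The exact S3 actions the shared job needs are not listed here, so `s3:GetObject` and `s3:PutObject` are an assumption about the minimum required for a download and an upload. The bucket names, key, and prefix are hypothetical, as is the helper `s3_policy`.

```python
import json


def s3_policy(source_bucket, source_key, target_bucket, target_prefix=""):
    """Build an IAM policy document (assumed minimum permissions)."""
    prefix = target_prefix.strip("/")
    target_path = f"{prefix}/*" if prefix else "*"
    return {
        "Version": "2012-10-17",
        "Statement": [
            {   # Read the source zip file.
                "Effect": "Allow",
                "Action": ["s3:GetObject"],
                "Resource": f"arn:aws:s3:::{source_bucket}/{source_key}",
            },
            {   # Write the extracted files.
                "Effect": "Allow",
                "Action": ["s3:PutObject"],
                "Resource": f"arn:aws:s3:::{target_bucket}/{target_path}",
            },
        ],
    }


print(json.dumps(s3_policy("my-source-bucket", "incoming/data.zip",
                           "my-target-bucket", "landing/unzipped"), indent=2))
```

Your environment may need further actions, such as `s3:ListBucket`, depending on how the buckets are configured.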


Downloads

Licensed under: Matillion Free Subscription License

Installation Instructions

How to Install a Matillion ETL Shared Job