{"objects":[{"metadata":{"identifier":{"packageName":"Matillion Exchange","name":"MongoDB Incremental Load","revision":1,"type":"DYNAMIC"},"rootJobReference":{"name":"MongoDB - 1 - Iterate Objects","type":"ORCHESTRATION","parameterMetadata":[{"slot":2,"variableName":"user_name","variableType":"SCALAR","displayName":"Username","description":"The username to be supplied to connect to the source.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":false},{"slot":3,"variableName":"password_manager_entry","variableType":"SCALAR","displayName":"Password Manager Entry","description":"The name in the Matillion Password Manager for the password relating to the authentication method.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":false},{"slot":4,"variableName":"connection_url","variableType":"SCALAR","displayName":"Server","description":"The server IP or DNS address of the MongoDB server endpoint.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":true},{"slot":5,"variableName":"database_name","variableType":"SCALAR","displayName":"Database","description":"The Database name on the MongoDB server.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":true},{"slot":6,"variableName":"flatten_objects","variableType":"SCALAR","displayName":"Flatten Objects","description":"Yes: Nested document structures are flattened into a set of fields. Determining which fields are available can become expensive in this mode, since more data needs to be scanned in order to determine which fields are available.\nNo: Nested document structures are returns as JSON strings. They can be further queried/manipulated by JSON functions in a transformation job after being staged","defaultValue":[{"values":{"1":"Yes"}}],"defaultValueType":"TEXT","required":true},{"slot":7,"variableName":"flatten_arrays","variableType":"SCALAR","displayName":"Flatten Arrays","description":"The maximum number of elements that any array can be flattened to. Flattened arrays have each element placed into its own respective (newly created) column.\nEntering 0 for this property will ensure all arrays remain in JSON string format.\nEntering -1 for this property will ensure all elements from arrays are flattened.\nRequesting to flatten more elements than exist in an array will result in all elements of that array being flattened.\n\nDetermining which fields are available can become expensive in this mode, since more data needs to be scanned in order to determine which fields are available.","defaultValue":[{"values":{"1":"0"}}],"defaultValueType":"TEXT","required":true},{"slot":8,"variableName":"source_list","variableType":"GRID","displayName":"Tables And Columns","description":"Contains the list of tables and columns (and in some cases an incremental_column to specify which column the load should be incremented on [accepts values of 0 or 1]) to be processed. ","defaultValue":[{"values":{"1":""}}],"defaultValueType":null,"required":true},{"slot":9,"variableName":"advanced_connection_options","variableType":"GRID","displayName":"Connection Options","description":"A list of values and parameters. Parameters and their allowed values are database/driver specific. Referring to the data model will provide insight of what you could provide here.\n
\nThey are usually not required as sensible defaults are assumed.","defaultValue":[{"values":{"1":""}}],"defaultValueType":null,"required":false},{"slot":10,"variableName":"load_concurrent","variableType":"SCALAR","displayName":"Load Type","description":"Sequential - Iterations are done in sequence, waiting for each to complete before starting the next. \nThis is the default.\n
\nConcurrent - Iterations are run concurrently. This requires all \"Variables to Iterate\" to be defined as\nCopied variables, so that each iteration gets its own copy of the variable isolated from the same\nvariable being used by other concurrent executions.\n
Note: The maximum concurrency is limited by the number of available threads (2x the number of virtual cpus on your cloud instance).","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":true},{"slot":11,"variableName":"cloud_storage_area","variableType":"SCALAR","displayName":"Cloud Storage Staging Area","description":"The URL and path of the target Google Storage bucket to be used for staging the queried data throughout the job.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":true},{"slot":12,"variableName":"stage_project","variableType":"SCALAR","displayName":"Stage Project","description":"The project name where the staging data will be stored.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":true},{"slot":13,"variableName":"stage_dataset","variableType":"SCALAR","displayName":"Stage Dataset","description":"The dataset name where the staging data will be stored.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":true},{"slot":14,"variableName":"stage_prefix","variableType":"SCALAR","displayName":"Stage Prefix","description":"A prefix value that will be added to the start of the stage table names.\n
\ne.g. If a Stage Prefix of 'stage_' is specified and the table being processed is named 'test_data' then the target table will be named 'stage_test_data'.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":true},{"slot":15,"variableName":"target_project","variableType":"SCALAR","displayName":"Target Project","description":"The project name where the target data will be stored.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":true},{"slot":16,"variableName":"target_dataset","variableType":"SCALAR","displayName":"Target Dataset","description":"The dataset name where the target data will be stored.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":true},{"slot":17,"variableName":"target_prefix","variableType":"SCALAR","displayName":"Target Prefix","description":"A prefix value that will be added to the start of the target table names.\n
\ne.g. If a Target Prefix of 'target_' is specified and the table being processed is named 'test_data' then the target table will be named 'target_test_data'.","defaultValue":[{"values":{"1":""}}],"defaultValueType":"TEXT","required":false},{"slot":18,"variableName":"log_metrics","variableType":"SCALAR","displayName":"Log Metrics","description":"Option to switch off Metrics logging in the full product - default is TRUE which records the metrics for the batch run. ","defaultValue":[{"values":{"1":"TRUE"}}],"defaultValueType":"TEXT","required":false}]},"orchestrationJobs":["MongoDB - 1 - Iterate Objects","MongoDB - 4 - Query Source","MongoDB - 3 - Stage Object","MongoDB - 2 - Stage Object"],"transformationJobs":[],"imageMetadata":{"componentIcon":"iVBORw0KGgoAAAANSUhEUgAAACAAAAAgCAYAAABzenr0AAACjUlEQVR42mNgGAUUgKAl/iEDY/N/BsbIVf7tAYt9P/nO9JWju/0Rq/0nRqwK+B+4xPe/5zzvK0BHiNDN8sjV/lkgy+EOmOsFwicc5jtw0Nzy0FV+6hGr/L9jccB/r7leU2kf9Kv8t8AsB+GgJX5wBwDxP4+5HjY0szx8RZARsuVgByxFcQAQex6joe8DZhJ2ABDP8bSmuuUNDQ1MQAtfEeMAYFqYQ3UHRK/y10W3HGcIzPV6DXIwdeN/tX8cCQ747zPXR5/KBU9APSkO8JznGU6Tkg8dB+NywBzPHOpGwcqASVgdsAy7A7zmeOVT1wGrAutIcYDHXK8EkL6ysjLeysrKIGqUgCGkOcDDBmr5kYqKii4q1AGhokAL/xLpgJ9ui9y4gZZvBuL/QPwHiB8A8aTS0lIJCqIhYAcxiRBYEG0Bqa+qqjICWvoOSK8BhkIWkH0JiF8A+TpklgUBbsRkw5TOFHiQgxwBtLwSWpqyAR2wHsi/kZaWxkpufbAN2QHI1TEIh0wJ/QUKcqAlgkjFOAuMDZQTBuKvQPlg8hyw3F8WaPFbmANCFwcgO+BHbnPuIqAFF/GZAZTfTVHCjFwV6AS0/CfIAWmLoxEpf453PNDgcqAFZwk4YDsQ91LWLFvp7wtqGbUvKQRZ/hdoeRZIHOgAN6Dhv4BxL46jVgWlg/dAddEUZ824VaGWSxY3/2tdUBQHz66hocxAy6+BUj6IDU2ICcgpH+iAZKBDuKhSQi5b3Hpt0aJubmSx8vJyfaAlb4G+PAoqBUHBDcSvyM5++MDSxa3bsInX1NQoAi1dDsTfoIXRf2jaYKSqA5Ytaq3cvx+RzbDEOQvQ4kYgfgx0lPKAdKKAQR8xYJYPCQAAhx5xDiQtIkgAAAAASUVORK5CYII=","smallIcon":"iVBORw0KGgoAAAANSUhEUgAAABAAAAAQCAYAAAAf8/9hAAABL0lEQVR42mNgoCUIWepb4jnJk50szRGrA+pDlvn/95zjtaahoYGJNM1r/M0jVgT8C4UYAMTeJaQZsCpgc8TKgP8wAzzmeH10menCT5Tm0FWhPEDbf4ENWA5zAQh7JhHvfKBmDANmey0kyoDw1QGe2AwAemM7cQasCDLF6oI5nsurq6stKisrzfEakDYzjRWo+S2GC2Z75VVUVGwF4r1APLOkpIQbbxpAjgXvWT7/CmuL08rKyniBrpCvqqoyKi8v78RpgOc2T3agAaeilwWDDYjpi5kDtDUDWQ2QvxivV+LWBQo3LMs6EzYvsBCYEoWAfl8KFGasqalRBtFAfh7BAF26qC151apVzCA20Mn+QLwWiO8BNdsTFSOrFnXI/f//nxFJiBFoQEVhYSEn1XIsAIy/tTcU2bOmAAAAAElFTkSuQmCC"},"description":"","helpHtml":"
Property | Type | Description |
---|---|---|
Username | Variable | The username to be supplied to connect to the source. |
Password Manager Entry | Variable | The name in the Matillion Password Manager for the password relating to the authentication method. |
Server | Variable | The server IP or DNS address of the MongoDB server endpoint. |
Database | Variable | The Database name on the MongoDB server. |
Flatten Objects | Variable | Yes: Nested document structures are flattened into a set of fields. Determining which fields are available can become expensive in this mode, since more data needs to be scanned in order to determine which fields are available.\nNo: Nested document structures are returns as JSON strings. They can be further queried/manipulated by JSON functions in a transformation job after being staged |
Flatten Arrays | Variable | The maximum number of elements that any array can be flattened to. Flattened arrays have each element placed into its own respective (newly created) column.\nEntering 0 for this property will ensure all arrays remain in JSON string format.\nEntering -1 for this property will ensure all elements from arrays are flattened.\nRequesting to flatten more elements than exist in an array will result in all elements of that array being flattened.\n\nDetermining which fields are available can become expensive in this mode, since more data needs to be scanned in order to determine which fields are available. |
Tables And Columns | Grid | Contains the list of tables and columns (and in some cases an incremental_column to specify which column the load should be incremented on [accepts values of 0 or 1]) to be processed. |
Connection Options | Grid | A list of values and parameters. Parameters and their allowed values are database/driver specific. Referring to the data model will provide insight of what you could provide here.\n \nThey are usually not required as sensible defaults are assumed. |
Load Type | Variable | Sequential - Iterations are done in sequence, waiting for each to complete before starting the next. \nThis is the default.\n \nConcurrent - Iterations are run concurrently. This requires all \"Variables to Iterate\" to be defined as\nCopied variables, so that each iteration gets its own copy of the variable isolated from the same\nvariable being used by other concurrent executions.\n Note: The maximum concurrency is limited by the number of available threads (2x the number of virtual cpus on your cloud instance). |
Cloud Storage Staging Area | Variable | The URL and path of the target Google Storage bucket to be used for staging the queried data throughout the job. |
Stage Project | Variable | The project name where the staging data will be stored. |
Stage Dataset | Variable | The dataset name where the staging data will be stored. |
Stage Prefix | Variable | A prefix value that will be added to the start of the stage table names.\n \ne.g. If a Stage Prefix of 'stage_' is specified and the table being processed is named 'test_data' then the target table will be named 'stage_test_data'. |
Target Project | Variable | The project name where the target data will be stored. |
Target Dataset | Variable | The dataset name where the target data will be stored. |
Target Prefix | Variable | A prefix value that will be added to the start of the target table names.\n \ne.g. If a Target Prefix of 'target_' is specified and the table being processed is named 'test_data' then the target table will be named 'target_test_data'. |
Log Metrics | Variable | Option to switch off Metrics logging in the full product - default is TRUE which records the metrics for the batch run. |