Meltano Cloud Connect

Meltano Cloud data into your data warehouse in minutes

Collect Meltano Cloud data into your data warehouse or ours. The Matatika pipelines will take care of the data collection and preparation for your analytics and BI tools.

Automate Meltano Cloud from a single space with no code

Meltano Cloud is a SaaS data platform that lets teams build, run, and monitor data pipelines entirely in the cloud. It provides managed infrastructure for running Meltano projects, with workspaces that organise pipelines, data components, data stores, datasets, channels, and job history all in one place.

The tap-meltano-cloud extractor connects to the Meltano Cloud API to extract metadata about your Meltano Cloud organisation — giving you full visibility of your pipeline configurations, execution history, and data assets in your data warehouse.

Prerequisites

Obtain an API token from the Meltano Cloud UI and set it as the Auth Token setting.

Available Streams

workspaces

All workspaces accessible to the authenticated user. All other streams are scoped to each workspace and synced as children.

Field Description
id Unique identifier of the workspace
name Display name of the workspace
alias URL-friendly alias for the workspace
status Workspace status
managed Whether the workspace is managed by Meltano Cloud
defaultEnvironment Default Meltano environment name
pipelinesImage Docker image used to run pipelines
repositoryUrl Git repository URL linked to the workspace
repositoryBranch Git branch used for the workspace
repositoryDirectory Subdirectory within the repository
githubInstallationId GitHub App installation ID
domains Allowed email domains for the workspace
imageUrl Workspace logo/image URL
appProperties Additional application-level properties object
created Timestamp when the workspace was created
lastModified Timestamp when the workspace was last modified

pipelines

All pipelines within each workspace.

Field Description
id Unique identifier of the pipeline
workspaceId ID of the parent workspace
name Internal pipeline name
label Human-readable display label
status Pipeline status (e.g. ACTIVE, INACTIVE)
schedule Cron or interval schedule expression
scheduleFromInterval Whether the schedule was derived from an interval
timeout Execution timeout in seconds
maxRetries Maximum number of retry attempts on failure
script Pipeline script content
dataComponents List of data component references used by the pipeline
actions Available pipeline actions
triggeredBy Trigger source for the pipeline
repositoryPath Path to the pipeline definition in the workspace repository
created Timestamp when the pipeline was created
lastModified Timestamp when the pipeline was last modified

jobs

Execution history for all pipelines within each workspace. Each record represents a single pipeline run attempt.

Field Description
id Unique identifier of the job
workspaceId ID of the parent workspace
type Job type (e.g. PIPELINE_RUN)
status Job status (e.g. RUNNING, SUCCESS, FAILED)
triggeredBy What or who triggered the job
attempt Current attempt number
maxAttempts Maximum number of execution attempts allowed
exitCode Process exit code
commitId Git commit ID at time of execution
startTime Timestamp when the job started
endTime Timestamp when the job ended
created Timestamp when the job record was created

datacomponents

Data component configurations (extractors and mappers) within each workspace.

Field Description
id Unique identifier of the data component
workspaceId ID of the parent workspace
name Data component name
dataPlugin Name of the underlying Singer plugin
dataPluginType Plugin type (e.g. extractor, mapper)
dataPluginLogoUrl Logo URL for the plugin
commands Available plugin commands
streams Singer streams exposed by this component
draft Whether this is a draft (unsaved) configuration
managed Whether this component is managed by Meltano Cloud
fromDataPlugin Whether this was created from a plugin template
repositoryPath Path to the component definition in the workspace repository
created Timestamp when the component was created
lastModified Timestamp when the component was last modified

datastores

Data store configurations (loaders) within each workspace.

Field Description
id Unique identifier of the data store
workspaceId ID of the parent workspace
name Data store name
dataPlugin Name of the underlying Singer plugin
dataPluginType Plugin type (e.g. loader)
dataPluginLogoUrl Logo URL for the plugin
logoUrl Logo URL for this data store instance
commands Available plugin commands
streams Singer streams consumed by this store
workspace Parent workspace reference object
draft Whether this is a draft (unsaved) configuration
managed Whether this store is managed by Meltano Cloud
fromDataPlugin Whether this was created from a plugin template
repositoryPath Path to the store definition in the workspace repository
created Timestamp when the store was created
lastModified Timestamp when the store was last modified

datasets

Published datasets within each workspace.

Field Description
id Unique identifier of the dataset
workspaceId ID of the parent workspace
title Display title of the dataset
alias URL-friendly alias
description Dataset description text
descriptionContentType MIME type of the description (e.g. text/markdown)
published Whether the dataset is publicly published
source Data source identifier
query SQL or query definition
rawData Raw data payload
visualisation Visualisation configuration object
metadata Additional metadata object
questions Associated question prompts
likeCount Number of likes
likedByProfiles Profiles that have liked this dataset
commentCount Number of comments
viewCount Number of views
score Relevance or quality score
repositoryPath Path to the dataset definition in the workspace repository
created Timestamp when the dataset was created

channels

Channels (curated dataset collections) within each workspace.

Field Description
id Unique identifier of the channel
workspaceId ID of the parent workspace
name Channel name
description Channel description
scope Visibility scope (e.g. PUBLIC, PRIVATE)
type Channel type
creator Profile of the user who created the channel
picture Channel picture URL
datasetCount Number of datasets in the channel
containsDataset Whether a queried dataset is present in the channel
repositoryPath Path to the channel definition in the workspace repository
created Timestamp when the channel was created
lastModified Timestamp when the channel was last modified

Settings

Auth Token

Temporary Auth Token from the Meltano Cloud UI (leave blank to use the pipeline default AUTH_TOKEN when deployed in Meltano Cloud).

Workspace IDs

Optional list of workspace IDs to sync. When set, only the specified workspaces are fetched directly by ID, bypassing the workspaces list endpoint. When omitted, all workspaces accessible to the authenticated user are discovered and synced automatically.


View source code

Meltano Cloud data you can trust

Extract, Transform, and Load Meltano Cloud data into your data warehouse or ours.

Interested in learning more?

Get in touch