Collect Meltano Cloud data into your data warehouse or ours. The Matatika pipelines will take care of the data collection and preparation for your analytics and BI tools.
Meltano Cloud is a SaaS data platform that lets teams build, run, and monitor data pipelines entirely in the cloud. It provides managed infrastructure for running Meltano projects, with workspaces that organise pipelines, data components, data stores, datasets, channels, and job history all in one place.
The tap-meltano-cloud extractor connects to the Meltano Cloud API to extract metadata about your Meltano Cloud organisation — giving you full visibility of your pipeline configurations, execution history, and data assets in your data warehouse.
Obtain an API token from the Meltano Cloud UI and set it as the Auth Token setting.
All workspaces accessible to the authenticated user. All other streams are scoped to each workspace and synced as children.
| Field | Description |
|---|---|
| id | Unique identifier of the workspace |
| name | Display name of the workspace |
| alias | URL-friendly alias for the workspace |
| status | Workspace status |
| managed | Whether the workspace is managed by Meltano Cloud |
| defaultEnvironment | Default Meltano environment name |
| pipelinesImage | Docker image used to run pipelines |
| repositoryUrl | Git repository URL linked to the workspace |
| repositoryBranch | Git branch used for the workspace |
| repositoryDirectory | Subdirectory within the repository |
| githubInstallationId | GitHub App installation ID |
| domains | Allowed email domains for the workspace |
| imageUrl | Workspace logo/image URL |
| appProperties | Additional application-level properties object |
| created | Timestamp when the workspace was created |
| lastModified | Timestamp when the workspace was last modified |
All pipelines within each workspace.
| Field | Description |
|---|---|
| id | Unique identifier of the pipeline |
| workspaceId | ID of the parent workspace |
| name | Internal pipeline name |
| label | Human-readable display label |
| status | Pipeline status (e.g. ACTIVE, INACTIVE) |
| schedule | Cron or interval schedule expression |
| scheduleFromInterval | Whether the schedule was derived from an interval |
| timeout | Execution timeout in seconds |
| maxRetries | Maximum number of retry attempts on failure |
| script | Pipeline script content |
| dataComponents | List of data component references used by the pipeline |
| actions | Available pipeline actions |
| triggeredBy | Trigger source for the pipeline |
| repositoryPath | Path to the pipeline definition in the workspace repository |
| created | Timestamp when the pipeline was created |
| lastModified | Timestamp when the pipeline was last modified |
Execution history for all pipelines within each workspace. Each record represents a single pipeline run attempt.
| Field | Description |
|---|---|
| id | Unique identifier of the job |
| workspaceId | ID of the parent workspace |
| type | Job type (e.g. PIPELINE_RUN) |
| status | Job status (e.g. RUNNING, SUCCESS, FAILED) |
| triggeredBy | What or who triggered the job |
| attempt | Current attempt number |
| maxAttempts | Maximum number of execution attempts allowed |
| exitCode | Process exit code |
| commitId | Git commit ID at time of execution |
| startTime | Timestamp when the job started |
| endTime | Timestamp when the job ended |
| created | Timestamp when the job record was created |
Data component configurations (extractors and mappers) within each workspace.
| Field | Description |
|---|---|
| id | Unique identifier of the data component |
| workspaceId | ID of the parent workspace |
| name | Data component name |
| dataPlugin | Name of the underlying Singer plugin |
| dataPluginType | Plugin type (e.g. extractor, mapper) |
| dataPluginLogoUrl | Logo URL for the plugin |
| commands | Available plugin commands |
| streams | Singer streams exposed by this component |
| draft | Whether this is a draft (unsaved) configuration |
| managed | Whether this component is managed by Meltano Cloud |
| fromDataPlugin | Whether this was created from a plugin template |
| repositoryPath | Path to the component definition in the workspace repository |
| created | Timestamp when the component was created |
| lastModified | Timestamp when the component was last modified |
Data store configurations (loaders) within each workspace.
| Field | Description |
|---|---|
| id | Unique identifier of the data store |
| workspaceId | ID of the parent workspace |
| name | Data store name |
| dataPlugin | Name of the underlying Singer plugin |
| dataPluginType | Plugin type (e.g. loader) |
| dataPluginLogoUrl | Logo URL for the plugin |
| logoUrl | Logo URL for this data store instance |
| commands | Available plugin commands |
| streams | Singer streams consumed by this store |
| workspace | Parent workspace reference object |
| draft | Whether this is a draft (unsaved) configuration |
| managed | Whether this store is managed by Meltano Cloud |
| fromDataPlugin | Whether this was created from a plugin template |
| repositoryPath | Path to the store definition in the workspace repository |
| created | Timestamp when the store was created |
| lastModified | Timestamp when the store was last modified |
Published datasets within each workspace.
| Field | Description |
|---|---|
| id | Unique identifier of the dataset |
| workspaceId | ID of the parent workspace |
| title | Display title of the dataset |
| alias | URL-friendly alias |
| description | Dataset description text |
| descriptionContentType | MIME type of the description (e.g. text/markdown) |
| published | Whether the dataset is publicly published |
| source | Data source identifier |
| query | SQL or query definition |
| rawData | Raw data payload |
| visualisation | Visualisation configuration object |
| metadata | Additional metadata object |
| questions | Associated question prompts |
| likeCount | Number of likes |
| likedByProfiles | Profiles that have liked this dataset |
| commentCount | Number of comments |
| viewCount | Number of views |
| score | Relevance or quality score |
| repositoryPath | Path to the dataset definition in the workspace repository |
| created | Timestamp when the dataset was created |
Channels (curated dataset collections) within each workspace.
| Field | Description |
|---|---|
| id | Unique identifier of the channel |
| workspaceId | ID of the parent workspace |
| name | Channel name |
| description | Channel description |
| scope | Visibility scope (e.g. PUBLIC, PRIVATE) |
| type | Channel type |
| creator | Profile of the user who created the channel |
| picture | Channel picture URL |
| datasetCount | Number of datasets in the channel |
| containsDataset | Whether a queried dataset is present in the channel |
| repositoryPath | Path to the channel definition in the workspace repository |
| created | Timestamp when the channel was created |
| lastModified | Timestamp when the channel was last modified |
Temporary Auth Token from the Meltano Cloud UI (leave blank to use the pipeline default AUTH_TOKEN when deployed in Meltano Cloud).
Optional list of workspace IDs to sync. When set, only the specified workspaces are fetched directly by ID, bypassing the workspaces list endpoint. When omitted, all workspaces accessible to the authenticated user are discovered and synced automatically.
Extract, Transform, and Load Meltano Cloud data into your data warehouse or ours.