Dagster exposes a GraphQL API that allows clients to interact with Dagster programmatically. The API allows users to:
The GraphQL API is still evolving and is subject to breaking changes. A large portion of the API is primarily for internal use by Dagit. For any of the queries below, we will be clear about breaking changes in release notes.
The GraphQL API is served alongside the Dagit web server. To start the server, run dagit
in your Dagster workspace.
dagit
Dagit serves the GraphQL endpoint at the /graphql
endpoint. If you are running Dagit locally on port 3000, you can access the API at localhost:3000/graphql
You can access the GraphQL Playground by navigating to the /graphql
route in your browser. The GraphQL Playground contains the full GraphQL Schema and an interactive playground to write and test queries and mutations.
Clicking on the Docs
tab on the right edge of the playground opens up interactive documentation for the GraphQL API. The interactive documentation is the best way to explore the API and get information about which fields are available on the queries and mutations.
Dagster also provides a Python Client
To retrieve a list of all pipeline runs, use the pipelineRunsOrError
query.
query PaginatedPipelineRuns {
pipelineRunsOrError {
__typename
... on PipelineRuns {
results {
runId
pipelineName
status
runConfigYaml
stats {
... on PipelineRunStatsSnapshot {
startTime
endTime
stepsFailed
}
}
}
}
}
}
You may eventually accumulate too many runs to return in one query. The pipelineRunsOrError
query takes in optional cursor
and limit
arguments for pagination:
query PaginatedPipelineRuns {
pipelineRunsOrError(
cursor: "7fd2e5ef-5591-43db-be15-1ebbbbed8bb5"
limit: 10
) {
__typename
... on PipelineRuns {
results {
runId
pipelineName
status
runConfigYaml
stats {
... on PipelineRunStatsSnapshot {
startTime
endTime
stepsFailed
}
}
}
}
}
}
The pipelineRunsOrError
query also takes in an optional filter argument, of type PipelineRunsFilter
. This query allows you to filter runs by:
For example, the following query will return all failed runs:
query FilteredPipelineRuns {
pipelineRunsOrError(filter: { status: FAILURE }) {
__typename
... on PipelineRuns {
results {
runId
pipelineName
status
runConfigYaml
stats {
... on PipelineRunStatsSnapshot {
startTime
endTime
stepsFailed
}
}
}
}
}
}
This query returns the names and location names of all the repositories currently loaded.
query RepositoriesQuery {
repositoriesOrError {
... on RepositoryConnection {
nodes {
name
location {
name
}
}
}
}
}
Given a repository, this query returns the names of all the pipelines in the repository.
This query takes a selector
, which is of type RepositorySelector
. A repository selector consists of both the repository location name and repository name.
query PipelinesQuery(
$repositoryLocationName: String!
$repositoryName: String!
) {
repositoryOrError(
repositorySelector: {
repositoryLocationName: $repositoryLocationName
repositoryName: $repositoryName
}
) {
... on Repository {
pipelines {
name
}
}
}
}
To execute a pipeline, we can use the launchPipelineExecution
mutation. Here, we define ExecutePipeline
to wrap our mutation and allow us to pass in the required arguments as query variables.
For this query, the required arguments are:
selector
:
PipelineSelector
, and consists of the repository location name, repository name, and pipeline namesolidSelection
argument here, to execute a subset of the pipelinerunConfigData
:
runConfigData
is of type RunConfigData
. This type is used when passing in an arbitrary object for run config. This is any-typed in the GraphQL type system but must conform to the constraints of the config schema for this pipeline. If it does not, the mutation returns a PipelineConfigValidationInvalid
responsemode
:
default
mutation ExecutePipeline(
$repositoryLocationName: String!
$repositoryName: String!
$pipelineName: String!
$runConfigData: RunConfigData!
$mode: String!
) {
launchPipelineExecution(
executionParams: {
selector: {
repositoryLocationName: $repositoryLocationName
repositoryName: $repositoryName
pipelineName: $pipelineName
}
runConfigData: $runConfigData
mode: $mode
}
) {
__typename
... on LaunchPipelineRunSuccess {
run {
runId
}
}
... on PipelineConfigValidationInvalid {
errors {
message
reason
}
}
... on PythonError {
message
}
}
}
If you want to use a preset instead of defining the run config, use the preset
argument instead.:
mutation ExecutePipeline(
$repositoryLocationName: String!
$repositoryName: String!
$pipelineName: String!
$presetName: String!
) {
launchPipelineExecution(
executionParams: {
selector: {
repositoryLocationName: $repositoryLocationName
repositoryName: $repositoryName
pipelineName: $pipelineName
}
preset: $presetName
}
) {
__typename
... on LaunchPipelineRunSuccess {
run {
runId
}
}
}
}