v1.51.0
🚀 Features
Support conditional coprocessor execution per stage of request lifecycle (PR #5557)
The router now supports conditional execution of the coprocessor for each stage of the request lifecycle (except for the Execution
stage).
To configure, define conditions for a specific stage by using selectors based on headers or context entries. For example, based on a supergraph response you can configure the coprocessor not to execute for any subscription:
coprocessor:
url: http://127.0.0.1:3000 # mandatory URL which is the address of the coprocessor
timeout: 2s # optional timeout (2 seconds in this example). If not set, defaults to 1 second
supergraph:
response:
condition:
not:
eq:
- subscription
- operation_kind: string
body: true
To learn more, see the documentation about coprocessor conditions.
Add option to deactivate introspection response caching (PR #5583)
The router now supports an option to deactivate introspection response caching. Because the router caches responses as introspection happens in the query planner, cached introspection responses may consume too much of the distributed cache or fill it up. Setting this option prevents introspection responses from filling up the router's distributed cache.
To deactivate introspection caching, set supergraph.query_planning.legacy_introspection_caching
to false
:
supergraph:
query_planning:
legacy_introspection_caching: false
Add 'subgraph_on_graphql_error' selector for subgraph (PR #5622)
The router now supports the subgraph_on_graphql_error
selector for the subgraph service, which it already supported for the router and supergraph services. Subgraph service support enables easier detection of GraphQL errors in response bodies of subgraph requests.
An example configuration with subgraph_on_graphql_error
configured:
telemetry:
instrumentation:
instruments:
subgraph:
http.client.request.duration:
attributes:
subgraph.graphql.errors: # attribute containing a boolean set to true if response.errors is not empty
subgraph_on_graphql_error: true
🐛 Fixes
Add response_context
in event selector for event_*
instruments (PR #5565)
The router now supports creating custom instruments with a value set to event_*
and using both a condition executed on an event and the response_context
selector in attributes. Previous releases didn't support the response_context
selector in attributes.
An example configuration:
telemetry:
instrumentation:
instruments:
supergraph:
sf.graphql_router.errors:
value: event_unit
type: counter
unit: count
description: "graphql errors handled by the apollo router"
condition:
eq:
- true
- on_graphql_error: true
attributes:
"operation":
response_context: "operation_name" # This was not working before
Provide valid trace IDs for unsampled traces in Rhai scripts (PR #5606)
The traceid()
function in a Rhai script for the router now returns a valid trace ID for all traces.
Previously, traceid()
didn't return a trace ID if the trace wasn't selected for sampling.
Allow query batching and entity caching to work together (PR #5598)
The router now supports entity caching and subgraph batching to run simultaneously. Specifically, this change updates entity caching to ignore a subgraph request if the request is part of a batch.
Gracefully handle subgraph response with -1
values inside error locations (PR #5633)
This router now gracefully handles responses that contain invalid "-1
" positional values for error locations in queries by ignoring those invalid locations.
This change resolves the problem of GraphQL Java and GraphQL Kotlin using { "line": -1, "column": -1 }
values if they can't determine an error's location in a query, but the GraphQL specification requires both line
and column
to be positive numbers.
As an example, a subgraph can respond with invalid error locations:
{
"data": { "topProducts": null },
"errors": [{
"message":"Some error on subgraph",
"locations": [
{ "line": -1, "column": -1 },
],
"path":["topProducts"]
}]
}
With this change, the router returns a response that ignores the invalid locations:
{
"data": { "topProducts": null },
"errors": [{
"message":"Some error on subgraph",
"path":["topProducts"]
}]
}
By @IvanGoncharov in #5633
Return request timeout and rate limited error responses as structured errors (PR #5578)
The router now returns request timeout errors (408 Request Timeout
) and request rate limited errors (429 Too Many Requests
) as structured GraphQL errors (for example, {"errors": [...]}
). Previously, the router returned these as plaintext errors to clients.
Both types of errors are properly tracked in telemetry, including the apollo_router_graphql_error_total
metric.
By @IvanGoncharov in #5578
Fix span names and resource mapping for Datadog trace exporter (Issue #5282)
Note
This is an incremental improvement, but we expect more improvements in Router v1.52.0 after #5609 lands.
The router now uses static span names by default. This change fixes the user experience of the Datadog trace exporter when sending traces with Datadog native configuration.
The router has two ways of sending traces to Datadog:
- The OpenTelemetry for Datadog approach (which is the recommended method). This is identified by
otlp
in YAML configuration, and it is not impacted by this fix. - The "Datadog native" configuration. This is identified by the use of a
datadog:
key in YAML configuration.
This change fixes a bug in the latter approach that broke some Datadog experiences, such as the "Resources" section of the Datadog APM Service Catalog page.
We now use static span names by default, with resource mappings providing additional context when requested, which enables the desired behavior which was not possible before.
If for some reason you wish to maintain the existing behavior, you must either update your spans and resource mappings, or keep your spans and instead configure the router to use dynamic span names and disable resource mapping.
Enabling resource mapping and fixed span names is configured by the enable_span_mapping
and fixed_span_names
options:
telemetry:
exporters:
tracing:
datadog:
enabled: true
# Enables resource mapping, previously disabled by default, but now enabled.
enable_span_mapping: true
# Enables fixed span names, defaults to true.
fixed_span_names: true
instrumentation:
spans:
mode: spec_compliant
With enable_span_mapping
set to true
(now default), the following resource mappings are applied:
OpenTelemetry Span Name | Datadog Span Operation Name |
---|---|
request |
http.route |
router |
http.route |
supergraph |
graphql.operation.name |
query_planning |
graphql.operation.name |
subgraph |
subgraph.name |
subgraph_request |
graphql.operation.name |
http_request |
http.route |
You can override the default resource mappings by specifying the resource_mapping
configuration:
telemetry:
exporters:
tracing:
datadog:
enabled: true
resource_mapping:
# Use `my.span.attribute` as the resource name for the `router` span
router: "my.span.attribute"
To learn more, see the Datadog trace exporter documentation.
By @bnjjj and @BrynCooke in #5386
📚 Documentation
Update documentation for ignore_other_prefixes
(PR #5592)
Update JWT authentication documentation to clarify the behavior of the ignore_other_prefixes
configuration option.
By @andrewmcgivery in #5592