* feat(telemetry): define HTTP OTLP endpoint behavior and signal routing - Add resolveHttpOtlpUrl() that appends /v1/traces, /v1/logs, /v1/metrics to base HTTP OTLP endpoints per the OpenTelemetry specification - Add per-signal endpoint overrides (otlpTracesEndpoint, otlpLogsEndpoint, otlpMetricsEndpoint) for backends with non-standard paths (e.g. Alibaba Cloud) - Add LogToSpanProcessor that bridges OTel log records to spans for traces-only backends, with session-based traceId correlation and error status propagation - Auto-wire LogToSpanProcessor when traces URL exists but logs URL doesn't - Validate per-signal URLs gracefully (log error + skip, don't crash) - Preserve query strings when appending signal paths to URLs - Guard gRPC branch against missing base endpoint with per-signal config - Update telemetry documentation with signal routing semantics and Alibaba Cloud HTTP per-signal endpoint examples Closes #3734 Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com> * fix(telemetry): fix TS noPropertyAccessFromIndexSignature errors in tests Use typed ExportedSpan interface and bracket notation for index signature properties to satisfy strict TypeScript checks in CI. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com> * fix(telemetry): replace MD5 with SHA-256 for traceId derivation CodeQL flagged MD5 as a weak cryptographic algorithm when used with session.id (considered sensitive data). Switch to SHA-256 truncated to 32 hex chars to satisfy CodeQL while maintaining the same traceId format required by the OTel specification. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com> * fix(telemetry): address review feedback for LogToSpanProcessor robustness - Wrap JSON.stringify in try/catch to handle circular refs and BigInt - Add export timeout (30s) and try/catch to prevent hung shutdown - Track in-flight exports to avoid interval-vs-shutdown race condition - Fix deriveSpanStatus: use truthy checks (!!), drop success===false heuristic since declined tool calls are normal, not errors - Enforce http(s) scheme in validateUrl to reject file:/javascript: URLs - Change DiagLogLevel from ERROR to WARN to preserve operational diagnostics - Preserve logRecord.instrumentationScope instead of hardcoding - Forward severityNumber/severityText as span attributes - Add tests for circular refs, error status edge cases, severity Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com> * fix(telemetry): flush sdk shutdown through cleanup Remove async process exit handlers from telemetry initialization and route SDK shutdown through Config cleanup so normal CLI exit paths await pending telemetry exports. Keep shutdown idempotent while an SDK shutdown is in flight. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com> * fix(telemetry): harden bridged log shutdown Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com> * fix(telemetry): address review follow-ups Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com> --------- Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
16 KiB
Observability with OpenTelemetry
Learn how to enable and setup OpenTelemetry for Qwen Code.
Key Benefits
- 🔍 Usage Analytics: Understand interaction patterns and feature adoption across your team
- ⚡ Performance Monitoring: Track response times, token consumption, and resource utilization
- 🐛 Real-time Debugging: Identify bottlenecks, failures, and error patterns as they occur
- 📊 Workflow Optimization: Make informed decisions to improve configurations and processes
- 🏢 Enterprise Governance: Monitor usage across teams, track costs, ensure compliance, and integrate with existing monitoring infrastructure
OpenTelemetry Integration
Built on OpenTelemetry — the vendor-neutral, industry-standard observability framework — Qwen Code's observability system provides:
- Universal Compatibility: Export to any OpenTelemetry backend (Aliyun, Jaeger, Prometheus, Datadog, etc.)
- Standardized Data: Use consistent formats and collection methods across your toolchain
- Future-Proof Integration: Connect with existing and future observability infrastructure
- No Vendor Lock-in: Switch between backends without changing your instrumentation
Configuration
Note
⚠️ Special Note: This feature requires corresponding code changes. This documentation is provided in advance; please refer to future code updates for actual functionality.
All telemetry behavior is controlled through your .qwen/settings.json file.
These settings can be overridden by environment variables or CLI flags.
| Setting | Environment Variable | CLI Flag | Description | Values | Default |
|---|---|---|---|---|---|
enabled |
QWEN_TELEMETRY_ENABLED |
--telemetry / --no-telemetry |
Enable or disable telemetry | true/false |
false |
target |
QWEN_TELEMETRY_TARGET |
--telemetry-target <local|gcp> |
Where to send telemetry data | "gcp"/"local" |
"local" |
otlpEndpoint |
QWEN_TELEMETRY_OTLP_ENDPOINT |
--telemetry-otlp-endpoint <URL> |
OTLP collector endpoint | URL string | http://localhost:4317 |
otlpProtocol |
QWEN_TELEMETRY_OTLP_PROTOCOL |
--telemetry-otlp-protocol <grpc|http> |
OTLP transport protocol | "grpc"/"http" |
"grpc" |
otlpTracesEndpoint |
QWEN_TELEMETRY_OTLP_TRACES_ENDPOINT |
- | Per-signal endpoint override for traces (HTTP only) | URL string | - |
otlpLogsEndpoint |
QWEN_TELEMETRY_OTLP_LOGS_ENDPOINT |
- | Per-signal endpoint override for logs (HTTP only) | URL string | - |
otlpMetricsEndpoint |
QWEN_TELEMETRY_OTLP_METRICS_ENDPOINT |
- | Per-signal endpoint override for metrics (HTTP only) | URL string | - |
outfile |
QWEN_TELEMETRY_OUTFILE |
--telemetry-outfile <path> |
Save telemetry to file (overrides otlpEndpoint) |
file path | - |
logPrompts |
QWEN_TELEMETRY_LOG_PROMPTS |
--telemetry-log-prompts / --no-telemetry-log-prompts |
Include prompts in telemetry logs | true/false |
true |
useCollector |
QWEN_TELEMETRY_USE_COLLECTOR |
- | Use external OTLP collector (advanced) | true/false |
false |
Note on boolean environment variables: For the boolean settings (enabled,
logPrompts, useCollector), setting the corresponding environment variable to
true or 1 will enable the feature. Any other value will disable it.
HTTP OTLP signal routing: When using HTTP protocol (otlpProtocol: "http"),
Qwen Code automatically appends signal-specific paths (/v1/traces, /v1/logs,
/v1/metrics) to the base otlpEndpoint. For example, http://collector:4318
becomes http://collector:4318/v1/traces for traces. If the URL already ends
with a signal path, it is used as-is. Per-signal endpoint overrides
(otlpTracesEndpoint, etc.) take precedence over the base endpoint and are used
verbatim. gRPC protocol uses service-based routing and does not append paths.
The per-signal endpoint environment variables also accept the standard
OpenTelemetry names: OTEL_EXPORTER_OTLP_TRACES_ENDPOINT,
OTEL_EXPORTER_OTLP_LOGS_ENDPOINT, OTEL_EXPORTER_OTLP_METRICS_ENDPOINT.
The QWEN_TELEMETRY_OTLP_* variants take precedence over the OTEL_* variants.
For detailed information about all configuration options, see the Configuration Guide.
Aliyun Telemetry
Manual OTLP Export
To view Qwen Code telemetry in Alibaba Cloud Managed Service for OpenTelemetry, configure Qwen Code to export to the OTLP endpoint provided by ARMS.
Setting "target": "gcp" alone does not configure the export
destination. If otlpEndpoint is not set, Qwen Code still defaults to
http://localhost:4317. If outfile is set, it overrides
otlpEndpoint and telemetry is written to the file instead of being
sent to Alibaba Cloud.
-
Enable telemetry in your
.qwen/settings.jsonand set the OTLP endpoint:Option A: gRPC protocol (standard OTLP endpoint):
{ "telemetry": { "enabled": true, "target": "gcp", "otlpEndpoint": "https://<your-otlp-endpoint>", "otlpProtocol": "grpc" } }Option B: HTTP protocol with per-signal endpoints (for backends that use non-standard paths, e.g.,
/api/otlp/tracesinstead of/v1/traces):{ "telemetry": { "enabled": true, "otlpProtocol": "http", "otlpTracesEndpoint": "http://<host>/<token>/api/otlp/traces", "otlpLogsEndpoint": "http://<host>/<token>/api/otlp/logs", "otlpMetricsEndpoint": "http://<host>/<token>/api/otlp/metrics" } }Note: When using HTTP protocol with only
otlpEndpoint(no per-signal overrides), Qwen Code appends standard OTLP paths (/v1/traces,/v1/logs,/v1/metrics) to the base URL. If your backend uses different paths, use per-signal endpoint overrides as shown in Option B. -
If your Alibaba Cloud endpoint requires authentication, provide OTLP headers through standard OpenTelemetry environment variables such as
OTEL_EXPORTER_OTLP_HEADERS(or the signal-specific variants). Qwen Code does not currently expose OTLP auth headers directly in.qwen/settings.json. -
Run Qwen Code and send prompts.
-
View telemetry in Managed Service for OpenTelemetry:
- Product overview: What is Managed Service for OpenTelemetry?
- Getting started: Get started with Managed Service for OpenTelemetry
- Console entry points:
- China mainland: trace.console.aliyun.com (legacy console: tracing.console.aliyun.com)
- International: arms.console.alibabacloud.com
- In the console, use
Applicationsto inspect traces and service topology. - To locate the OTLP endpoint and access information:
- New console (
trace.console.aliyun.comor international): navigate toIntegration Center. - Legacy console (
tracing.console.aliyun.com): navigate toCluster Configurations→Access point information.
- New console (
Local Telemetry
For local development and debugging, you can capture telemetry data locally:
File-based Output (Recommended)
- Enable telemetry in your
.qwen/settings.json:{ "telemetry": { "enabled": true, "target": "local", "otlpEndpoint": "", "outfile": ".qwen/telemetry.log" } } - Run Qwen Code and send prompts.
- View logs and metrics in the specified file (e.g.,
.qwen/telemetry.log).
Collector-Based Export (Advanced)
- Run the automation script:
This will:npm run telemetry -- --target=local- Download and start Jaeger and OTEL collector
- Configure your workspace for local telemetry
- Provide a Jaeger UI at http://localhost:16686
- Save logs/metrics to
~/.qwen/tmp/<projectHash>/otel/collector.log - Stop collector on exit (e.g.
Ctrl+C)
- Run Qwen Code and send prompts.
- View traces at http://localhost:16686 and logs/metrics in the collector log file.
Logs and Metrics
The following section describes the structure of logs and metrics generated for Qwen Code.
- A
sessionIdis included as a common attribute on all logs and metrics.
Logs
Logs are timestamped records of specific events. The following events are logged for Qwen Code:
-
qwen-code.config: This event occurs once at startup with the CLI's configuration.- Attributes:
model(string)sandbox_enabled(boolean)core_tools_enabled(string)approval_mode(string)file_filtering_respect_git_ignore(boolean)debug_mode(boolean)truncate_tool_output_threshold(number)truncate_tool_output_lines(number)hooks(string, comma-separated hook event types, omitted if hooks disabled)ide_enabled(boolean)interactive_shell_enabled(boolean)mcp_servers(string)output_format(string: "text" or "json")
- Attributes:
-
qwen-code.user_prompt: This event occurs when a user submits a prompt.- Attributes:
prompt_length(int)prompt_id(string)prompt(string, this attribute is excluded iflog_prompts_enabledis configured to befalse)auth_type(string)
- Attributes:
-
qwen-code.tool_call: This event occurs for each function call.- Attributes:
function_namefunction_argsduration_mssuccess(boolean)decision(string: "accept", "reject", "auto_accept", or "modify", if applicable)error(if applicable)error_type(if applicable)content_length(int, if applicable)metadata(if applicable, dictionary of string -> any)
- Attributes:
-
qwen-code.file_operation: This event occurs for each file operation.- Attributes:
tool_name(string)operation(string: "create", "read", "update")lines(int, if applicable)mimetype(string, if applicable)extension(string, if applicable)programming_language(string, if applicable)diff_stat(json string, if applicable): A JSON string with the following members:ai_added_lines(int)ai_removed_lines(int)user_added_lines(int)user_removed_lines(int)
- Attributes:
-
qwen-code.api_request: This event occurs when making a request to Qwen API.- Attributes:
modelrequest_text(if applicable)
- Attributes:
-
qwen-code.api_error: This event occurs if the API request fails.- Attributes:
modelerrorerror_typestatus_codeduration_msauth_type
- Attributes:
-
qwen-code.api_response: This event occurs upon receiving a response from Qwen API.- Attributes:
modelstatus_codeduration_mserror(optional)input_token_countoutput_token_countcached_content_token_countthoughts_token_countresponse_text(if applicable)auth_type
- Attributes:
-
qwen-code.tool_output_truncated: This event occurs when the output of a tool call is too large and gets truncated.- Attributes:
tool_name(string)original_content_length(int)truncated_content_length(int)threshold(int)lines(int)prompt_id(string)
- Attributes:
-
qwen-code.malformed_json_response: This event occurs when agenerateJsonresponse from Qwen API cannot be parsed as a json.- Attributes:
model
- Attributes:
-
qwen-code.flash_fallback: This event occurs when Qwen Code switches to flash as fallback.- Attributes:
auth_type
- Attributes:
-
qwen-code.slash_command: This event occurs when a user executes a slash command.- Attributes:
command(string)subcommand(string, if applicable)
- Attributes:
-
qwen-code.extension_enable: This event occurs when an extension is enabled -
qwen-code.extension_install: This event occurs when an extension is installed- Attributes:
extension_name(string)extension_version(string)extension_source(string)status(string)
- Attributes:
-
qwen-code.extension_uninstall: This event occurs when an extension is uninstalled
Metrics
Metrics are numerical measurements of behavior over time. The following metrics are collected for Qwen Code (metric names remain qwen-code.* for compatibility):
-
qwen-code.session.count(Counter, Int): Incremented once per CLI startup. -
qwen-code.tool.call.count(Counter, Int): Counts tool calls.- Attributes:
function_namesuccess(boolean)decision(string: "accept", "reject", or "modify", if applicable)tool_type(string: "mcp", or "native", if applicable)
- Attributes:
-
qwen-code.tool.call.latency(Histogram, ms): Measures tool call latency.- Attributes:
function_namedecision(string: "accept", "reject", or "modify", if applicable)
- Attributes:
-
qwen-code.api.request.count(Counter, Int): Counts all API requests.- Attributes:
modelstatus_codeerror_type(if applicable)
- Attributes:
-
qwen-code.api.request.latency(Histogram, ms): Measures API request latency.- Attributes:
model
- Attributes:
-
qwen-code.token.usage(Counter, Int): Counts the number of tokens used.- Attributes:
modeltype(string: "input", "output", "thought", or "cache")
- Attributes:
-
qwen-code.file.operation.count(Counter, Int): Counts file operations.- Attributes:
operation(string: "create", "read", "update"): The type of file operation.lines(Int, if applicable): Number of lines in the file.mimetype(string, if applicable): Mimetype of the file.extension(string, if applicable): File extension of the file.model_added_lines(Int, if applicable): Number of lines added/changed by the model.model_removed_lines(Int, if applicable): Number of lines removed/changed by the model.user_added_lines(Int, if applicable): Number of lines added/changed by user in AI proposed changes.user_removed_lines(Int, if applicable): Number of lines removed/changed by user in AI proposed changes.programming_language(string, if applicable): The programming language of the file.
- Attributes:
-
qwen-code.chat_compression(Counter, Int): Counts chat compression operations- Attributes:
tokens_before: (Int): Number of tokens in context prior to compressiontokens_after: (Int): Number of tokens in context after compression
- Attributes: