GenAI Observability — Four Schemas Compared

Why four standards exist

Each project optimizes for a different audience. OTel optimizes for vendor neutrality, OpenInference for evaluation tooling, OpenLLMetry for fast pragmatic instrumentation, Langfuse for a single product UX. Convergence on OTel is now real but uneven.

NewOTel Semantic Conventions v1.41.0 — released April 28, 2026

invoke_agent now has proper CLIENT/INTERNAL span separation (LangChain, CrewAI patterns)
invoke_workflow landed as a first-class operation alongside invoke_agent and execute_tool
Reasoning tokens tracked via gen_ai.usage.reasoning.output_tokens attribute
Streaming metrics: gen_ai.client.operation.time_to_first_chunk and gen_ai.client.operation.time_per_output_chunk
Cache token attributes: gen_ai.usage.cache_read.input_tokens / gen_ai.usage.cache_creation.input_tokens
Per-message events deprecated in favor of gen_ai.input.messages / gen_ai.output.messages / gen_ai.system_instructions

OpenTelemetry SemConv

Backed by CNCF · v1.41.0 (Apr 2026)

The neutral standard. Slow-moving by committee, but every other project is now converging on it. Defines spans (chat, embeddings, invoke_agent, invoke_workflow, execute_tool), metrics, and events. Status: Development.

⭐ Strength: backend-agnostic. Instrument once, swap vendors freely.

OpenInference

Arize · OSS

Schema-first. The openinference.span.kind attribute is required on every span and drives a strict taxonomy: LLM, EMBEDDING, CHAIN, RETRIEVER, TOOL, AGENT, RERANKER, GUARDRAIL, EVALUATOR, PROMPT. Built around tracing for evaluation pipelines (Phoenix, Arize AX).

⭐ Strength: opinionated taxonomy makes evals & dashboards cleaner.

OpenLLMetry

Traceloop (acq. ServiceNow, Mar 2026) · OSS

OTel-native extension. Ships drop-in instrumentations for 30+ LLM providers, frameworks (LangChain, LlamaIndex, Haystack), and vector DBs (Pinecone, Qdrant, Weaviate, Chroma…). Adds attributes upstream OTel hasn't standardized yet, then deprecates them as OTel catches up.

⭐ Strength: fastest path from pip install to traces.

Langfuse

Langfuse · OSS + Cloud

Product-driven data model. Trace → Observation tree where each observation is a SPAN, GENERATION, or EVENT (plus extended types: Agent, Tool, Chain, Retriever, Evaluator, Embedding, Guardrail). Accepts OTel data and maps it onto this model for the Langfuse UI.

⭐ Strength: rich product UX (sessions, users, evals, prompt mgmt).

The 30-second mental model

If you only remember one thing per project:

OTel · operation-name driven

Span name = {operation} {target}. Example: chat gpt-4o, invoke_agent customer_support, execute_tool get_weather.

OpenInference · span-kind driven

Every span carries openinference.span.kind = LLM | AGENT | TOOL | RETRIEVER | …. Backend uses this to decide rendering and aggregation.

OpenLLMetry · OTel + provider extras

Mostly aligns with OTel, but adds Traceloop-specific attrs (traceloop.workflow.name, traceloop.entity.path) for nested orchestration.

Langfuse · observation-type driven

Each observation has a type field. The UI groups GENERATION observations into "model calls" and rolls up cost/tokens to the trace.

Span types & operation names

The biggest divergence between schemas is how spans are named and categorized. OTel uses an operation-name attribute, OpenInference uses a required span-kind, Langfuse uses observation types, OpenLLMetry follows OTel.

OpenTelemetry — `gen_ai.operation.name`

Span kind: CLIENT (external) or INTERNAL (in-process)

chat text_completion embeddings generate_content create_agent invoke_agent invoke_workflow execute_tool retrieval

Span name format: {operation_name} {model_or_target} — e.g., chat gpt-4o, invoke_agent triage_agent.

OpenInference — `openinference.span.kind` (required)

Strict taxonomy, backend-aware

LLM EMBEDDING CHAIN RETRIEVER TOOL AGENT RERANKER GUARDRAIL EVALUATOR PROMPT

CHAIN is unique to OpenInference: it represents glue code linking other spans (the "request → retrieve → LLM → respond" wrapper). PROMPT captures prompt template rendering.

OpenLLMetry — follows OTel + Traceloop extensions

Built atop gen_ai.*, adds traceloop.*

chat embeddings execute_tool traceloop.workflow traceloop.task traceloop.agent traceloop.tool db.* (vector DBs)

Vector DB calls follow OTel db.* conventions plus Traceloop additions (db.vector.query.top_k, etc.).

Langfuse — observation `type`

Trace → tree of Observations

SPAN GENERATION EVENT AGENT * TOOL * CHAIN * RETRIEVER * EMBEDDING * EVALUATOR * GUARDRAIL *

* Extended types added Aug 2025 — semantically richer rendering, but the core data model is still the SPAN/GENERATION/EVENT triple.

Worked example: a RAG agent answering a question

Same workload, four different shapes.

OTel

# root
span: "invoke_agent rag_qa"
  gen_ai.operation.name=invoke_agent
  span.kind=CLIENT
  ├─ "embeddings text-embedding-3"
  │     gen_ai.operation.name=embeddings
  ├─ "db.search vector_index"  # OTel db.*
  ├─ "chat gpt-4o"
  │     gen_ai.operation.name=chat
  └─ "execute_tool get_doc"
        gen_ai.operation.name=execute_tool

OpenInference

# root
span: "agent.run"
  openinference.span.kind=AGENT
  ├─ kind=EMBEDDING
  │     embedding.embeddings.0.embedding.text=…
  ├─ kind=RETRIEVER
  │     retrieval.documents.0.document.id=…
  ├─ kind=LLM
  │     llm.input_messages.0.message.role=user
  └─ kind=TOOL
        tool.name=get_doc

OpenLLMetry

# root
span: "rag_qa.workflow"
  traceloop.workflow.name=rag_qa
  ├─ "openai.embeddings"
  │     gen_ai.system=openai
  ├─ "pinecone.query"
  │     db.system=pinecone
  ├─ "openai.chat"
  │     gen_ai.operation.name=chat
  └─ "tool.get_doc"
        traceloop.entity.name=get_doc

Langfuse

# trace (top-level)
trace: "rag_qa"  user_id=…  session_id=…
  ├─ obs type=EMBEDDING
  │     model=text-embedding-3
  ├─ obs type=RETRIEVER
  │     input="…question…"
  ├─ obs type=GENERATION
  │     model=gpt-4o
  │     usage.input/output/total
  └─ obs type=TOOL
        name=get_doc

Attribute reference

The same concept, four different attribute names. Filter by concept or toggle columns to focus on the schemas you care about. Attributes flagged v1.41 are new in OTel SemConv v1.41.0.

Concept	OTel SemConv	OpenInference	OpenLLMetry	Langfuse
Identity & routing
Model name	`gen_ai.request.model`	`llm.model_name`	`gen_ai.request.model`	`model` (on GENERATION)
Provider / system	`gen_ai.system` (openai, anthropic, …)	`llm.provider`	`gen_ai.system`	`model.provider` (mapped)
Operation	`gen_ai.operation.name`	`openinference.span.kind`	`gen_ai.operation.name`	observation `type`
Response model	`gen_ai.response.model`	`llm.model_name`	`gen_ai.response.model`	`completion_start_time` meta
Response ID	`gen_ai.response.id`	`llm.output_messages.{i}.id`	`gen_ai.response.id`	`id` on observation
Messages & content
System instructions	`gen_ai.system_instructions` v1.41	`llm.input_messages.{i}.message.role=system`	`gen_ai.system_instructions`	part of `input`
Input messages	`gen_ai.input.messages` v1.41	`llm.input_messages.{i}.message.role/content`	`gen_ai.input.messages`	`input` (JSON)
Output messages	`gen_ai.output.messages` v1.41	`llm.output_messages.{i}.message.role/content`	`gen_ai.output.messages`	`output` (JSON)
Reasoning content	message part `type=reasoning` v1.41	—	via OTel	in `output` JSON
Finish reason	`gen_ai.response.finish_reasons`	`llm.output_messages.{i}.message.finish_reason`	`gen_ai.response.finish_reasons`	in metadata
Token usage
Input (prompt) tokens	`gen_ai.usage.input_tokens`	`llm.token_count.prompt`	`gen_ai.usage.input_tokens`	`usage.input`
Output (completion) tokens	`gen_ai.usage.output_tokens`	`llm.token_count.completion`	`gen_ai.usage.output_tokens`	`usage.output`
Total tokens	derived	`llm.token_count.total`	`llm.usage.total_tokens`	`usage.total`
Cache-read input tokens	`gen_ai.usage.cache_read.input_tokens` v1.41	`llm.token_count.prompt_details.cache_read`	via OTel	mapped from OTel
Cache-write input tokens	`gen_ai.usage.cache_creation.input_tokens` v1.41	`llm.token_count.prompt_details.cache_write`	via OTel	mapped from OTel
Reasoning tokens	`gen_ai.usage.reasoning.output_tokens` v1.41	`llm.token_count.completion_details.reasoning`	via OTel	mapped from OTel
Request parameters
Temperature	`gen_ai.request.temperature`	`llm.invocation_parameters` (JSON)	`gen_ai.request.temperature`	`model_parameters.temperature`
Top-p	`gen_ai.request.top_p`	`llm.invocation_parameters` (JSON)	`gen_ai.request.top_p`	`model_parameters.top_p`
Max tokens	`gen_ai.request.max_tokens`	`llm.invocation_parameters` (JSON)	`gen_ai.request.max_tokens`	`model_parameters.max_tokens`
Stop sequences	`gen_ai.request.stop_sequences`	`llm.invocation_parameters` (JSON)	`gen_ai.request.stop_sequences`	`model_parameters.stop`
Tools
Tool name	`gen_ai.tool.name`	`tool.name`	`gen_ai.tool.name`	`name` on TOOL obs
Tool description	`gen_ai.tool.description`	`tool.description`	`gen_ai.tool.description`	in `metadata`
Tool call ID	`gen_ai.tool.call.id`	`tool_call.{i}.tool_call.id`	`gen_ai.tool.call.id`	in `metadata`
Tool args	in message part `tool_call`	`tool.parameters` / `tool_call.{i}.tool_call.function.arguments`	in messages	`input`
Agents & workflows
Agent name	`gen_ai.agent.name`	`agent.name` (with kind=AGENT)	`traceloop.agent.name`	`name` on AGENT obs
Agent ID	`gen_ai.agent.id`	`agent.id`	`traceloop.agent.id`	`id`
Agent description	`gen_ai.agent.description`	—	—	in `metadata`
Workflow / task	`gen_ai.operation.name=invoke_workflow` v1.41	kind=CHAIN	`traceloop.workflow.name` / `.task.name`	SPAN obs
Retrieval & embeddings
Embedding text	— (not standardized)	`embedding.embeddings.{i}.embedding.text`	in messages	`input`
Embedding vector	— (size only)	`embedding.embeddings.{i}.embedding.vector`	— typically dropped	in `output`
Embedding dimensions	`gen_ai.embeddings.dimension.count` v1.41	`embedding.model_name` implicit	via OTel	mapped
Retrieved doc ID	— (use `db.*`)	`retrieval.documents.{i}.document.id`	via vector DB instr.	in `output`
Retrieved doc score	—	`retrieval.documents.{i}.document.score`	—	in `output`
Retrieved doc content	—	`retrieval.documents.{i}.document.content`	—	in `output`
Trace context (user, session, etc.)
User ID	`user.id` (general)	`user.id`	`traceloop.association.properties.user_id`	`user_id` (trace + obs)
Session ID	`session.id`	`session.id`	`traceloop.association.properties.session_id`	`session_id` (trace + obs)
Tags	— (custom)	`tag.tags`	custom	`tags` (trace-level, propagated)
Metadata	— (custom)	`metadata` (JSON)	custom	`metadata` (trace + obs)
Errors & evaluation
API error	exception event with `error.type`	`exception.*` (OTel std)	OTel exception	`level=ERROR` + `status_message`
Rate limiting	`error.type=_OTHER` (provider-specific values not standardized)	—	OTel exception	`level=WARNING`
Evaluation result	— (not in spec)	kind=EVALUATOR span	—	scores via API

Metrics

Metrics are where OTel pulls clearly ahead. OpenInference and Langfuse focus on traces and compute roll-ups in the backend; OTel and OpenLLMetry emit native histograms you can scrape into Prometheus / Elastic / any TSDB.

OpenTelemetry — first-class GenAI metrics

gen_ai.client.operation.duration (histogram) gen_ai.client.token.usage (histogram) gen_ai.client.operation.time_to_first_chunk v1.41 gen_ai.client.operation.time_per_output_chunk v1.41 gen_ai.server.request.duration gen_ai.server.time_to_first_token

Streaming metrics are the standout in v1.41: time_to_first_chunk tells you when the user sees something, time_per_output_chunk tells you how snappy the stream feels.

OpenLLMetry — same as OTel + extras

gen_ai.client.operation.duration gen_ai.client.token.usage llm.usage.total_tokens (histogram, per-call) db.client.operations.duration (vector DB)

OpenInference — no native metrics

OpenInference is trace-only by design. The Phoenix / Arize backend computes aggregations (latency p99, token throughput, cost) from spans. If you need Prometheus-style metrics, add an OTel SDK alongside.

Langfuse — backend-computed dashboards

Langfuse derives metrics from observations on ingestion: cost (using its model price catalog), token rates, latency percentiles, score distributions, and per-user/session aggregates surface in the UI. No raw metric stream is emitted — this is a product, not a metrics pipeline.

Practical implication: if you're standardizing on a single observability backend (Elastic, Datadog, Grafana, New Relic), OTel + OpenLLMetry gives you the cleanest metric story today. OpenInference and Langfuse are richer for trace-level analysis but require their backends (or a custom roll-up) for metrics.

Project drill-downs

Pick a project to see its data model, key attributes, and example wire format.

Design philosophy

OTel SemConv defines a vendor-neutral wire format for GenAI telemetry — spans, metrics, and events. Every other project on this page eventually converges on it. The spec is in Development status and moving fast: v1.41.0 (April 28, 2026) added agent/workflow operation separation, streaming metrics, reasoning tokens, and a unified message model.

Span model

One gen_ai.operation.name attribute drives the taxonomy. Spans are CLIENT (external API call) or INTERNAL (in-process). Span name = {operation_name} {model_or_target}.

Key v1.41.0 changes

invoke_agent properly distinguishes external (CLIENT) vs in-process (INTERNAL) agent calls — closes the LangChain/CrewAI gap
invoke_workflow is now a first-class operation (was previously implicit)
Streaming metrics: gen_ai.client.operation.time_to_first_chunk, gen_ai.client.operation.time_per_output_chunk
Cache token attributes split (read vs creation)
Per-message events (gen_ai.user.message, etc.) deprecated → use gen_ai.input.messages attribute
Reasoning tokens tracked via gen_ai.usage.reasoning.output_tokens

Example span

// span name
"chat gpt-4o"

// attributes
gen_ai.operation.name:        "chat"
gen_ai.system:                "openai"
gen_ai.request.model:         "gpt-4o"
gen_ai.response.model:        "gpt-4o-2024-11-20"
gen_ai.request.temperature:   0.7
gen_ai.usage.input_tokens:    412
gen_ai.usage.output_tokens:   128
gen_ai.usage.cache_read.input_tokens:    300
gen_ai.usage.reasoning.output_tokens:    42
gen_ai.response.finish_reasons: ["stop"]
gen_ai.input.messages:        [/* JSON */]
gen_ai.output.messages:       [/* JSON */]

At a glance

Status: Development
Latest: v1.41.0 — Apr 28, 2026
Backed by: CNCF / OpenTelemetry
Span kinds: CLIENT, INTERNAL
Operations: chat, embeddings, text_completion, generate_content, invoke_agent, invoke_workflow, execute_tool, create_agent, retrieval
Metrics: Yes (histograms)
Best for: Vendor-neutral instrumentation, multi-backend portability

Design philosophy

OpenInference defines a strict span-kind taxonomy via the required openinference.span.kind attribute. The schema is opinionated: every span declares whether it's an LLM, EMBEDDING, RETRIEVER, TOOL, AGENT, CHAIN, RERANKER, GUARDRAIL, or EVALUATOR. This makes downstream evaluation and dashboard rendering deterministic, but it requires instrumentation to care about which kind a span is.

Attribute style

Dot-namespaced with indexed flattening: llm.input_messages.0.message.role, llm.input_messages.1.message.content, etc. Every list-valued attribute is exploded this way (as opposed to OTel v1.41's JSON-on-attribute approach).

Where it shines

RAG and retrieval: retrieval.documents.{i}.document.{id,content,score,metadata} is the cleanest schema for retrieval traces among the four. Phoenix / Arize AX render these directly into evaluation pipelines.

Example span (LLM kind)

openinference.span.kind:           "LLM"
llm.model_name:                    "gpt-4o"
llm.provider:                      "openai"
llm.input_messages.0.message.role: "system"
llm.input_messages.0.message.content: "You are…"
llm.input_messages.1.message.role: "user"
llm.input_messages.1.message.content: "Tell me…"
llm.output_messages.0.message.role: "assistant"
llm.token_count.prompt:            412
llm.token_count.completion:        128
llm.token_count.prompt_details.cache_read: 300
llm.invocation_parameters:         '{"temperature":0.7}'

At a glance

Status: Active
Backed by: Arize
Span kinds: LLM, EMBEDDING, CHAIN, RETRIEVER, TOOL, AGENT, RERANKER, GUARDRAIL, EVALUATOR, PROMPT
Attribute style: Dot-flattened with indices
Metrics: None (trace-only)
Best for: RAG, eval pipelines, opinionated dashboards

Design philosophy

OpenLLMetry is the pragmatic upstream contributor. Traceloop (acquired by ServiceNow on March 2, 2026) ships drop-in instrumentations for 30+ LLM providers, frameworks (LangChain, LlamaIndex, Haystack, CrewAI), and vector DBs (Pinecone, Qdrant, Weaviate, Chroma, Milvus, …) — many of which are fed back into OTel SemConv. When OTel adopts a convention, OpenLLMetry deprecates its own and aliases.

Where it differs from raw OTel

traceloop.workflow.name / traceloop.task.name for orchestration nesting (a workflow is a sequence of tasks)
traceloop.entity.name / traceloop.entity.path for dotted-path call hierarchy
traceloop.association.properties.* for attaching user/session/custom IDs
Vector DB instrumentations expose db.vector.query.top_k, db.vector.namespace, etc.

Migration trajectory

Older versions emitted gen_ai.prompt / gen_ai.completion; the project is migrating to OTel's gen_ai.input.messages / gen_ai.output.messages. Issue #3515 tracks this transition.

Example span (orchestrated workflow)

// span name
"openai.chat"

gen_ai.system:                "openai"
gen_ai.operation.name:        "chat"
gen_ai.request.model:         "gpt-4o"
gen_ai.usage.input_tokens:    412
gen_ai.usage.output_tokens:   128
traceloop.workflow.name:      "customer_qa"
traceloop.entity.name:        "answer_step"
traceloop.entity.path:        "customer_qa.retrieve.answer_step"
traceloop.association.properties.user_id: "u-42"

At a glance

Status: Active, fast-moving
Backed by: Traceloop / ServiceNow (acq. Mar 2026)
Span kinds: OTel CLIENT/INTERNAL + named entities
Coverage: 30+ LLM providers, frameworks, vector DBs
Metrics: OTel-native
Best for: Quick instrumentation, multi-provider apps, OTel-aligned vendors

Design philosophy

Langfuse models traces as a tree of Observations, with three core types: SPAN (generic unit of work), GENERATION (LLM call with prompt/output/usage), and EVENT (point-in-time log). Extended types (Agent, Tool, Chain, Retriever, Evaluator, Embedding, Guardrail) layer semantic meaning onto SPAN. Trace-level attributes (user_id, session_id, tags, metadata) are propagated to every observation for single-table queries.

OTel mapping

Langfuse accepts OTLP and maps incoming OTel attributes onto its data model:

spans with gen_ai.operation.name=chat → GENERATION observation
gen_ai.usage.input_tokens → usage.input
gen_ai.request.model → model (and matched against Langfuse's price catalog)
session.id → trace-level session_id
user.id / OpenInference / OpenLLMetry equivalents are recognized

Example observation (GENERATION)

{
  "id": "obs_abc",
  "trace_id": "trc_xyz",
  "type": "GENERATION",
  "name": "answer_step",
  "model": "gpt-4o",
  "model_parameters": { "temperature": 0.7 },
  "input": [{"role":"user", "content":"…"}],
  "output": { "role":"assistant", "content":"…" },
  "usage": { "input":412, "output":128, "total":540 },
  "user_id": "u-42",         // propagated
  "session_id": "s-7",
  "level": "DEFAULT"
}

Strengths beyond the schema

Cost catalog (auto-prices by model), prompt management, evaluators, datasets, sessions UI. The schema decisions reflect product priorities — they're easier to query at the UI level than to interop with externally.

At a glance

Status: Active, OSS + Cloud
Data model: Trace → Observations
Core types: SPAN, GENERATION, EVENT
Extended types: Agent, Tool, Chain, Retriever, Evaluator, Embedding, Guardrail
Ingest: SDKs + OTLP
Best for: End-to-end product UX (sessions, costs, evals, prompts)

Where the ecosystem is converging

Two years ago this would have been four entirely separate worlds. Today the gravitational pull is clearly toward OpenTelemetry — but each project still differentiates where the spec doesn't yet meet their needs.

What's already aligned

Token usage

OTel gen_ai.usage.input_tokens / output_tokens are now read by all four projects (Langfuse and OpenLLMetry natively, OpenInference via interop layers).

Model + provider

gen_ai.request.model + gen_ai.system is universally understood.

Tool calls

Tool name and call ID converged on gen_ai.tool.name / gen_ai.tool.call.id in v1.41.0; OpenInference and Langfuse map theirs.

OTLP transport

Langfuse, Phoenix (OpenInference), Traceloop, and every observability vendor accept OTLP. The wire is settled even when the schema isn't.

Where divergence remains

Span taxonomy

OTel uses operation names, OpenInference uses required span-kinds, Langfuse uses observation types. There's no plan to unify; this is the deepest philosophical split.

Message representation

OTel v1.41 picked JSON-on-attribute (gen_ai.input.messages); OpenInference uses indexed flattening (llm.input_messages.0…). Both are still in production code.

Retrieval / RAG attributes

OpenInference has the richest schema (retrieval.documents.{i}.*). OTel still says "use db.*" and leaves the retrieval-specific bits unstandardized.

Trace-level context

Langfuse propagates user/session/tags to every observation by design. OTel puts session.id on the span; downstream propagation is the consumer's job.

Practical takeaways for instrumenting today

Emit OTel SemConv v1.41.0. Every backend can read it. Lock-in costs you nothing.
If you use Phoenix/Arize: add openinference.span.kind alongside the OTel attributes — they coexist on the same span.
If you use Langfuse: set session.id and user.id as OTel attributes. Langfuse maps them automatically.
If you use OpenLLMetry: you're already on OTel. Watch for deprecations as Traceloop aliases land upstream.
Avoid hard-coding any single project's attribute names in your app code. Use the SDK abstraction; let it emit the right names per backend.

Bottom line: the GenAI observability space is early enough that convergence is still possible. Every project that aligns with OpenTelemetry makes the pie bigger for everyone — vendors compete on value, not on proprietary schemas. v1.41.0 is the most significant step in that direction so far.

Why four standards exist

NewOTel Semantic Conventions v1.41.0 — released April 28, 2026

OpenTelemetry SemConv

OpenInference

OpenLLMetry

Langfuse

The 30-second mental model

OTel · operation-name driven

OpenInference · span-kind driven

OpenLLMetry · OTel + provider extras

Langfuse · observation-type driven

Span types & operation names

OpenTelemetry — gen_ai.operation.name

OpenInference — openinference.span.kind (required)

OpenLLMetry — follows OTel + Traceloop extensions

Langfuse — observation type

Worked example: a RAG agent answering a question

OTel

OpenInference

OpenLLMetry

Langfuse

Attribute reference

Metrics

OpenTelemetry — first-class GenAI metrics

OpenLLMetry — same as OTel + extras

OpenInference — no native metrics

Langfuse — backend-computed dashboards

Project drill-downs

Design philosophy

Span model

Key v1.41.0 changes

Example span

At a glance

Design philosophy

Attribute style

Where it shines

Example span (LLM kind)

At a glance

Design philosophy

Where it differs from raw OTel

Migration trajectory

Example span (orchestrated workflow)

At a glance

Design philosophy

OTel mapping

Example observation (GENERATION)

Strengths beyond the schema

At a glance

Where the ecosystem is converging

What's already aligned

Token usage

Model + provider

Tool calls

OTLP transport

Where divergence remains

Span taxonomy

Message representation

Retrieval / RAG attributes

Trace-level context

Practical takeaways for instrumenting today

OpenTelemetry — `gen_ai.operation.name`

OpenInference — `openinference.span.kind` (required)

Langfuse — observation `type`