Fixing AI Observability: How I Added GenAI Semantic Support for RAG Embedding Spans in Mastra

Discussed on DEV

OpenTelemetry has become the standard for observing modern systems. But when you start building AI applications, traditional traces aren't enough. You don't just want to know that a request happened. You want to know: Which model generated the output? Which provider was used? How many tokens were consumed? What embedding model processed the documents? How much did the operation cost? These questions become even more important when building Retrieval-Augmented Generation (RAG) systems. Recentl...
