Glossary

5.2 Observability

Log every request with: request ID, timestamp, query, retrieval latency, generation latency, total latency, token count, guardrail outcomes, and model used. - Implement structured logging (JSON format) using Python's `logging` module or `structlog`. - Track the following metrics: - Request rate (req

Learn More

Related Terms