devops

  • home
  • cloud computing career paths
  • Kubernetes
  • git commands
  • gcp devops
  • CI CD pipeline

GCP Observability questions for exam

Share
Share on Facebook
Share
Share this
Pin
Pin this

Table of Contents

Toggle
  • Section 4: Implementing observability practices (~20% of the exam)
    • 4.1 Managing logs. Considerations include:
    • 4.2 Managing metrics. Considerations include:
    • 4.3 Managing dashboards and alerts. Considerations include:
  • questions
    • GCP Cloud Logging Vs error reporting vs cloud monitoring
    • Uptime checks & alerts
  • GCP Cloud Logging
    • Error reporting
    • Trace vs profiler in php
    • cloud trace vs cloud profiler gcp
    • GCP Cloud trace
    • cloud profiler
  • GCP Observability Pricing
  • OpenTelemetry

Section 4: Implementing observability practices (~20% of the exam)

4.1 Managing logs. Considerations include:

● Collecting and importing logs (e.g., Cloud Logging agent, Cloud Audit Logs, VPC Flow
Logs, Cloud Service Mesh)
● Logging optimization (e.g., sltering, sampling, exclusions, cost, source considerations)
● Exporting logs (e.g., BigQuery, Pub/Sub, for auditing)
● Retaining logs
● Analyzing logs
● Handling sensitive data (e.g., personally identiable information [PII], protected health
information [PHI])

4.2 Managing metrics. Considerations include:

● Collecting and analyzing metrics (e.g., application, platform, networking, Cloud Service
Mesh, Google Cloud Managed Service for Prometheus, hybrid/multi-cloud)
● Creating custom metrics from logs
● Using Metrics Explorer for ad hoc metric analysis
● Creating synthetic monitors

4.3 Managing dashboards and alerts. Considerations include:

● Managing dashboards (e.g., creating, altering, sharing, playbooks)
● Conguring alerting and alerting policies (e.g., SLIs, SLOs, cost control)
● Widely used third-party alerting tools (Datadog, New Relic, Dynatrace, Zabbix, Nagios, Sematext, Grafana, SolarWinds, Splunk, and Azure Monitor)

Note: cloud trace (for latency of requests) and application profiler (performance bottlenecks within specific code functions) covered in Nex part

questions

General Observability Concepts:
  • What is observability in the context of cloud computing?
    • Explain how observability allows you to understand the internal state of an application by collecting metrics, logs, and traces.
  • What are the three pillars of observability?
    • Metrics (quantitative data like CPU usage), Logs (textual events), and Traces (detailed execution paths through a system). 
  • What is the difference between monitoring and observability?
    • Monitoring focuses on collecting data from predefined points, while observability aims to gather comprehensive data to understand system behavior beyond just predefined metrics. 
Google Cloud Monitoring Components:
  • What are the main components of Google Cloud Monitoring?
    • Metrics, Logs, Traces, Alerting, Dashboards 
  • How do you collect custom metrics from your applications running on GCP?
    • Use the Cloud Monitoring API to send custom metrics data to Google Cloud Monitoring. 
  • What are the different types of logs available in Google Cloud Monitoring?
    • System logs (generated by GCP services), Application logs (custom logs from your application), and Audit logs (information about user activity). 
  • Explain the concept of distributed tracing and how it is used in GCP?
    • Distributed tracing allows you to follow a request as it travels through multiple services in a distributed system, using tools like Stackdriver Trace to identify performance bottlenecks. 
Alerting and Notification:
  • How do you set up alerts in Google Cloud Monitoring?
    • Define conditions based on metrics or log entries, choose notification channels (email, SMS, etc.), and set thresholds for triggering alerts.
  • What are the different types of alerting policies in Google Cloud Monitoring?
    • Threshold-based, Metric-based, and Log-based alerts. 
Dashboarding and Visualization:
  • How can you create custom dashboards in Google Cloud Monitoring?
    • Use the dashboard builder to add charts, graphs, and tables to visualize collected metrics and logs.
  • What are some best practices for designing effective monitoring dashboards?
    • Focus on key performance indicators, use clear visualizations, and group related metrics together. 
Advanced Topics:
  • What is the role of “Uptime Checks” in Google Cloud Monitoring?
    • Uptime checks periodically verify the availability of your applications and services by sending requests to designated endpoints. 
  • How can you use Cloud Logging to analyze large volumes of log data?
    • Utilize log sinks to export logs to BigQuery for advanced analysis and reporting. 
  • How do you troubleshoot a performance issue using distributed tracing in Google Cloud Monitoring?
    • Analyze traces to identify slow-performing components (hardware, software, and malware)  within a distributed system and pinpoint bottlenecks. latency and errors.

GCP Cloud Logging Vs error reporting vs cloud monitoring

Cloud monitoring for metrics monitor over 1500+ metric 100+ resources it can be system resoruces: like CPU, Disk , memory, Network,

application metrics like latency, throughput, responsetime, error rate, requset rate etc.

Network metrics 

Jitter: An indicator of network stability
Throughput: A measure of how many messages or packets reach their destination
Latency: The time it takes for data to travel across a network
Packet loss: The number of data packets that are lost during transmission
Round trip time: The time it takes for data to travel from a device to a server and back again
Bandwidth: The maximum rate at which data can be transmitted over a network connection
Error rate: A metric that affects user experience, business operations, and network efficiency

Uptime checks & alerts

gcp cloud monitoring sample

GCP Cloud Logging

Logs automatically collected in Google managed services like GKE, app engine, cloud run etc. but if you Only VM computing purose you need install an agent as of 2025 its called Ops Agent. (we cover billing for logs in nex part) usally 0.50$ for 1gb as on date.

these logs just like syslogs, server logs like access_log and error logs and application all in one place with help of ops agent.

gcp cloud logging

Error reporting

we have options for errors open, acknowleged, resloved, and muted,

GCP Error Reporting essentially “grabs” information from Cloud Logging, (if nginx errors >> sysops agent >>cloudlogging>>error reporting)

cost of gcp error reporting: no extra cost already for clog logging $0.50 per GB of logs ingested with the first 50 GB per month being free for each project for 30 days additional charge of $0.01 per GB per month. 

Benefit: instead of checking error logs (sys, db, server, application etc) in system old fashioned we can use error reporting for fast resolving.

GCP error reporting

Trace vs profiler in php

in order to idnetifty application performance bottlenecs we use trace and then profiling.

In PHP development, a “trace” generally refers to a detailed record of function calls and execution flow within your code, often used for debugging and understanding the path of data through a program, while a “profiler” is a tool that measures the performance of your code, identifying specific sections that are taking the most time to execute and thus helping you pinpoint performance bottlenecks.

cloud trace vs cloud profiler gcp

Cloud Trace to understand why individual requests are slow in districbuted system (OS, application1, app2, database) . If after that you still wonder where CPU time is spent, then Cloud Profiler mostly in application code.

Tracing is about describing transactions. 

new-relic-trace-visualization-min_0

GCP Cloud trace

enable the Cloud Trace API within your Google Cloud project, which is essentially done through the Google Cloud Console, and then install the appropriate client library for your programming language to start sending traces from your application.

google cloud-trace-overview-page

cloud profiler

Types of profiling available  with google profiles, you may promoteous, newrelic datadog for measuring application performance.

gcp profiler-interface

Profile type Go Java Node.js Python
CPU time Y Y Y
Heap memory Y Y Y
Allocated heap Y
Contention Y
Threads Y
Wall time Y Y Y

GCP Observability Pricing

all logs are priced VPC flow logs, (based on storage and querying)

cloud logging

$0.50/GiB; One-time charge for streaming logs into log bucket storage for indexing, querying, and analysis; includes up to 30 days of storage in log buckets. free First 50 GiB/project/month.

more details here.  better to create budget & quotas limits alert..

gcp billing alert for logging

CLoud monitoring:

free First 150 MiB per billing account for metrics charged by bytes ingested

$0.2580/MiB1: first 150–100,000 MiB  (All Monitoring data except data ingested by using Managed Service for Prometheus)

$0.1510/MiB: next 100,000–250,000 MiB$0.0610/MiB: >250,000 MiB

 

more details here

OpenTelemetry

OpenTelemetry SDK that export telemetry signals (traces, metrics, logs) directly into a backend. OpenTelemetry SDK using OpenTelemetry protocol (OTLP) — or other collectors (using the OTLP exporter) that send telemetry signals to a collector

OpenTelemetry in gcp

About Raju Ginne

AMFI Registered mutual fund distributor based in Hyderabad. you may contact me for mutual funds SIP investments Whatsapp: 9966367675.
nism certified research analyst

Ask a Question: Cancel reply

Your email address will not be published. Required fields are marked *

devops learning path

Cloud security engineer learning path

cloud engineer vs devops engineer

cloud network engineer path

Recent Posts

  • Google SRE devops Phiosophy SRE methodology Cultural concepts
  • GCP Observability questions for exam
  • Google Cloud Developer vs Cloud Engineer: Key Differences & Similarities
  • Network Engineer Roadmap from Zero to Hero.
  • devops vs sre difference salary