Case Study: Implementing End-to-End Observability for Microservices
In this final article of the series, we analyze a real-world observability implementation in a microservices architecture: the journey of a five-service e-commerce startup adopting OpenTelemetry to solve chronic debugging problems, reduce Mean Time To Resolution (MTTR), and gain complete visibility into business flows.
We will document every phase of the implementation, from the initial situation (zero observability) to the complete production stack, with before and after metrics that quantify the impact of the observability investment.
What You Will Learn in This Article
- How to plan an observability implementation step-by-step
- The complete stack configuration for 5 microservices
- Instrumentation patterns for a real e-commerce flow
- Grafana dashboards for operational and business monitoring
- Before/after metrics: MTTR, incident detection, SLO compliance
- Lessons learned and adoption recommendations
The Context: ShopFlow E-Commerce Platform
ShopFlow is a microservices-based e-commerce platform with the following stack:
ShopFlow Architecture
| Service | Language | Database | Responsibility |
|---|---|---|---|
| API Gateway | Node.js (Express) | Redis (cache) | Routing, authentication, rate limiting |
| Order Service | Java (Spring Boot) | PostgreSQL | Order creation, state management |
| Inventory Service | Python (FastAPI) | PostgreSQL | Stock management, reservations |
| Payment Service | Java (Spring Boot) | PostgreSQL | Payments, refunds |
| Notification Service | Python (FastAPI) | MongoDB | Email, SMS, push notifications |
Services communicate via synchronous HTTP and Kafka for asynchronous events. Deployment is on Kubernetes (EKS) with approximately 500 orders/hour at peak.
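Because part of the flow is asynchronous, trace context has to ride along with each Kafka message or the trace breaks at the producer/consumer boundary. The OTel instrumentation libraries handle this automatically, but the mechanism itself is simple: a W3C `traceparent` header injected on produce and extracted on consume. A minimal sketch of the format (the helper names are illustrative, not from the ShopFlow codebase, and real Kafka clients carry headers as bytes):

```python
import re
import secrets

# W3C traceparent: version-traceid-spanid-flags, all lowercase hex
TRACEPARENT_RE = re.compile(r"^00-([0-9a-f]{32})-([0-9a-f]{16})-([0-9a-f]{2})$")

def make_traceparent(trace_id: str, span_id: str, sampled: bool = True) -> str:
    """Build a W3C traceparent header value for an outgoing Kafka message."""
    return f"00-{trace_id}-{span_id}-{'01' if sampled else '00'}"

def extract_traceparent(headers: dict):
    """Parse the traceparent header from incoming Kafka message headers."""
    value = headers.get("traceparent")
    if not value:
        return None
    m = TRACEPARENT_RE.match(value)
    if not m:
        return None
    trace_id, parent_span_id, flags = m.groups()
    return trace_id, parent_span_id, flags

# Producer side: attach the current trace context to the message headers
trace_id = secrets.token_hex(16)  # 32 hex chars
span_id = secrets.token_hex(8)    # 16 hex chars
headers = {"traceparent": make_traceparent(trace_id, span_id)}

# Consumer side: recover the context and continue the same trace
ctx = extract_traceparent(headers)
assert ctx is not None and ctx[0] == trace_id
```

The consumer starts its span with the extracted trace ID as parent, so the Jaeger view shows one continuous trace across the synchronous HTTP hops and the asynchronous Kafka hop.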
Situation Before Observability
Before OpenTelemetry adoption, ShopFlow had limited system visibility:
- Unstructured logs: each service logged in a different format, without trace_id. Finding logs for a request required manual grep across 5 services
- Basic metrics: only infrastructure metrics (CPU, memory, disk) from CloudWatch, no application metrics
- Zero tracing: impossible to follow a request across services. Debugging cross-service issues took days
- Reactive alerts: alerts only on CPU > 80% and generic 5xx errors, without context on error type or business impact
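The "manual grep across 5 services" pain point disappears once every log line is a JSON object stamped with the active trace_id. In production the OTel SDK supplies the span context and a log appender injects it, but the pattern can be sketched with the standard library alone (the context holder and field names here are illustrative):

```python
import contextvars
import json
import logging

# Hypothetical context holder; in production the OTel SDK exposes the
# current span context and the logging integration injects it for you.
current_trace_id = contextvars.ContextVar("current_trace_id", default="")

class TraceContextFilter(logging.Filter):
    """Stamp every log record with the active trace_id."""
    def filter(self, record: logging.LogRecord) -> bool:
        record.trace_id = current_trace_id.get()
        return True

class JsonFormatter(logging.Formatter):
    """Emit one JSON object per line so logs are machine-searchable."""
    def format(self, record: logging.LogRecord) -> str:
        return json.dumps({
            "level": record.levelname,
            "service": "order-service",
            "message": record.getMessage(),
            "trace_id": getattr(record, "trace_id", ""),
        })

logger = logging.getLogger("order-service")
handler = logging.StreamHandler()
handler.setFormatter(JsonFormatter())
logger.addHandler(handler)
logger.addFilter(TraceContextFilter())
logger.setLevel(logging.INFO)

current_trace_id.set("4bf92f3577b34da6a3ce929d0e0e4736")
logger.info("checkout started")  # one JSON line, findable by trace_id
```

With this in place, finding every log line for one request across all 5 services becomes a single query on `trace_id` instead of five greps.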
Pre-Observability Metrics (Baseline)
| Metric | Value |
|---|---|
| MTTR (Mean Time To Resolution) | 4.5 hours |
| MTTD (Mean Time To Detect) | 45 minutes |
| P1 incidents/month | 8 |
| SLO compliance (99.5%) | 94% of months |
| Cross-service debug time | 2-4 hours |
| Incident cost/month (estimate) | $12,000 |
Phase 1: Auto-Instrumentation and Collector (Week 1-2)
The first phase focuses on deploying the observability infrastructure and auto-instrumenting services, without modifying application code.
```yaml
# Phase 1: Deploy OTel Collector as DaemonSet
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: otel-collector-agent
  namespace: observability
spec:
  selector:
    matchLabels:
      app: otel-agent
  template:
    metadata:
      labels:
        app: otel-agent  # must match the selector above
    spec:
      containers:
        - name: collector
          image: otel/opentelemetry-collector-contrib:0.96.0
          args: ["--config=/etc/otel/config.yaml"]
          env:
            - name: NODE_NAME
              valueFrom:
                fieldRef:
                  fieldPath: spec.nodeName
          resources:
            requests:
              cpu: 200m
              memory: 256Mi
          ports:
            - containerPort: 4317
              hostPort: 4317
---
# Auto-instrumentation for Java, Python, and Node.js services
apiVersion: opentelemetry.io/v1alpha1
kind: Instrumentation
metadata:
  name: shopflow-instrumentation
  namespace: shopflow
spec:
  exporter:
    endpoint: http://otel-collector-agent.observability:4317
  propagators:
    - tracecontext
    - baggage
  sampler:
    type: parentbased_traceidratio
    argument: "1.0"  # 100% sampling in the initial phase
  java:
    image: ghcr.io/open-telemetry/opentelemetry-operator/autoinstrumentation-java:latest
  python:
    image: ghcr.io/open-telemetry/opentelemetry-operator/autoinstrumentation-python:latest
  nodejs:
    image: ghcr.io/open-telemetry/opentelemetry-operator/autoinstrumentation-nodejs:latest
---
# Annotate deployments to opt in to auto-instrumentation
# Order Service (Java)
apiVersion: apps/v1
kind: Deployment
metadata:
  name: order-service
  namespace: shopflow
spec:
  template:
    metadata:
      annotations:
        instrumentation.opentelemetry.io/inject-java: "shopflow-instrumentation"
```
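The DaemonSet above points at a `/etc/otel/config.yaml` that is not shown. A minimal agent pipeline consistent with it might look like the following sketch; the gateway address and limiter values are assumptions, not ShopFlow's actual configuration:

```yaml
# /etc/otel/config.yaml — minimal agent pipeline (illustrative)
receivers:
  otlp:
    protocols:
      grpc:
        endpoint: 0.0.0.0:4317

processors:
  memory_limiter:
    check_interval: 1s
    limit_mib: 200
  batch:
    timeout: 5s

exporters:
  otlp:
    # Forward to a gateway Collector; this address is an assumption
    endpoint: otel-collector-gateway.observability:4317
    tls:
      insecure: true

service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [memory_limiter, batch]
      exporters: [otlp]
    metrics:
      receivers: [otlp]
      processors: [memory_limiter, batch]
      exporters: [otlp]
```

The agent-to-gateway split keeps per-node agents lightweight while centralizing heavier processing (such as the tail sampling introduced in Phase 3) in the gateway tier.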
Phase 2: Manual Instrumentation and Log Correlation (Week 3-6)
In the second phase, the team adds manual instrumentation for critical business flows and configures log-trace correlation to link logs to distributed traces.
```java
// Order Service: manual instrumentation of the checkout flow
import io.opentelemetry.api.OpenTelemetry;
import io.opentelemetry.api.common.AttributeKey;
import io.opentelemetry.api.common.Attributes;
import io.opentelemetry.api.metrics.DoubleHistogram;
import io.opentelemetry.api.metrics.LongCounter;
import io.opentelemetry.api.metrics.Meter;
import io.opentelemetry.api.trace.Span;
import io.opentelemetry.api.trace.StatusCode;
import io.opentelemetry.api.trace.Tracer;
import io.opentelemetry.context.Scope;
import org.springframework.stereotype.Service;

@Service
public class CheckoutService {

    private final Tracer tracer;
    private final LongCounter ordersCreated;
    private final DoubleHistogram orderValue;

    public CheckoutService(OpenTelemetry otel) {
        this.tracer = otel.getTracer("order-service");
        Meter meter = otel.getMeter("order-service");
        this.ordersCreated = meter.counterBuilder("shopflow.orders.created")
                .setDescription("Orders created").build();
        this.orderValue = meter.histogramBuilder("shopflow.orders.value")
                .setDescription("Order value in EUR").setUnit("EUR").build();
    }

    public Order processCheckout(CheckoutRequest req) {
        Span span = tracer.spanBuilder("checkout.process")
                .setAttribute("customer.id", req.getCustomerId())
                .setAttribute("customer.tier", req.getCustomerTier())
                .setAttribute("cart.items_count", req.getItems().size())
                .setAttribute("cart.total", req.getTotal())
                .startSpan();
        try (Scope scope = span.makeCurrent()) {
            // Validation
            Span valSpan = tracer.spanBuilder("checkout.validate").startSpan();
            try (Scope s = valSpan.makeCurrent()) {
                validateCheckout(req);
                valSpan.setStatus(StatusCode.OK);
            } finally {
                valSpan.end();
            }
            // Reserve inventory
            Span invSpan = tracer.spanBuilder("checkout.reserve-inventory")
                    .setAttribute("inventory.items", req.getItems().size())
                    .startSpan();
            try (Scope s = invSpan.makeCurrent()) {
                reserveInventory(req.getItems());
            } finally {
                invSpan.end();
            }
            // Payment
            Span paySpan = tracer.spanBuilder("checkout.payment")
                    .setAttribute("payment.method", req.getPaymentMethod())
                    .setAttribute("payment.amount", req.getTotal())
                    .startSpan();
            try (Scope s = paySpan.makeCurrent()) {
                processPayment(req);
            } finally {
                paySpan.end();
            }
            // Create order
            Order order = createOrder(req);
            span.setAttribute("order.id", order.getId());
            span.setAttribute("order.status", "created");
            // Business metrics
            ordersCreated.add(1, Attributes.of(
                    AttributeKey.stringKey("customer.tier"), req.getCustomerTier(),
                    AttributeKey.stringKey("payment.method"), req.getPaymentMethod()
            ));
            orderValue.record(req.getTotal(), Attributes.of(
                    AttributeKey.stringKey("customer.tier"), req.getCustomerTier()
            ));
            span.setStatus(StatusCode.OK);
            return order;
        } catch (Exception e) {
            span.recordException(e);
            span.setStatus(StatusCode.ERROR, e.getMessage());
            throw e;
        } finally {
            span.end();
        }
    }
}
```
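For the log-trace correlation half of this phase, the Python services (Inventory and Notification) can lean on the auto-instrumentation already deployed in Phase 1: OpenTelemetry's Python logging instrumentation supports injecting the active trace context into every log record via environment variables. A sketch of the deployment-level configuration (the exact format string is a choice, not ShopFlow's actual one):

```yaml
# Enable log-trace correlation in the auto-instrumented Python services:
# the logging instrumentation injects otelTraceID / otelSpanID into records.
env:
  - name: OTEL_PYTHON_LOG_CORRELATION
    value: "true"
  - name: OTEL_PYTHON_LOG_FORMAT
    value: "%(asctime)s %(levelname)s [trace_id=%(otelTraceID)s span_id=%(otelSpanID)s] %(message)s"
```

Once trace_id appears in every log line, Loki can be queried by trace_id directly, and Grafana's derived fields turn the value into a one-click jump from a log line to the corresponding trace in Jaeger.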
Phase 3: Dashboards, Alerting, and SLOs (Month 2-3)
In the third phase, the team creates Grafana dashboards for operational and business monitoring, configures SLO-based alerting, and optimizes sampling to reduce costs.
```yaml
# SLO-based alerts for ShopFlow
groups:
  - name: shopflow-slo-alerts
    rules:
      # SLO: 99.5% of checkout requests successful
      - alert: CheckoutSLOBreach
        expr: |
          1 - (
            sum(rate(http_server_request_duration_seconds_count{
              service="order-service",
              http_route="/api/checkout",
              http_status_code=~"2.."
            }[1h]))
            /
            sum(rate(http_server_request_duration_seconds_count{
              service="order-service",
              http_route="/api/checkout"
            }[1h]))
          ) > 0.005
        for: 5m
        labels:
          severity: critical
          slo: checkout-success-rate
        annotations:
          summary: "Checkout success rate below 99.5% SLO"
      # SLO: P99 checkout latency under 3 seconds
      - alert: CheckoutLatencySLO
        expr: |
          histogram_quantile(0.99,
            sum(rate(http_server_request_duration_seconds_bucket{
              service="order-service",
              http_route="/api/checkout"
            }[5m])) by (le)
          ) > 3
        for: 5m
        labels:
          severity: warning
          slo: checkout-latency
        annotations:
          summary: "Checkout P99 latency above 3s SLO"
      # SLO: payment success rate above 98%
      - alert: PaymentSuccessRateSLO
        expr: |
          sum(rate(shopflow_payments_total{status="success"}[1h]))
          /
          sum(rate(shopflow_payments_total[1h])) < 0.98
        for: 5m
        labels:
          severity: critical
        annotations:
          summary: "Payment success rate below 98%"
```
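The sampling optimization mentioned for this phase is done in the gateway Collector with the tail sampling processor, which buffers complete traces before deciding what to keep. A sketch consistent with the stated goal of keeping 100% of errors while cutting volume (the latency threshold and sampling percentage here are illustrative, not ShopFlow's exact values):

```yaml
# Gateway Collector: tail sampling — keep all errors and slow traces,
# probabilistically sample the rest (values are illustrative)
processors:
  tail_sampling:
    decision_wait: 10s
    policies:
      - name: keep-errors
        type: status_code
        status_code:
          status_codes: [ERROR]
      - name: keep-slow
        type: latency
        latency:
          threshold_ms: 3000
      - name: sample-the-rest
        type: probabilistic
        probabilistic:
          sampling_percentage: 15
```

Policies are OR-ed: a trace is kept if any policy matches, so every error trace and every slow trace survives regardless of the probabilistic rate applied to healthy traffic.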
Results: Before and After Metrics
After 3 months of full implementation, ShopFlow measured the following improvements:
Observability Impact on ShopFlow
| Metric | Before | After | Improvement |
|---|---|---|---|
| MTTR | 4.5 hours | 1.2 hours | -73% |
| MTTD | 45 minutes | 3 minutes | -93% |
| P1 incidents/month | 8 | 3 | -62% |
| SLO compliance | 94% | 99.2% | +5.2pp |
| Cross-service debug | 2-4 hours | 10-30 minutes | -87% |
| Incident cost/month | $12,000 | $3,200 | -73% |
Lessons Learned
The observability implementation at ShopFlow produced several useful lessons for any organization undertaking the same journey:
Key Recommendations
- Start with auto-instrumentation: 70% of the value arrives in the first month with auto-instrumentation and Collector, without modifying code
- Invest in correlation: log-trace correlation is the single improvement with the highest ROI. It reduces debug time by 90%
- Define SLOs before dashboards: SLOs guide the choice of which metrics to collect and which alerts to configure
- Don't instrument everything immediately: start with the 3-5 most critical business flows, expand gradually
- Monitor the Collector: the Collector is a critical component. If it fails, you lose all visibility
- Tail sample errors: after the initial phase (100% sampling), implement tail sampling while keeping 100% of errors
- Train the team: observability has value only if the team knows how to use the tools. Invest in Grafana, PromQL, and trace reading training
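The "monitor the Collector" recommendation is concrete: the Collector exports self-telemetry metrics that reveal when it silently drops or refuses data. A sketch of alert rules on two of those metrics (thresholds and durations are illustrative):

```yaml
# Alert when a Collector instance fails to export or refuses incoming data
groups:
  - name: otel-collector-health
    rules:
      - alert: CollectorExportFailures
        expr: rate(otelcol_exporter_send_failed_spans[5m]) > 0
        for: 10m
        labels:
          severity: critical
        annotations:
          summary: "OTel Collector is failing to export spans"
      - alert: CollectorRefusingSpans
        expr: rate(otelcol_processor_refused_spans[5m]) > 0
        for: 10m
        labels:
          severity: warning
        annotations:
          summary: "OTel Collector is refusing spans (likely memory pressure)"
```

An outage of the observability pipeline itself is the one failure these dashboards cannot show you, which is why it deserves its own alerts.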
Implementation Cost
It is important to document the observability investment cost to calculate ROI. For ShopFlow, the open source stack had a cost primarily in engineering time and infrastructure resources:
Total Implementation Cost
| Item | Estimated Cost |
|---|---|
| Engineering time (setup + instrumentation) | ~120 hours (3 engineers, ~2 weeks full-time plus part-time follow-up) |
| Collector infrastructure (DaemonSet + Gateway) | ~$200/month (CPU + memory on EKS) |
| Backend storage (Jaeger, Prometheus, Loki) | ~$350/month (EBS volumes + compute) |
| Grafana Cloud (alternative to self-hosted) | $0 (self-hosted) or ~$500/month (cloud) |
| Monthly total | ~$550/month (self-hosted) |
| Incident savings | ~$8,800/month |
| ROI | 16x (savings / cost) |
Implementation Timeline
The complete journey from idea to production took approximately 3 months, with incremental value at each phase:
Summary Timeline
Week 1: Deploy Collector, backends (Jaeger, Prometheus, Grafana), auto-instrumentation on all 5 services.
First value: distributed traces visible in Jaeger.
Week 2: Configure Collector pipeline (filtering, batching). Grafana dashboards with RED metrics for each service.
Week 3-4: Manual instrumentation of checkout and payment flows. Log-trace correlation with Loki.
First cross-service debug in 15 minutes (previously took 3 hours).
Week 5-6: Custom business metrics (orders/hour, revenue, conversion rate). Dashboard for the product team.
Month 2: SLO definition, SLO-based alerting, tail sampling. Reduced trace volume by 80% while keeping 100% of errors.
Month 3: Optimization, team training, runbook documentation. Stabilization and results measurement.
Series Conclusions
This 12-article series has covered the entire spectrum of modern observability with OpenTelemetry, from theoretical foundations (Three Pillars, monitoring vs observability) to advanced implementations (eBPF, AI observability, tail sampling) to this practical case study with real metrics.
The key messages of the series are:
- Observability is a system property, not a product to purchase. It is built through careful instrumentation and signal correlation.
- OpenTelemetry is the standard: instrument once, export anywhere. The freedom to change backends without modifying code is a strategic advantage.
- Start simple, evolve gradually: auto-instrumentation in month 1, manual instrumentation in month 2, optimization in month 3.
- Correlation is the value multiplier: linking traces, logs, and metrics reduces debug time by 90%.
- Observability has measurable ROI: reducing MTTR and incidents translates directly into cost savings and better user experience.
Observability is not a cost, it is an investment that pays for itself quickly in distributed systems. With OpenTelemetry, the standard is mature, the tools are available, and the adoption path is well documented. The best time to start is now.