Introduction: Reusing Model Knowledge
Transfer learning is the key to leveraging models pre-trained on massive datasets without needing the computational resources to train from scratch. Models like BERT and GPT were pre-trained on corpora ranging from billions to hundreds of billions of tokens, capturing deep language understanding that can be transferred to specific tasks with limited data and compute.
In this article we will explore modern fine-tuning strategies, prompt engineering, Retrieval-Augmented Generation (RAG), and the Hugging Face ecosystem, with comparisons between open-source models like Llama, Mistral, and Falcon.
What You Will Learn
- BERT: bidirectional pre-training and fine-tuning for understanding tasks
- GPT: auto-regressive generation and in-context learning
- Fine-tuning strategies: full, LoRA, adapters, and QLoRA
- Prompt engineering: techniques for better outputs
- RAG: combining LLMs with search for accurate answers
- Open-source models: Llama, Mistral, Falcon - when to use which
- Hugging Face Hub: the pre-trained model ecosystem
BERT: Bidirectional Text Understanding
BERT (Bidirectional Encoder Representations from Transformers) revolutionized NLP by demonstrating that bidirectional pre-training produces extraordinarily rich language representations. During pre-training, BERT uses two objectives:
- Masked Language Modeling (MLM): 15% of tokens are masked and the model must predict them from bidirectional context
- Next Sentence Prediction (NSP): the model predicts whether two sentences are consecutive in the original text
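The MLM objective above can be sketched in a few lines. This is a toy illustration of how training examples are constructed (the function name and tiny vocabulary are invented for the example); it follows BERT's published 80/10/10 rule: of the selected tokens, 80% become [MASK], 10% become a random token, and 10% are left unchanged.

```python
import random

def make_mlm_example(tokens, mask_prob=0.15, seed=0):
    """Toy sketch of BERT's MLM masking with the 80/10/10 rule."""
    rng = random.Random(seed)
    vocab = ["the", "movie", "was", "great", "awful", "plot"]  # toy vocab
    inputs, targets = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            targets.append(tok)       # the model must predict the original
            roll = rng.random()
            if roll < 0.8:
                inputs.append("[MASK]")      # 80%: replace with [MASK]
            elif roll < 0.9:
                inputs.append(rng.choice(vocab))  # 10%: random token
            else:
                inputs.append(tok)           # 10%: keep unchanged
        else:
            targets.append(None)      # no loss on unselected positions
            inputs.append(tok)
    return inputs, targets

inp, tgt = make_mlm_example("the plot of the movie was great".split())
print(inp)
print(tgt)
```

The loss is computed only on positions where the target is not `None`, which is why the bidirectional context around each mask is so informative.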
For fine-tuning, simply add a classification layer on top of BERT's pooled [CLS] output and train the entire model on a few thousand labeled examples:
```python
from transformers import BertTokenizer, BertForSequenceClassification
from transformers import Trainer, TrainingArguments
import torch

# Load BERT for sentiment classification
tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
model = BertForSequenceClassification.from_pretrained(
    'bert-base-uncased',
    num_labels=2  # positive/negative
)

# Tokenize data
texts = ["This movie is great!", "Terrible waste of time."]
labels = [1, 0]  # 1=positive, 0=negative
inputs = tokenizer(texts, padding=True, truncation=True,
                   max_length=128, return_tensors="pt")
inputs['labels'] = torch.tensor(labels)

# Forward pass
outputs = model(**inputs)
print(f"Loss: {outputs.loss:.4f}")
print(f"Logits: {outputs.logits}")

# Fine-tuning with the Trainer API
training_args = TrainingArguments(
    output_dir='./results',
    num_train_epochs=3,
    per_device_train_batch_size=16,
    learning_rate=2e-5,
    weight_decay=0.01,
    warmup_steps=100,
    evaluation_strategy="epoch"
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,  # a tokenized datasets.Dataset
    eval_dataset=eval_dataset,
)
trainer.train()
```
Efficient Fine-Tuning: LoRA and QLoRA
Full fine-tuning of models with billions of parameters requires enormous resources. Parameter-Efficient Fine-Tuning (PEFT) allows adapting the model by modifying only a small fraction of parameters:
LoRA (Low-Rank Adaptation)
LoRA freezes the original model weights and adds trainable low-rank matrices alongside attention layers. It typically modifies less than 1% of total parameters, achieving performance comparable to full fine-tuning.
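The mechanics are simple enough to show directly: the frozen weight W is augmented with a low-rank update, so the layer computes Wx + (α/r)·BAx, where B is initialized to zero so training starts exactly at the pre-trained model. Below is a minimal numpy sketch (the dimensions match a Llama-style 4096-wide projection; the variable names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
d, r = 4096, 16                          # hidden size, LoRA rank
W = rng.standard_normal((d, d))          # frozen pre-trained weight
A = rng.standard_normal((r, d)) * 0.01   # trainable, shape (r, d)
B = np.zeros((d, r))                     # trainable, initialized to zero
alpha = 32                               # scaling factor

x = rng.standard_normal(d)
# LoRA forward: frozen path plus the scaled low-rank update B @ A
y = W @ x + (alpha / r) * (B @ (A @ x))

# With B = 0 the adapted layer starts exactly at the pre-trained one
assert np.allclose(y, W @ x)

# Parameter comparison: full layer vs. LoRA update
print(f"Full: {d*d:,} params, LoRA: {2*d*r:,} params "
      f"({100*2*d*r/(d*d):.2f}%)")
# Full: 16,777,216 params, LoRA: 131,072 params (0.78%)
```

After training, B @ A can be merged back into W, so inference adds no latency.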
QLoRA
QLoRA combines LoRA with 4-bit quantization, enabling fine-tuning of 65B parameter models on a single GPU with 48GB VRAM. It uses the NF4 (NormalFloat 4-bit) data type and double quantization for maximum efficiency.
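To build intuition for the 4-bit storage, here is a toy absmax quantizer in numpy. This is only an illustration of the idea: it uses 16 uniformly spaced levels, whereas real NF4 places its 16 levels at quantiles of a normal distribution (and QLoRA additionally quantizes the per-block scales themselves, the "double quantization").

```python
import numpy as np

LEVELS = np.linspace(-1, 1, 16)           # 16 representable values (4 bits)

def quantize_4bit(w):
    """Toy absmax 4-bit quantization of a block of weights."""
    scale = np.abs(w).max()               # one fp scale per block
    idx = np.abs(w[:, None] / scale - LEVELS[None, :]).argmin(axis=1)
    return idx.astype(np.uint8), scale    # 4-bit codes + the scale

def dequantize_4bit(idx, scale):
    return LEVELS[idx] * scale

rng = np.random.default_rng(0)
w = rng.standard_normal(1024) * 0.02      # a block of weights
codes, scale = quantize_4bit(w)
w_hat = dequantize_4bit(codes, scale)
print(f"Max abs error: {np.abs(w - w_hat).max():.5f}")
# Storage: 4 bits per weight + one scale per block, vs. 16/32 bits per weight
```

During QLoRA fine-tuning the frozen base weights stay in this compressed form and are dequantized on the fly, while only the LoRA matrices are trained in full precision.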
```python
from peft import LoraConfig, get_peft_model, TaskType
from transformers import AutoModelForCausalLM, AutoTokenizer

# LoRA configuration
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=16,              # Rank of the LoRA matrices
    lora_alpha=32,     # Scaling factor
    lora_dropout=0.1,
    target_modules=["q_proj", "v_proj", "k_proj", "o_proj"],
    bias="none"
)

# Apply LoRA to the model (base weights stay frozen)
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
peft_model = get_peft_model(model, lora_config)

# Count trainable parameters
trainable = sum(p.numel() for p in peft_model.parameters() if p.requires_grad)
total = sum(p.numel() for p in peft_model.parameters())
print(f"Trainable: {trainable:,} / {total:,} "
      f"({100*trainable/total:.2f}%)")
# Output: well under 1% of the parameters are trainable
```
Prompt Engineering: The Art of Communicating with LLMs
Prompt engineering is the practice of formulating instructions that guide the model toward the desired output without modifying its weights. Key techniques include:
- Few-shot learning: providing worked examples directly in the prompt
- Chain-of-thought: asking the model to reason step by step before answering
- Role prompting: assigning the model a specific role or persona
- Structured output: requesting a specific format such as JSON
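These techniques compose naturally in a single prompt. The sketch below assembles one as a plain string (the review text and labels are invented for the example; sending the prompt to an actual model is left out):

```python
# Combining role prompting, few-shot examples, chain-of-thought,
# and structured output in one prompt string.
examples = [
    ("I loved every minute of it.", "positive"),
    ("The plot made no sense.", "negative"),
]

# Few-shot: show the model worked examples
few_shot = "\n\n".join(f"Review: {t}\nSentiment: {s}" for t, s in examples)

prompt = (
    "You are a careful sentiment classifier.\n\n"   # role prompting
    f"{few_shot}\n\n"
    "Review: The acting was superb but the ending fell flat.\n"
    # Chain-of-thought plus structured output:
    'Think step by step, then answer with JSON: {"sentiment": "..."}'
)
print(prompt)
```

The same template idea underlies most prompt libraries: the prompt is just a string, and the engineering lies in what you put into it.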
RAG: Retrieval-Augmented Generation
RAG combines the generative capability of LLMs with a search system to provide accurate answers based on specific documents. Instead of relying solely on knowledge memorized during pre-training, the model receives relevant context retrieved from a document database.
The RAG process consists of three phases:
- Indexing: documents are split into chunks and transformed into vector embeddings
- Retrieval: given a query, the most similar chunks are retrieved via similarity search
- Generation: retrieved chunks are inserted into the prompt as context for the LLM
```python
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.vectorstores import FAISS

# 1. Document splitting
text_splitter = RecursiveCharacterTextSplitter(
    chunk_size=500,
    chunk_overlap=50,
    separators=["\n\n", "\n", ". ", " "]
)
chunks = text_splitter.split_text(document_text)  # document_text: your source text

# 2. Create embeddings and vector store
embeddings = HuggingFaceEmbeddings(
    model_name="sentence-transformers/all-MiniLM-L6-v2"
)
vector_store = FAISS.from_texts(chunks, embeddings)

# 3. Retrieval and generation
query = "How does transfer learning work?"
relevant_docs = vector_store.similarity_search(query, k=3)

# Build the prompt with the retrieved context
context = "\n".join(doc.page_content for doc in relevant_docs)
prompt = f"""Based on the following context, answer the question.

Context:
{context}

Question: {query}

Answer:"""
```
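The retrieval phase can also be seen in miniature without any dependencies. Below, a toy bag-of-words counter stands in for the sentence-transformer embedding, and cosine similarity does the ranking, exactly the role FAISS plays above (chunk texts and function names are invented for the example):

```python
import math
from collections import Counter

def embed(text):
    """Toy bag-of-words 'embedding' standing in for a sentence-transformer."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

chunks = [
    "Transfer learning reuses a pre-trained model on a new task.",
    "FAISS performs fast similarity search over dense vectors.",
    "Bananas are rich in potassium.",
]
query = "How does transfer learning work?"

# Rank chunks by similarity to the query, most relevant first
ranked = sorted(chunks, key=lambda c: cosine(embed(query), embed(c)),
                reverse=True)
print(ranked[0])
# The top-ranked chunk is what gets inserted into the LLM prompt as context
```

Real embeddings capture meaning rather than word overlap, but the pipeline shape (embed, rank, insert into the prompt) is identical.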
Open-Source Models: Llama, Mistral, Falcon
The open-source model ecosystem has exploded, offering competitive alternatives to proprietary models:
- Llama (Meta): model family from 7B to 70B parameters, excellent for fine-tuning and on-premise deployment. Llama 3 achieves competitive performance with GPT-3.5
- Mistral: efficient models with innovative architecture (Sliding Window Attention, Mixture of Experts). Mistral 7B outperforms Llama 2 13B on many benchmarks
- Falcon: trained on high-quality dataset (RefinedWeb), offers good zero-shot performance
The choice depends on the use case: for general text generation, Llama 3 is often the best choice; for efficiency on limited resources, Mistral 7B is ideal; for specific tasks, fine-tuning with LoRA on any of these models produces excellent results.
Hugging Face: The Complete Ecosystem
Hugging Face has become the reference point for deep learning NLP, offering a complete ecosystem:
- Model Hub: over 500,000 pre-trained models, downloadable with a single line of code
- Transformers Library: unified APIs for all models (BERT, GPT, T5, Llama, etc.)
- Datasets: thousands of datasets for training and evaluation
- Trainer API: optimized training loop with distributed training, mixed precision, gradient accumulation
- Spaces: free hosting for ML demos and apps
Next Steps in the Series
- In the next article we will explore TinyML and Edge AI
- We will see how to deploy deep learning models on embedded devices and smartphones
- We will analyze quantization, pruning, and knowledge distillation for model compression