Transparent methodology
Each dataset declares its upstream sources, selection criteria, applied transformations and extraction date. Immutable READ ME+CHANGE LOG ensures reproducibility of the results.
Italian open datasets with transparent methodology, semver versioning, optional Zenodo DOIs and DCAT-AP 2.1 , plus dataset.schema.org metadata for automated indexing on the Google Dataset Search engine.
Each dataset declares its upstream sources, selection criteria, applied transformations and extraction date. Immutable READ ME+CHANGE LOG ensures reproducibility of the results.
Major for changes in the scheme, minor for added data, patch for fixes. Each release has a SHA-256 checksum that is immutable and can be cited over time.
DOI on optional for stable versions: APA citation, BibTex and DataCite ready to use with papers, theses and reviewed articles.
Downloadable data available as JSON and CSV files, including versions and DOIs when applicable.
Service temporarily unavailable
Three Quick-Starts for common Data Analysis environments: each dataset exposes a RESTful API and Static Files.
import pandas as pd
import requests
r = requests.get(
"https://federicocalo.dev/api/v1/open-data/"
"musei-italia/versions/1.0.0/data.json"
)
df = pd.DataFrame(r.json())
print(df.head())library(jsonlite)
df <- fromJSON(paste0(
"https://federicocalo.dev/api/v1/open-data/",
"musei-italia/versions/1.0.0/data.json"
))
head(df)# Download CSV con versioning
curl -L -o musei-italia.csv \
"https://federicocalo.dev/api/v1/open-data/\
musei-italia/versions/1.0.0/data.csv"
# Apri in Excel: Dati → Da testo/CSVLicenses, citations, updates. If something is missing open an issue or write to [email protected] Federico Calo dev.
Yes. My datasets are distributed under open licenses (CC BY 4.0, ODbl 1.0 or CC0-1.0), which allow commercial use. The only constraint of attribution licences (CC-BY, ODbL) is to cite the original source.
Each data set includes a "Citation" block in the CHANGELOG with an APA and BibTex format. When available, use a persistent DOIs from Zenodo because it is academically recognized.
The editorial cycle is quarterly. Major release (v2.0) when the schema changes, minor (v1.1) for added data, patches (v1.0.1) for fixes. Each version is immutable and citeable through a SHA-256 checksum.
Native JSON for the RESTful API, CSV for table analysis and data visualization, plus metadata in DCAT-AP 2.1 format with additional schema.org/Dataset properties encoded as RDF-JSON to enable automated indexing on both Google Dataset Search and the European Data Portal.
Each dataset has a "Methodology" section in the README that specifies: upstream sources (with URLs), selection criteria, applied transformations and extraction date. SemVer versioning plus an immutable CHANGELONG ensures reproducibility.
Open an issue on the Federico Calò's portfolio repository at GitHub with a "open-data" label. For precise corrections (errors or outliers) pull requests to source files are welcome and will be reviewed within seven days.
All datasets are published with an openly declared license (CC-BY 4.0, ODbL 1.0 or CC0 depending on compatibility with upstream sources). Each version has a SHA-256 checksum, immutable CHANGELOG and when available academic citation via Zenodo's DOI.
Complete policy03_pubblicazione_dataset.md · 04_licenze_attribuzione.md
If you work with Italian open data and are looking for a platform already configured with DCAT-AP metadata, DOIs on Zenodo, and schema.org, let's evaluate together if it can find its place here.