SDMX has been a real success for the harmonization of the IT infrastructure for theexchange of data and metadata.
Anyway, SDMX is a real puzzle for those who have (only) a statistical background (interms of concepts, organization of a data structure, …).
GSIM could help a lot a statistician in using SDMX, assigning a concrete statistical role toconcepts before their use in a DSD.
The use of GSIM concepts before their use in a DSD helps in harmonizing the descriptionof a data cube.
Among the concepts already available in GSIM, an additional concept (the “DataContent”) could be useful in order to feed in a standard and complete way a Measure of aData Structure (of macrodata).
This is what we have done in Istat. The corporate DWH (I.Stat) has almost 3000 “datacontents”. In SUM it is possible to search data through different facets:
Reference population of the data
Numerical variables used for the production of a data content
Categorical variables used to cross cut data contents
Categories of a categorical variable used in data structures
Furthermore it is easy to reconstruct the relationships between statistical programs (reuseof data for computation of other data)
This year we are including micro data (for the data collection and validation steps)
Mauro Scanu – Geneva, UNECE - 5 May 2015