Back to top

Labour input, gross wages and salaries

DownloadPrint

National Reference Metadata in Euro SDMX Metadata Structure (ESMS)

Compiling agency: Italian National Institute of Statistics (ISTAT)Directorate for Social Statistics and Population Census

Need help? Contact the Eurostat user support

Indices of Wages and Salaries.

These indicators are produced by the quarterly Oros survey, through the integration of monthly Social Security administrative data (collected by the Social Security Institute-INPS), used mainly for small and medium enterprises (SMEs) and the monthly survey data on labour input and labour costs on the large enterprises used for large enterprises (LES).

12 June 2025

Wages and Salaries are defined in coherence with the Commission Implementing Regulation 2020/1197 (Annex IV).

Reporting and observation unit: Enterprise. For some large enterprises reporting and observation units are KAUs.

Note: kind-of-activity unit.

All enterprises with at least one employee which were active in the reference quarter in the NACE economic activities of the coverage sectors.

In 2023 according to the last available Business Register version, these enterprises were, on average about 1,4 millions, distributed as follows:

  • 222,119 for  Industrial sector
  • 212,597 for Construction sector
  • 373,372  for Retail sector
  • 555,615 for Services sector

(source: BR-ASIA)

Geographically the STS indicators on Wages and Salaries cover the whole country. Activities outside the country are excluded.

Quarter.

Assessing accuracy on the indicator of Wages and Salaries, compiled using admin data integrated at micro level with survey data implies taking into account only non-sampling errors. Infact, these data sources refer to the census of the target units.

The massive quantity of administrative micro data (used for SMEs) covers almost the 94% in terms of number of jobs. However, it requires a very careful processing phase of E&I, since if the non-response for the total economy is not relevant, it is not the same for the different economic sectors.  At this stage a micro level model-based process of imputation is performed, so for an estimate of the accuracy the main components to take into account are the estimates of the parameters, and the residual bias for some sectors for which the model does not fit well, for which a macro adjustment  based on the time series analysis is performed. Another source of bias comes from the definition of the statistical population. If from one hand the use of auxiliary information from the Business Register and from the more updated Tax Register helps defining the list of target units in the Social Security Register reducing problems of over-coverage, on the other hand the delay of the Business Register's updating date implies an additional bias due to the misclassification of the units. Survey data (used for LES) refer to enterprises with more than 500 employees at the base year 2021, this group adds up to about 1,630 enterprises covering about 20% of total employees in Italy in the STS sectors. Each one of these firms has a considerable influence on the estimates. Editing and imputation on this data are global (all units are checked) and performed by very expert personnel, assuring very high quality data and fast management of changes in units legal asset, non-responses, errors, etc. When considering accuracy, the micro level integration between the administrative records and the Large Enterprises Survey data must also be considered. Record linkage and computation of harmonized variables are the main processes to be taken into account.

Index.

Once admin data on DMs are acquired (about 10 million records per month) the monthly basic variables are calculated at unit level through a complex pre-treatment procedure based on a data base of metadata on rules, laws and regulations on social security contributions declaration. At the end of the process information on the main target variables referred to each unit identification is summarized in a single record (about 1.4 million records per month). Finally, quarterly variables are calculated as simple means of the related monthly statistical variables retrieved as above mentioned.  Once the main quarterly variables have been derived from DMs, the survey target population has to be outlined, excluding the out-of-scope units  (public sector or whose economic activity is not included in the target Nace). This operation is performed using the auxiliary information drawn from the Business Register (ASIA) and from the Tax Register. 

At the scheduled time for the acquisition of the provisional population, it may happens that some DM are missing due to delays depending on firms liability or administrative system flaws. These missing units (late reporters) usually correspond to about 2-5% of the final population. That means that in the admin data an almost complete coverage of the target population occurs. Nevertheless, late reporting has a non-negligible impact in the estimates of variables expressed as totals, like Wages and salaries: even small differences from the final values could give misleading signals on the short dynamic of the target variables. In the Oros process, the jobs level estimate is massively adjusted to correct for the incompleteness of the preliminary data file due to the late reporters and Wages and salaries are adjusted proportionally under the hypothesis that the per-capita wages are not affected by late reporting. Micro imputation is the approach used in this phase of the process. The micro imputation procedure consists of two steps: a) the identification stage, that is the definition of a list of non-reporting units to impute; b) the imputation stage, that is the assignment of imputed values to the list of units above defined. Due to the absence of a theoretical list of active units for the admin data, a list of non-reporting active units has to be predicted. It’s derived on the basis of the structure of the preliminary admin data itself: based on the observation of the reporting patterns of the units. In the second imputation phase, the value of the units identified as active (expected late reporters) is imputed. The availability of a large quantity of micro data at longitudinal level on non-reporting units and considering the inertial trend of employment, a natural choice to reconstruct the missing values of jobs is to use a general regression model, where only the lagged values of the same variable are used as auxiliary variables, in detail, job values on the previous month and on the same month of the previous year. The adjusted estimates on jobs are applied to the per-capita wages calculated before imputation to get total wages.

OROS quarterly indicators derives from the integration of administrative and Large Enterprises survey (LES) data. The integration process aims at producing quarterly micro data by replacing the admin source with the large firms survey data, for the overlapping enterprises. This integration improves the estimates’ quality given both the higher quality of survey data and their adherence to statistical contents and of the overall admin-LES (Oros) estimates in case of missing response (MR), considered the relevant influence of large enterprises especially in some economic activity sectors. Producing quarterly integrated admin-LES microdata implies identifying and excluding from the admin source the enterprises belonging to the LES survey and estimating coherent variables from data collected for different purposes. This process implies taking into account linkage and variables harmonization issues. The main phase of the integration process is the definition of quarterly lists of common units in both admin source and LES. To  identify correctly this sub-population avoiding double counting, record linkage and micro-integration are implemented. For units belonging to the defined quarterly list, Oros economic variables are replaced with the LES ones. The higher detail of LES variables on employment allows to perform coherent estimates from the two sources at the basis of Oros. To this scope an harmonization procedure according to the statistical requirements is carried out. Before the aggregation of the main variables, units not belonging to LES (already including E&I operations from Les survey) undergo an editing and imputation procedure. This operation is aimed at identifying and correcting missing response and/or outliers that could generate bias in the estimates both in preliminary and final data. It is based on selective criteria: influent units (less than 50 units a quarter) are automatically detected through functional relations on information on the same quarter of previous year (t-4), according to established cut-off thresholds. Specifically, the quarter-on-quarter variations are the target variables of the selective editing procedure,  to better detect outlier incorporating seasonality, contributing to reduce the number of influent units. The most anomalous values or missing response are interactively analyzed and, if necessary, imputed by an automatic deterministic procedure.

For furter details on the methodology at the basis of the Oros process see:

Istat, 2019. La Rilevazione trimestrale Oros su occupazione e costo del lavoro: indicatori e metodologie. Letture Statistiche - Metodi  (in english: "The Oros quarterly survey on employment and labor costs: indicators and methodologies", Statistical readings – Methods - Istat (ISTAT archive).

Rapiti F.M., F. Ceccato, M.C. Congia, S. Pacini e D. Tuzi. 2010. What have we learned in almost 10-years experience in dealing with administrative data for short term employment and wages indicators? Paper presented at the seminar: Using administrative data in the production of business statistics”. Roma, 18-19 marzo (Ine session2).

Data used to compile Wages and Salaries indices are drawn from the Oros and LES surveys.

The Oros Survey, based mainly on the administrative data on the Social Security contributions declarations (DM) collected by INPS (National Social Security Institute), is aimed at covering all size classes without increasing the statistical burden on respondents. The survey has been designed to satisfy the EU requirements on short-term statistics (EBS Regulation n.2019/2152 and LCI-Labour Cost Index Regulations n.450/2003). INPS data are integrated with the monthly Istat Survey on Labour input variables in large enterprises (LES-Large Enterprise Survey). The data from INPS cover the population of SMEs and the data of the LES cover the population of large enterprises. Some large enterprises not covered by LES are covered by the INPS data. The main source for the NACE code is the Business Register (BR) and, for residual units also the Tax Register (TR). Both the BR and the TR give also information on the legal characterization of the units, useful to restrict administrative data to the Oros target population.

Quarterly (to Eurostat).

Preliminary: before 90 days after the end of the reference quarter, usually 60 days anticipating the deadline (90 days after the reference quarter).

Final: about 1 year and 60 days after the end of the reference quarter.

Wages and Salaries indicators are defined in coherence with EBS Regulation. The data cover the entire national territory.

The methodology at the basis of the estimation of the Wages and Salaries has been subjected to several innovations over the time, implying discontinuities in the time series. The same effect occurs in the occasion of the transition to a new reference base, when the most relevant innovations are introduced, or when non-ordinary interventions are performed. In all these cases, to guarantee coherence of all the occurrences of the time series, linking factors are normally calculated on overlapping periods and applied to the quarters estimated in old methodology.