Back to top
National reference metadata

Luxembourg

Reference metadata describe statistical concepts and methodologies used for the collection and generation of data. They provide information on data quality and, since they are strongly content-oriented, assist users in interpreting the data. Reference metadata, unlike structural metadata, can be decoupled from the data.

For more information, please consult our metadata website section.

Close

Structural business statistics (sbs)

National Reference Metadata in SBS Euro-SDMX Metadata Structure (ESMS) - from reference year 2021 onwards (ESSBS21)

Compiling agency: STATEC

Need help? Contact the Eurostat user support


Short metadata
Full metadata

Structural business statistics (SBS) describes the structure, conduct and performance of economic activities, down to the most detailed activity level (several hundred economic sectors). SBS covers all activities of the business economy with the exception of agricultural activities, public administration and (largely) non-market services such as education and health. Main characteristics (variables) of the SBS data category:

• "Business demographic" variables (e.g. Number of active enterprises)

• "Output related" variables (e.g. Net turnover, Value added)

• "Input related" variables: labour input (e.g. Number of employees and self-employed persons, Hours worked by employees); goods and services input (e.g. Purchases of goods and services); capital input (e.g. Gross investments)

Business services statistics (BS) collection contains harmonised statistics on business services. From 2008 onwards BS become part of the regular mandatory annual data collection of SBS. The BS’s data requirement includes variable “Turnover” broken down by products and by type of residence of client. 

The annual regional statistics collection includes three characteristics due by NUTS-2 country region and detailed on NACE Rev 2division level (2-digits).

9 August 2024

SBS constitutes an important and integrated part of the new European Business Statistics Regulation N° 2152/2019

Data requirements, simplifications and technical definitions are defined in Commission Implementing Regulation (EU) 2020/1197

Enterprise (national level); 

Local unit (regional level)

Only active market units are included.

Active units encompass the following units:

* units with a non-zero value of turnover or non-zero employees;

* units which are active according to the SBS survey and not properly recorded in the administrative sources (e.g. VAT exempt units without employment);

The market status of a unit is defined in terms of the national accounts institutional sectors. Sectors 12.7, 13 and 15 are excluded from the target population. The following units are specifically excluded:

* temporary partnerships;

* private persons producing energy using photovoltaic installation;

* artificial subsidiaries and other special purpose entities dealing almost exclusively with group affiliates.

There are no exclusions in terms of NACE. However, branch H.50 (water transport) is only available as a branch estimate and not as micro-data. Consequently, the size class dimension is not reliable for this activity branch.

Luxembourg economic territory.

2022

The reference year corresponds with the calendar year.

For each company with a different financial year than the calendar year, we take a decision regarding the SBS reference period. Normally, this decision is taken in such a way that at least 6 months of the financial year for a given company are included in the corresponding SBS reference period. Once that decision is taken, it is maintained every subsequent year for comparability reasons.

Sampling error is mainly relevant for small size classes but also more generally for certain characteristics (e.g. unpaid persons employed). This error has most often a local impact. Since 2015, the sampling error has been reduced due to the integration of administrative source on accounting data.

Coverage errors are relevant for activities which carry little to no employment and which are not or only partially covered by VAT data. Examples of such activities are real estate activities, parts of the financial and health sectors. The failure to identify special purpose entities early on is another potential source of coverage error. Overall, the impact of coverage errors is local.

Non-response error is generally low due to imputation strategies. For certain activities (e.g. construction, real estate), imputation is nonetheless more challenging and can thus locally generate significant misstatements.

Measurement errors pertain mainly but not only to errors in the NACE classification and can be of significant impact locally as well as globally. NACE errors are the most difficult to deal with because nothing can be done other than accepting them and, if necessary, producing revised data for the past. NACE errors are the main cause for documented breaks in the time series.

Errors due to the misinterpretation of or incomplete information on globalisation phenomena are rare but can have significant global impacts due to the bigger figures of multinational enterprises.

Massive imputation via administrative sources is another source of measurement error: understated investment variables, slightly understated personnel costs (until reference year 2014 included), and misstated profit&loss variables as of the reference year 2015 (new data source). However, reasonable data processing and analysis procedures are in place to ensure that the most significant errors are captured before dissemination. Therefore, such errors bear only a local significance.

Data processing errors are rare but may have a significant impact when they go undetected. The aforementioned data processing and analysis procedures are normally sufficient but in the past a few rare cases of such errors slipped into published data.

Preliminary results for SBS are mostly burdened by undercoverage errors (insufficient VAT data available for units with no employment), data processing errors (due to imputation) and measurement errors (mainly due to NACE misclassifications). The period from the production of preliminary results to the production of final results is approximately 9 months. Since reference year 2019, an administrative source on accounting data is additionally used to complete the SBS population, including the one used for preliminary results.

• Number of enterprises and number of local units are expressed in units.

• Monetary data are expressed in millions of €.

• Employment variables are expressed in units.

• Per head values are expressed in thousands of € per head. 

Ratios are expressed in percentages.

Unit non-response imputation

Please refer to chapter 13.

Item non-response imputation

Given that SBS data are above all quantitative and that they also serve the needs of national accounts, it is very difficult to identify item non-response. This is in particular true for units who are included only once in the sample in a given time series.

For the bigger companies, we are able to follow their structure over a larger period and thus to identify item non-response in an easier way. Item non-response is therefore dealt with on a case-by-case basis during editing procedures.

Missing or erroneous information in the administrative source

For some economic activities, VAT data are not available. No imputation is done on administrative data. If employment data are significant for a given unit, it is very likely included in the survey data. Consequently, the occurrence of missing data in the VAT and social security administrative source is rare and of low impact. In the rare case of missing or the more frequent case of erroneous administrative data, adjustments are performed in a dedicated working area.

For the administrative accounting data source (available as of 2015), any missing data is imputed using the either cold-deck ratio imputation or hot-deck ratio imputation (grossing up).

Grossing up

When grossing up survey data, we always take into account ancillary data available in the administrative sources, i.e. the ancillary variables “turnover” and “number of employees”, and not only the sample design. The aforementioned ancillary variables most often have a linear correlation with the variables of interest in SBS. Consequently, the ratio estimator is used for grossing up.

With the variable of interest being available only for the sample (y) and with the ancillary variable being available both for the sample (x) and the SBS target population (X), the ratio estimator of the variable of interest , expressed as an average, can be written:

 2024-09-16_105647

The ratio R has been formulated in the following way:

 2024-09-16_105727

If for a given stratum no unit is available in the sample, i.e. neither for the historical periods nor for the reference period, the units in this stratum are grouped together with another stratum for which sample data are available and are then grossed up.

Variables "Number of enterprises", "Personnel costs", "Wages", "Social security charges", "Number of employees", "Number of full-time equivalent employees" and "Number of hours worked by employees" are directly derived from administrative source data and therefore do not need to be grossed up (as from reference year 2009). If prior year survey data are available, a cold-deck ratio imputation is performed – this significantly reduces the bias in case there are permanent differences between the admin source and the survey data for a given unit. For the other units, administrative data are simply pasted into the data set.

Please also refer to chapter 13.

Statistical survey combined with administrative sources

The annual structural business survey used to be at the heart of the data collection. While this is still true today for bigger firms, for smaller entities, the administrative accounting sources (mainly Central Balance sheet Office) have progressively taken over since reference year 2015. This has enabled to alleviate the statistical reporting burden on smaller businesses.

Even with the partial shift from survey to administrative sources, the data collection still requires a sample based on a stratified random sampling design. With the extension of the activity coverage as of reference year 2021, a new sample strategy had to be devised.

Using strata based on the number of employees and turnover applied on the population, a random stratified sample is drawn and then divided into two subsamples:

* an exhaustive survey subsample: any legal units employing either more than 50 employees or having declared a turnover excluding VAT of more than 12 million EUR per annum are selected every year. For certain activities in the financial sector, the thresholds are increased for survey cost reasons. Any legal units linked to the aforementioned units (e.g. group or enterprise links) are also selected to form part of the survey every year. To cover the needs of national accounts and SBS, the survey sample size is roughly equal 3000 reporting units;

* a random administrative accounting source subsample: any legal units having both less than 50 employees and a turnover of less than 12 million EUR are drawn randomly by stratum and exhaustively when the stratum only consists of singletons. Instead of being surveyed, these units are later on imputed using detailed administrative accounting data available for the reference year. The sample design allows for the possibility to use these units for handling administrative "non-response", that is units which have not yet reported their accounting data at the statistical production moment. The sample size is roughly equal to 7000 reporting units;

Administrative accounting data may be available for any firm in the population but outside of the aforementioned sample. When such data is available, these units, which are not part of the sample and which can therefore only represent themselves in the total population, are removed from the subpopulation to gross up, thus contributing to the reduction of the sampling variance due to grossing up.

Credit establishments and insurance companies are measured entirely through administrative sources and only abide by the guidelines here above for investment variables, which are entirely measured by survey.

The stratification used in the sample design does not take into account the size classes defined in the transmission format of the EBS SBS regulation. However, mass imputation procedures exist on a micro data level, so that data can be broken down according to any spanning variable defined a posteriori, albeit with an increased risk of bias:

* since 2015, detailed accounting data from the Central Balance Sheet Office have been used to impute units which are not part of the sample or which no recent survey data are available (with a sample weight of 1) - this source covers balance sheet and profit & loss data. In recent years, the data source accounts for more than half of the SBS population;

* since 2005, social security data have been used to impute the number of employees, personnel costs and number of hours worked by employees;

* detailed VAT data are used to impute investment variables;

* VAT and social security key indicators are used to impute non-respondent units using the cold-deck ratio imputation method.

Data sources used for the population frame or target population

The statistical business register is the primary source for building the target population. The said register is enriched by key data from social security and VAT, and since 2019 from the administrative accounting data. These data are used to determine the activity status of any enterprise. Only active units are retained. There is no threshold on employment and turnover, except that either has to be positive for the reference year in question.

In addition, for units having no employment and not covered by either VAT or accounting data source, the survey helps to identify a few active units. They are added to the population frame.

NACE classification

The difference between principal and secondary activities is directly available in the business register. The approach to identify principal activities is the top-down approach. Stability rules are in place to avoid yearly shifting between principal and secondary activities.

The update frequency of the business register is daily. However, for SBS purposes the extract of the business register is frozen at a given date. From that moment on, changes in NACE are only performed on an ad hoc basis and through a dedicated working area.

Relationship between the reporting unit and the enterprise

In the SBS survey, the reporting unit is the legal unit. However, for a very few legal units, the data are collected separately for the economic subdivisions of the legal unit. Moreover, local unit data are collected at legal unit level for units being part of the SBS survey. In the administrative sources, the reporting unit is the legal unit.

Finally, the statistical unit "enterprise" can be a legal unit or a combination of legal units. Enterprises which are formed by a group of legal units are not the general case in Luxembourg but their occurrence is still significant, in particular due to many legal units specialized in ancillary activities for other legal units belonging to the same enterprise group.

Annual.

Time between end of reference period and national data dissemination: t+22 to t+24 months

Not applicable.

Length of comparable time series

1995 - 2002 (KAU concept)

2003 - 2009 (enterprise concept NACE Rev.1.1)

2005 - 2022 (enterprise concept NACE Rev.2)

Important events in the time series

revision of SBS data series was completed in summer 2014 for the reference years 2005 to 2010 included. The revision was performed using NACE Rev.2 and included the revision of the profiling of some major players. The impact of the revision was significant for a few activities.

As of 2015, Central Balance Sheet Office data are used to impute units which are not part of the sample or which no recent survey data are available. The impact of the data integration is an overall improvement of the sampling error, potentially traded for an increased measurement error at a local level.

As of 2019 reference year, a rolling revision policy has been introduced. After the official production of reference year T (e.g. 2019) published during the year T+2 (e.g. 2021), a revision for the reference year T data is performed during T+3 (e.g. 2022). The impact of this revision not only removes local breaks in series due to temporary errors in NACE but also introduces a less desirable side-effect: due to the consideration of late VAT data during the revision, there is a potentially significant upward impact on the number of smaller active enterprises and the number of persons employed (excluding employees).

As of 2021 reference year, the SBS have been extended to NACE sections P, Q, R and K as well as NACE division S96, in conformity with the EBS regulation. These more recently added activity branches present certain challenges, most of which will be ironed out in the years to come.