Structural business statistics - historical data (sbs

Structural business statistics (SBS) describes the structure, conduct and performance of economic activities, down to the most detailed activity level (several hundred economic sectors).

SBS covers all activities of the non-financial business economy with the exception of agricultural activities and personal services. Limited information is available on banking, insurance and pension funds.

Main characteristics (variables) of the SBS data category:

Business demographic variables (e.g. Number of enterprises)
"Output related" variables (e.g. Turnover, Value added)
"Input related" variables: labour input (e.g. Employment, Hours worked); goods and services input (e.g. Total of purchases); capital input (e.g. Material investments)

28 March 2023

The statistical characteristics are defined in Annex I of Commission Regulation (EC) No 250/2009

survey unit = legal unit, local unit and, for a few select cases, subdivisions of a legal unit;

reporting unit / statistical unit = KAU, enterprise, local unit;

Only active market units are included.

Active units encompass the following units:

* units with a non-zero value of turnover or non-zero employees;

* units which are active according to the SBS survey and not properly recorded in the administrative sources (e.g. VAT exempt units without employment);

The market status of a unit is defined in terms of the national accounts institutional sectors. Sectors 12.7, 13 and 15 are excluded from the target population.

The following units are specifically excluded:

* temporary partnerships;

* private persons producing energy using photovoltaic installation;

* artificial subsidiaries and other special purpose entities dealing almost exclusively with group affiliates.

There are no exclusions in terms of NACE. However, branch H.50 (water transport) is only available as a branch estimate and not as micro-data. Consequently, the size class dimension is not available.

The reference area is the Luxembourgish economic territory as defined by the national accounts ESA 2010 definition.

Consequently, resident branches of foreign enterprises are included in the target population if the said units are subject to fiscal and other legal obligations of the national economy beyond a simple registration for VAT purposes.

Symetrically, foreign branches of resident enterprises are excluded from the target population. Immobile fixed tangible assets (e.g. buildings) located in foreign economic territories but legally owned by resident enterprises are also considered foreign branches and thus excluded from the target population. The exclusions can only be performed on a case by case basis because the information is not easily available.

2020

The reference year corresponds with the calendar year.

For each company with a different financial year than the calendar year, we take a decision regarding the SBS reference period. Normally, this decision is taken in such a way that at least 6 months of the financial year for a given company are included in the corresponding SBS reference period. Once that decision is taken, it is maintained every subsequent year for comparability reasons.

Restricted from publication

Number of enterprises and number of local units are expressed in units.
Monetary data are expressed in millions of €.
Employment variables are expressed in units.
Per head values are expressed in thousands of € per head.

Ratios are expressed in percentages.

Unit non-response imputation

Please refer to chapter 13.

Item non-response imputation

Given that SBS data are above all quantitative and that they also serve the needs of national accounts, it is very difficult to identify item non-response. This is in particular true for units who are included only once in the sample in a given time series.

For the bigger companies, we are able to follow their structure over a larger period and thus to identify item non-response in an easier way. Item non-response is therefore dealt with on a case-by-case basis during editing procedures.

Missing or erroneous information in the administrative source

For some economic activities, VAT data are not available. No imputation is done on administrative data. If employment data are significant for a given unit, it is very likely included in the survey data. Consequently, the occurrence of missing data in the VAT and social security administrative source is rare and of low impact. In the rare case of missing or the more frequent case of erroneous administrative data, adjustments are performed in a dedicated working area.

For the standard chart of accounts data source (available as of 2015), any missing data is imputed using the either cold-deck ratio imputation or hot-deck ratio imputation (grossing up).

Grossing up

When grossing up survey data, we always take into account ancillary data available in the administrative sources, i.e. the ancillary variables “turnover” and “number of employees”, and not only the sample design. The aforementioned ancillary variables most often have a linear correlation with the variables of interest in SBS. Consequently, the ratio estimator is used for grossing up. With the variable of interest being available only for the sample (y) and with the ancillary variable being available both for the sample (x) and the SBS target population (X), the ratio estimator of the variable of interest , expressed as an average, can be written:

The ratio R has been formulated in the following way:

If for a given stratum no unit is available in the sample, i.e. neither for the historical periods nor for the reference period, the units in this stratum are grouped together with another stratum for which sample data are available and are then grossed up.

Variables 11 11 0, 13 31 0, 13 32 0, 13 33 0, 16 13 0 and 16 15 0 are available directly in the administrative source and therefore do not need to be grossed up (as from reference year 2009). If prior year survey data are available, a cold-deck ratio imputation is performed – this significantly reduces the bias in case there are permanent differences between the admin source and the survey data for a given unit. For the other units, administrative data are simply pasted into the data set.

Please also refer to chapter 13.

Statistical survey combined with administrative sources

At the heart of the data collection is the annual structural business survey. The annual sample of the said survey is based on a stratified random sampling design. Using number of employees and turnover thresholds, the target population is divided into two parts:

* any legal units employing either more than 45 employees or having declared a turnover excluding VAT of more than 7 million EUR per annum are selected every year. Any legal units linked to the aforementioned units (e.g. group and enterprise links) are also selected every year;

* the other legal units are selected using a stratified random probability sampling design. Units in strata consisting of only one unit are always selected. Furthermore, we apply a rotation principle to ensure that a unit can only be surveyed at maximum once every three years – this procedure helps to reduce the administrative burden for the smaller entities. The following years, these units can be satisfactorily imputed using cold-deck ratio imputation or, as of 2015, be imputed using detailed accounting data.

To cover the needs of national accounts and SBS, the sample size is between 3000 and 3600 reporting units.

The stratification used in the sample design does not take into account the size classes defined in the SBS regulation. However, mass imputation procedures exist on a micro data level, so that data can be broken down according to any spanning variable defined a posteriori:

* as of 2015, detailed accounting data from the Central Balance Sheet Office are used to impute units which are not part of the sample or which no recent survey data are available - this source covers balance sheet and profit & loss data. In 2019, the data source accounts for more than half of the SBS population;

* as of 2005, social security data to impute number of employees, personnel costs and number of hours worked by employees;

* detailed VAT data to impute investment variables;

* VAT and social security key indicators to impute non-respondent units using the cold-deck ratio imputation method.

Data sources used for the population frame or target population

The statistical business register is the prime source for building the target population. The said register is enriched by key data from social security and VAT. These data are used to determine the activity status of any enterprise. Only active units are retained. There is no threshold on employment and turnover, except that either has to be positive for the reference year in question.

In addition, for units having no employment and not covered by VAT, the survey helps to identify a few active units. They are added to the population frame.

NACE classification

The difference between principal and secondary activities is directly available in the business register. The approach to identify principal activities is the top-down approach. Stability rules are in place to avoid yearly shifting between principal and secondary activities.

The update frequency of the business register is daily. However, for SBS purposes the extract of the business register is frozen at a given date. From that point on, changes in NACE are only performed through a dedicated working area.

Relation between the reporting unit and the enterprise

In the SBS survey, the reporting unit is the legal unit. However, for a very few legal units, the data are collected separately for the economic subdivisions of the legal unit. Moreover, local unit data are collected at legal unit level. In the administrative sources, the reporting unit is the legal unit.

Finally, the statistical unit "enterprise" can be a legal unit or a group of legal units. Enterprises which are formed by a group of legal units are not the general case in Luxembourg but their occurrence is significant, in particular due to many legal units specialized in ancillary activities for other legal units belonging to the same enterprise group.

Annual

Restricted from publication

Not applicable.

Length of comparable time series

1995 - 2002 (KAU concept)

2003 - 2009 (enterprise concept NACE Rev.1.1)

2005 - 2020 (enterprise concept NACE Rev.2)

Important events in the time series

A revision of SBS data series was completed in summer 2014 for the reference years 2005 to 2010 included. The revision was performed using NACE Rev.2 and included the revision of the profiling of some major players. The impact of the revision was significant for a few activities.

As of 2015, data from the Central Balance Sheet Office are used to impute units which are not part of the sample or which no recent survey data are available. The impact of the data integration is an overall improvement of the sampling error, potentially traded for an increased measurement error on the local level.

As of 2019 reference year, a rolling revision policy has been introduced. After the official production of reference year T (e.g. 2019) published during the year T+2 (e.g. 2021), a revision for the reference year T data is performed during T+3 (e.g. 2022). The impact of this revision not only removes local breaks in series due to temporary errors in NACE but also introduces a less desirable side-effect: due to the consideration of late VAT data during the revision, there is a potentially significant upward impact on the number of active enterprises (mainly very small entities) and the number of persons employed (excluding employees).

Structural business statistics - historical data (sbs_h)

1. Contact

1.1. Contact organisation

1.2. Contact organisation unit

1.3. Contact name

1.4. Contact person function

1.5. Contact mail address

1.6. Contact email address

1.7. Contact phone number

1.8. Contact fax number

2. Metadata update

2.1. Metadata last certified

2.2. Metadata last posted

2.3. Metadata last update

3. Statistical presentation

3.1. Data description

3.2. Classification system

3.3. Coverage - sector

3.4. Statistical concepts and definitions

3.5. Statistical unit

3.6. Statistical population

3.7. Reference area

3.8. Coverage - Time

3.9. Base period

4. Unit of measure

5. Reference Period

6. Institutional Mandate

6.1. Institutional Mandate - legal acts and other agreements

6.2. Institutional Mandate - data sharing

7. Confidentiality

7.1. Confidentiality - policy

7.2. Confidentiality - data treatment

8. Release policy

8.1. Release calendar

8.2. Release calendar access

8.3. Release policy - user access

9. Frequency of dissemination

10. Accessibility and clarity

10.1. Dissemination format - News release

10.2. Dissemination format - Publications

10.3. Dissemination format - online database

10.4. Dissemination format - microdata access

10.5. Dissemination format - other

10.6. Documentation on methodology

10.7. Quality management - documentation

11. Quality management

11.1. Quality assurance

11.2. Quality management - assessment

12. Relevance

12.1. Relevance - User Needs

12.2. Relevance - User Satisfaction

12.3. Completeness

13. Accuracy

13.1. Accuracy - overall

13.2. Sampling error

13.3. Non-sampling error

14. Timeliness and punctuality

14.1. Timeliness

14.2. Punctuality

15. Coherence and comparability

15.1. Comparability - geographical

15.2. Comparability - over time

15.3. Coherence - cross domain

15.4. Coherence - internal

16. Cost and Burden

17. Data revision

17.1. Data revision - policy

17.2. Data revision - practice

18. Statistical processing

18.1. Source data

18.2. Frequency of data collection

18.3. Data collection

18.4. Data validation

18.5. Data compilation

18.6. Adjustment

19. Comment

Related metadata

Annexes

Data description

Metadata last update