Back to top

Structural business statistics - historical data (sbs_h)

DownloadPrint

National Reference Metadata in Euro SDMX Metadata Structure (ESMS)

Compiling agency: Office for National Statistics

Need help? Contact the Eurostat user support

Structural business statistics (SBS) describes the structure, conduct and performance of economic activities, down to the most detailed activity level (several hundred economic sectors).

SBS covers all activities of the non-financial business economy with the exception of agricultural activities and personal services. Limited information is available on banking, insurance and pension funds.

 Main characteristics (variables) of the SBS data category:

  • Business demographic variables (e.g. Number of enterprises)
  • "Output related" variables (e.g. Turnover, Value added)
  • "Input related" variables: labour input (e.g. Employment, Hours worked); goods and services input (e.g. Total of purchases); capital input (e.g. Material investments)

10 February 2020

The statistical characteristics are defined in Annex I of Commission Regulation (EC) No 250/2009

ONS sampling is done at Respondent Unit (RU) level rather than at Enterprise Unit (EU) level. This is due to historic reasons and is a process that ONS are unable to change due to the current systems and technology used in holding company/businesses information on the business register. The ONS business register is the Inter-Departmental Business Register (IDBR). The IDBR holds all business information and has a large and experienced team of business profilers who make changes to business structures on a regular basis based on current knowledge and meetings with businesses. These are often the large Multi-National Enterprises (MNEs) that have complex structures in place for reporting purposes.

When analysing whether the data held at RU level corresponds to EU level it was assessed that this was true for 98% of the businesses held on the IDBR – the 2% mismatch represented those large MNEs where multiple RUs where held for one Enterprise.

As an example we can consider a large supermarket chain where they typically have separate RUs for Retail, Wholesale, Distribution, Transport, Catering services, financial services and mobile services etc. All data would be reported according to their management information and company accounts – at an RU or activity/SIC level. In order to obtain data for the Enterprise a bespoke process was implemented in a special analysis system whereby all RU level data were aggregated up to the Enterprise level based on the dominant employment – so in the case of our large supermarket the majority of employees work in the Retail sector. All data would therefore be reported under SIC 47 retail. This was repeated for each MNE where there was no a like for like match at RU to EU level.

The introduction of the Large Cases Unit means that better information can be obtained on MNEs reporting structures and changes can be made to the IDBR on an on-going basis to ensure better quality data are collected on the Enterprise as a whole.

All United Kingdom businesses registered for either Value Added Tax (VAT) or Pay as You Earn (PAYE). 

United Kingdom - Regions of England, Scotland, Wales and Northern Ireland.

2017

The Annual Business Survey (ABS) meets its legal requirements for statistical accuracy. However, as in all surveys, the estimates from the ABS are subject to various sources of error. Provisional data are published at t+10 and Revised at t+17. 

  • Number of enterprises and number of local units are expressed in units.
  • Monetary data are expressed in millions of €.
  • Employment variables are expressed in units.
  • Per head values are expressed in thousands of € per head.

Ratios are expressed in percentages.

The imputation approach was determined through simulation, by suppressing real data and running imputation using median ratios which result in imputed values close to the corresponding true values.

The imputation method used is based upon the principle of ratio imputation where an imputation link is calculated using information from similar business within the same industry and size band. There are various constraints which are applied to calculating these links. The constraints on the responders (in the industry concerned) used to calculate the links are that they must have:

  • employment of more than 100
  • returned turnover greater than 0
  • data available for the previous and current periods

 Case 1: businesses that have responded in the previous period

For example, if business X has not returned a value for total turnover in the current period, but did in the previous period, businesses in the same sector that have returned a value for total turnover in the current period are considered.

Of these, those who have returned a value for total turnover for both the current and the previous periods are identified. The ratio of the current value of total turnover to the previous value for total turnover is then calculated.

For each responding business to both periods:

The median ratio value, referred to as the imputation link, is then calculated and applied to the returned value for the non-responding business in the previous period.

Case 2: businesses that are first-time responders

In the example below, business X is a first-time responder and so does not have any returned value for the previous period. In these cases the IDBR turnover of businesses is used in the calculation of the imputed value for X.

When calculating median ratios as described above, the calculation routine will attempt to identify respondents at a 4-digit SIC level. Should for some reason (e.g. insufficient number of businesses) there be no median at this level, the routine will try to use a median calculated at 3-digit SIC level. If this fails to find a median, the routine will try at the 2-digit SIC level.

In order to calculate the estimates for an entire population from data collected from a sample, ABS uses standard statistical weighting methods. Essentially the results received from the sample are multiplied by two weights:

  • the a-weight, also known as the design weight, which accounts for the sample design so that a business’s probability of selection is properly reflected. So, for example, a business with a small probability of being selected for the survey will have a large design weight.
  • the g-weight, or calibration factor, makes a correction for any potential bias in the selected sample. For example, in a random selection of five businesses out of a population of ten, it is possible that the five businesses selected have, by chance, higher values for the variables of interest than the non-sampled businesses. If no correction is made, the population total would be over-estimated. Auxiliary information, i.e. information not collected by the survey, which acts as a proxy for the variable of interest, is used to correct for this effect. The ratio of the actual population total for the auxiliary variable to the population total estimated from the sample’s auxiliary variables is calculated, and this is called the g-weight. For ABS, the auxiliary variables are the IDBR employment and turnover, with the choice dependent on the variable being estimated.

The weighted value is then:

weighted value = returned value of the variable * a-weight * g-weight

Estimates of population totals are then found by simply summing the weighted values over the whole sample 

The Annual Business Survey (ABS) is a sample is designed as a stratified random sample from the sampling frame held on the Inter-Departmental Business Register (IDBR).  The IDBR is composed of legal units registered for Value Added Tax (VAT) or Pay As You Earn (PAYE) schemes. The inquiry population is stratified by NACE rev 2, employment, and country using the information from the register. The survey results are grossed up to the register population, so that they relate to all active UK businesses on the IDBR for the sectors covered.

Stratification criteria; Activity,Employment size class, Region 

The sampling scheme is designed to give best estimates of the population totals for a given sample size and involves selecting all the largest businesses with a progressively reducing fraction of smaller businesses. This method ensures the sample size is kept to a minimum. The sample is selected using the Neyman Optimal allocation scheme.

For most businesses with employment of 0-9 Osmotherly rules apply. These rules state that when a business with 0-9 employment has been selected in a survey, it will only be selected for a single year, and it will not be reselected for at least three years following selection.

The sample design is constructed so that a sample for a cell will be selected for two years only, and the units in that sample will largely not be re-selected for at least two years after that selection. The random sample selection uses the permanent random number (PRN), the unique nine digit identifier which is randomly assigned to each unit when it is added to the IDBR. The sample from each cell is constructed from the required number of units with consecutive PRNs in that cell. For ABS, each sample is selected for two years, and there is a year-to-year overlap of half the sample. That is, in any year, half of the sample will be newly-selected, and half will have been selected in the previous year as well.

For 2017 the sample for sectors covered by the SBSR was 61500.

Other than for a minority of larger business or businesses which have a more complex structure, the reporting unit is the same as the enterprise. For this reason, ABS reporting unit counts are presented as enterprise counts.

The dominant activity of a reporting unit is based on employment and calculated using a 'top down' method on a digit by digit basis for multiple activity enterprises. For example, the following four local units have SIC codes and employment as listed:  

Section/ SIC 2007

 Employment

C     18 20 1

50

C     26 11 0

20

C     26 12 0

10

C     26 40 0

25

The largest employment appears to be in SIC 18201 (50), but the IDBR first compares at section (letter “C”), then division (first two digits) and so on. In this case division 26 is largest (55). This comparison cascades further through group (261 has 30 employment, compared to 264 having 25) and finally class and sub-class to give a dominant SIC of 26110 (20).

The classification of businesses on the Inter-Departmental Business Register (IDBR) is updated annually. The Business Employment Register Survey (BRES) provides a ‘future local unit results’ file to the Business Register Unit (BRU) containing employees and classification data, which is loaded onto the IDBR in July of each year. In addition to this, classification details for the production sector are provided by PRODCOM (Products of the European Community) and also files are received from the Annual Business Survey (ABS), this data is also loaded onto the IDBR. For Classifications updating there are certain priority rules to be followed. Except for BRES the other updating sources are bottom down, starting at the Enterprise. The priority for classifications updating is as follows:

BPT – Business Profiling Team

Prodcom / Retail Surveys / Financial Surveys

ABS - Annual Business Survey

BRES – Business Register Employment Survey

MPS – Monthly Production Survey

Other ONS surveys

CISTATS – Construction Industry Statistics Address File

VAT – Value Added Tax

PAYE – Pay-As-You-Earn 

Annual

Data-collection phase

From 12/2017

Post-collection phase

Provisional results - 10/2018

Revised results - 03/2019

Dissemination

Provisional results – 11/2018

Revised results – 05/2019

Differences between Great Britain and Northern Ireland methodology.

The Department for Finance and Personnel Northern Ireland (DFPNI), rather than ONS, conduct the ABS in Northern Ireland. The survey process in Northern Ireland is similar but not exactly the same as that for Great Britain. For example, DFPNI despatch questionnaires in March, after ONS despatch to businesses in Great Britain in January/February. ONS receive reporting unit and local unit level data for businesses sampled by DFPNI in September and February of each survey year. These Northern Ireland reporting unit data are then processed together with the Great Britain data collected by ABS to produce estimates for the whole of the UK at various industry aggregations, as well as producing regional estimates. DFPNI also process the data to produce their own estimates for Northern Ireland. These differ from the ONS estimates for Northern Ireland for a number of reasons:

Calculation of the a-weights The ONS National Results System computes the a-weights (design weights) for all United Kingdom data using the sample design of the Great Britain sample; and while the DFPNI sample design used to have the same design as ONS prior to 2008, DFPNI changed their design in 2008, and again in 2009. The new DFPNI sample design is quite different from that used by the ABS Great Britain - it is similar to that used in the Business Register and Employment Survey. Therefore, the design weights computed in the ONS system for Northern Ireland units can be quite different from those computed in the DFPNI system.

Calculation of the g-weights

The ONS National Results System computes two sets of g-weights: one based on register turnover and another based on register employment. The latter is used for employment costs, whereas the former is used for all the other variables. The Regional System computes g-weights based on local unit employment. In the Northern Ireland methodology, register employment is used for all variables in the calibration in their national system. In their regional system, the g-weights are computed with respect to local unit register employment but using a different calibration method to that used in the ONS regional system.

Regional apportionment

ONS collects all ABS data at reporting unit level; the regional system apportions reporting unit returns between local units using factors obtained from multiple regression models. DFPNI collects turnover data at local unit level, but does not use these data in their apportionment; their current apportionment is based on the median of per head returns. When Northern Ireland data are processed in the ONS system, new apportioned local unit values, based on the ONS methodology, are obtained and used to produce estimates. Also, ONS and DFPNI use different methods to deal with local units operating in industries that are out of scope.

DFPNI does not collect data for all the variables included in the Great Britain questionnaire; in the ONS system, values are derived for the missing variables using a model, and these values contribute towards the estimation of derived variables.

Comparable time series – 1996-2007 ; 2008-2015 ; 2016-2017

The UK believes there should be breaks within the time series shown above.

  • The change in the NACE rev 2 code resulted in data for 2008 onwards being reported on a slightly different structure to that of NACE rev 1.2 (from 1996 – 2007)
  • Data for 1996-2015 were supplied at the UK Reporting Unit level, from 2016 data was supplied at the UK Enterprise level.