ICT usage in enterprises (isoc_e)

National Reference Metadata in SIMS structure for INFOSOC Enterprises

Compiling agency: Institut national de la statistique et des études économiques (STATEC)


Eurostat metadata
Reference metadata
1. Contact
2. Metadata update
3. Statistical presentation
4. Unit of measure
5. Reference Period
6. Institutional Mandate
7. Confidentiality
8. Release policy
9. Frequency of dissemination
10. Accessibility and clarity
11. Quality management
12. Relevance
13. Accuracy
14. Timeliness and punctuality
15. Coherence and comparability
16. Cost and Burden
17. Data revision
18. Statistical processing
19. Comment
Related Metadata
Annexes (including footnotes)



For any question on data and metadata, please contact: Eurostat user support

Download


1. Contact Top
1.1. Contact organisation

Institut national de la statistique et des études économiques (STATEC)

1.2. Contact organisation unit

ENT3 - Structural Business Statistics

1.5. Contact mail address

STATEC
B.P. 304
L-2013 Luxembourg


2. Metadata update Top
2.1. Metadata last certified 12/03/2024
2.2. Metadata last posted 12/03/2024
2.3. Metadata last update 12/03/2024


3. Statistical presentation Top
3.1. Data description

Data on the Information and Communication Technologies (ICT) usage and e-commerce in enterprises are survey data. They are collected by the National Statistical Institutes or Ministries and are in principle based on Eurostat's annual model questionnaires on ICT usage and e-commerce in enterprises.

Large part of the data collected is used to measure the progress in the implementation of one of the main political priorities of the European Commission for 2019 to 2024 – A Europe fit for the digital age. Part of this is the "European strategy for data", envisioning a single market for data to ensure the EU's global competitiveness and data sovereignty, in which context a comprehensive set of new rules for all digital services was proposed: the Digital Services Act and the Digital Markets Act, which are centrepieces of the EU digital strategy. Furthermore, the Commission and the High Representative of the Union for Foreign Affairs and Security Policy presented a new “EU cybersecurity strategy”, which is intended to bolster the EU's collective resilience against cyber threats, safeguard a global and open internet and protect EU values and the fundamental rights of its people. Furthermore, data will allow monitoring the progress towards  A Europe fit for the digital age, one of the six priorities for the period 2019-2024 of the von der Leyen European Commission.

The aim of the European survey on ICT usage and e-commerce in enterprises is to collect and disseminate harmonised and comparable information at European level.

 

Name of data collection
 Enquête relative  à l’usage des technologies de l’information et de la communication dans les entreprises – 2023
3.2. Classification system

 NACE Rev.2 2008

3.3. Coverage - sector

All economic activities in the scope of Annex I of the Commission Regulation are intended to be included in the general survey, covering enterprises with 10 or more employees and self-employed persons. These activities are: NACE Rev. 2 sections C, D, E, F, G, H, I, J, L, M and N, division 95.1.

For micro-enterprises see the sub-concepts below.

3.3.1. Coverage-sector economic activity for micro-enterprises - All NACE Rev. 2 categories are covered
3.3.2. Coverage sector economic activity for micro-enterprises - If not all activities were covered, which ones were covered?

Micro-enterprises are not covered by the survey in Luxembourg.

3.4. Statistical concepts and definitions

The model questionnaire on ICT usage and e-commerce in enterprises provides a large variety of variables covering among others the following areas:

-          Access to and use of the Internet

-          E-commerce and e-business

-          Use of cloud computing services

-          Artificial Intelligence

-          Other topics: Data utilisation, sharing, analytics and trading, Invoicing.

The annual model questionnaires and the European businesses statistics compliers’ manual for ICT usage and e-commerce in enterprises comprise definitions and explanations regarding the topics of the survey.

3.5. Statistical unit

The enterprise unit was used.

3.6. Statistical population

Target Population

As required by Annex of the Commission Implementing Regulation, enterprises with 10 or more employees and self-employed persons shall be covered by the survey.

For micro-enterprises see the sub-concepts below.

3.6.1. Coverage of micro-enterprises
No
3.6.2. Breakdown between size classes [0 to 1] and [2 to 9]
No
3.6.3. If for micro-enterprises different size delimitation was used, please indicate it.

Micro-enterprises are not covered by the survey in Luxembourg.

3.7. Reference area

The whole territory was covered.

3.8. Coverage - Time

Years 2022 and 2023.

3.9. Base period

Not applicable


4. Unit of measure Top

Percentages of enterprises, Percentages of turnover, Percentages of employees and self-employed persons.


5. Reference Period Top

The reference periods defined in the model questionnaire were followed in the national survey: “Where not otherwise specified, the reference period is the current situation (year 2023). The reference period for the percentages of sales/orders data is the financial year 2022.”


6. Institutional Mandate Top
6.1. Institutional Mandate - legal acts and other agreements

Complementary national legislation constituting the legal basis for the survey on the use of ICT in enterprises:

Loi du 10 juillet 2011 portant sur l’établissement de l’Institut national de la statistique et des études économiques (STATEC).

6.2. Institutional Mandate - data sharing

Not applicable for this survey


7. Confidentiality Top
7.1. Confidentiality - policy

Regulation (EC) No 223/2009 on European statistics (recital 24 and Article 20(4)) of 11 March 2009 (OJ L 87, p. 164), stipulates the need to establish common principles and guidelines ensuring the confidentiality of data used for the production of European statistics and the access to those confidential data with due account for technical developments and the requirements of users in a democratic society.

At national level : 

 

Quantitative variables
For quantitative variables, the same policy as in SBS is applied.
Primary confidentiality, if:
• n units dominate the total turnover by at least k% ;
• the cell contains less than n units.


Secondary confidentiality if:
• protection is needed to address primary confidentiality or linked table risks (historical data, subdivisions, etc.) ;
• there is a link with the SBS suppression pattern.


Every cell linked to the variable "turnover" is confidential if the cell is confidential for the variable "turnover".
A link can be established as:
• direct link: the value of the variable of interest depends on the value of turnover (e.g. value-added, cost of sales, gross operating margin, etc.) ;
• indirect link: the value of turnover provides an idea of the dimension of the variable of interest (e.g. data calibrated with the variable « turnover », etc.).


In addition to the link inherited via the variable "turnover", variables relating to e-commerce turnover ("awsval", "axsval", etc.) are also checked for primary confidentiality individually.

 

Qualitative variables
For qualitative variables in frequency tables, a distinction has to be made between "Higher level confidentiality" and "Variable-specific rules":


Higher level confidentiality
Higher level confidentiality is checked only for cells broken down exclusively by variables which are available for every unit in the target population.
Higher level primary confidentiality is applicable if a cell in the target population contains less than n units for a given subdivision or breakdown ;
Higher level secondary confidentiality is applicable for any cells which are
• either a subdivision or a breakdown of the primary confidential cell, including cells with zero units ;
• or needed to protect primary confidential cells.
On each level of breakdown, all the cells have to be checked for both primary and secondary confidentiality. This rule is applied on the highest level for the total number of units, e.g. units per economic activity, units per size class, units per economic activity and size class, i.e. no additional variable of interest is used to subdivide the population (e.g. enterprises using computers, enterprises selling online, etc.). Each variable of interest inherits the higher-level suppression pattern.


Variable-specific rules
Each variable needs to be checked a priori for the following risk potentials:
• disclosure risk (none/low/high): the potential of identification of the variable ;
• sensitivity (none/low/high): the potential damage dealt by the information in case of the disclosure ;
• group sensitivity (none/low/high): the potential damage dealt by the information in case of group disclosure.

 
One should be aware of the consequences of a « none » risk, meaning that no confidentiality is applied at all.

 

7.2. Confidentiality - data treatment

Data are transmitted via eDamis (encrypted) and delivered to a secure environment where they are treated. Flags are added for confidentiality in case results must not be disclosed.

At national level :

Data are transmitted via eDamis (encrypted) and delivered to a secure environment where they are treated. National Statistical Institutes are requested to add flags for confidentiality in case results must not be disclosed.

 

Quantitative variables

The basis for any suppression pattern is the software package tau-Argus. However, the process also involves manual procedures, i.e. checking the tau-Argus output, comparing the historical data series and addressing linked table disclosure risks (see secondary confidentiality for further details).

The statistical disclosure control procedures are not performed for every variable individually but only for a primary shadow variable, i.e. "Turnover". If a given cell is confidential for that variable (no matter if primary or secondary), the same cell will be suppressed for all the other available quantitative variables. Variables relating to e-commerce turnover are also checked individually, an pass their flags on to related qualitative variables as well.

 

Primary confidentiality rules

 a) Sensitivity rule: We apply the (n,k)-dominance rule, i.e. a cell is suppressed if n units separately or jointly dominate the total value of a cell by at least k% .

 b) Minimum frequency rule: For any cells that are left after applying the sensitivity rule, a minimum frequency is applied. A cell is suppressed if there are less than n units in a given cell.

 

Secondary confidentiality rules

The secondary suppression is calculated by tau-Argus using the ‘Modular’ algorithm. Manual suppressions or cost adjustments are performed using the tau-Argus ‘apriori’ file facility.

 a) Secondary suppression within a table

 - A cell is suppressed for secondary confidentiality if n units dominate jointly or separately the confidential total value by at least k% ;

 - special attention is paid to the impact of singletons, a risk which is in most cases directly addressed by the tau-Argus Modular algorithm ;

 - tau-Argus is set to minimise the cost when determining the secondary suppressed cells. However, we also want to provide the user with useful data, whether it is in terms of interpretation and/or availability of time series. Consequently, the cost minimisation can be overridden for economic and/or historical reasons.

 

b) Secondary suppression due to linked tables disclosure risks

 - historical disclosure: in conformity with the SDC handbook, we ensure that no historical cell is compromised by disclosing the same cell for the current reference year. As long as there is any significant link with prior year data, a cell may not be disclosed for the current reference year.

 - Links to any other statistics: Turnover for the ICT survey is compared to the SBS preliminary series of the same reference year (e.g. T-1 for the survey carried out in year T).

 


8. Release policy Top
8.1. Release calendar

Not available.

8.2. Release calendar access

Not available.

8.3. Release policy - user access

Not available.


9. Frequency of dissemination Top

Annual


10. Accessibility and clarity Top
10.1. Dissemination format - News release

 

National dissemination of results

 

None available at the time of this report.

 

10.2. Dissemination format - Publications

 

National dissemination of results

 

Selected results will also appear in the annual statistical yearbook, “Luxembourg in figures” and in “Un portrait chiffré des entreprises au Luxembourg”.

 

10.3. Dissemination format - online database

See detailed section 10.3.1.

10.3.1. Data tables - consultations

Results for selected variables collected in the framework of this survey are available for all participating countries on Digital economy and society of Eurostat website.

At national level :

 

Results will be published on https://statistiques.public.lu/fr/donnees/themes/entreprises/sciences-technologies.html

 

10.4. Dissemination format - microdata access

Not applicable

10.5. Dissemination format - other

Not requested

10.5.1. Metadata - consultations

Not requested

10.6. Documentation on methodology

Not available

10.6.1. Metadata completeness - rate

Not requested

10.7. Quality management - documentation

Not available


11. Quality management Top
11.1. Quality assurance

The European businesses statistics compliers’ manual for ICT usage and e-commerce in enterprises provides guidelines and standards for the implementation of the surveys. It is updated every year according to the changed contents of the model questionnaires.

At national level :

Not available.

11.2. Quality management - assessment

European level :

At European level, the recommended use of the annual Eurostat model questionnaire aims at improving comparability of the results among the countries that conduct the survey on ICT usage and e-commerce in enterprises. Moreover, the Methodological Manual provides guidelines and clarifications for the implementation of the surveys.

National level :

 

The online questionnaire was maintained as the main survey vehicle with no paper questionnaires sent out after the previous survey showed sharp increases in online response rates using this procedure. Alternatively, a PDF version of the questionnaire was available for downloaded and print out.

 

Experiences gained with past questionnaires as well as from other domains (R&D, CIS) helped to improve the quality of the online questionnaire while trying to reduce the response burden as much as possible.

 

The 2023 data collection is the tenthconsecutive edition completely managed in-house. This allows for a better overview of the status of the survey, the problems encountered by respondents, as well as an improved follow-up with 2 reminders, as well as a planned 3rd reminder to high-impact enterprises sent as a registered letter.

 


12. Relevance Top
12.1. Relevance - User Needs

European level : 

At European level, European Commission users (e.g. DG CNECT, DG GROW, DG JUST, DG REGIO, DG JRC) are the principal users of the data on ICT usage and e-commerce in enterprises and contribute in identifying/defining the topics to be covered. Hence, main users are consulted regularly (at hearings, task forces, ad hoc meetings) for their needs and are involved in the process of the development of the model questionnaires at a very early stage.

User needs are considered throughout the whole discussion process of the model questionnaires aiming at providing relevant statistical data for monitoring and benchmarking of European policies.

National level :

STATEC's research unit is routinely asked to provide feedback on new modules in the model questionnaire, as well as to indicate any variables used in research projects that might be missing from the model questionnaire. These additional variables, the pursued goal and their value-added are generally discussed between both units, in order to reach a consensus that allows to keep the questionnaire relatively short, while not unnecessarily reducing its usefulness to the research community.

12.2. Relevance - User Satisfaction

European level : 

At European level, contacts within the Commission, the OECD and other stakeholders give a clear picture about the key users' satisfaction as to the following data quality aspects: accuracy and reliability of results, timeliness, satisfactory accessibility, clarity and comparability over time and between countries, completeness and relevance. Overall users have evaluated positively (good, very good) the data quality on the ICT usage and e-commerce in enterprises.

National level :

There is no survey led at the national level to assess the user's satisfaction on the data quality on the ICT usage and e-commerce in enterprises.

12.3. Completeness

Detailed information is available in “ Annex I _ Completeness “ excel file - related to questionnaire, coverage, additional questions.

12.3.1. Data completeness - rate

Not requested. 


13. Accuracy Top
13.1. Accuracy - overall

Comments on reliability and representativeness of results and completeness of dataset

These comments reflect overall standard errors reported for the indicators and breakdowns in section 13.2.1 (Sampling error - indicators) and the rest of the breakdowns for national and European aggregates, as well as other accuracy measurements. The estimated standard error should not exceed 2pp for the overall proportions and should not exceed 5pp for the proportions related to the different subgroups of the population (for those NACE aggregates for the calculation and dissemination of national aggregates). If problems were found, these could have implications for future surveys (e.g. need to improve sampling design, to increase sample sizes, to increase the response rates).

More detailed information is available in “ Annex II. _ Accuracy “ excel file - related to European aggregates, comments on reliability and use of flag.

13.2. Sampling error

For calculation of the standard error see 13.2.1.1.

13.2.1. Sampling error - indicators

Standard error (for selected indicators and breakdowns)

Precision measures related to variability due to sampling, unit non-response (the size of the subset of respondents is smaller than the size of the original sample) and other (imputation for item non-response, calibration etc.) are not (yet) required from the Member states for all indicators.  Eurostat will make basic assumptions to compute these measures for all indicators produced (e.g. stratified random sampling assuming as strata the crossing of the variables “Number of employees and self-employed persons” and “Economic Activity” as it was defined in the 3 tables of section 18.1).

More detailed information is available in“ Sample and standard error tables 2023 “ excel file – worksheets starting with “Standard error".

13.2.1.1. Sampling error indicator calculation

Calculation of the standard error

Various methods can be used for the calculation of the standard error for an estimated proportion. The aim is to incorporate into the standard error the sampling variability but also variability due to unit non-response, item non-response (imputation), calibration etc. In case of census / take-all strata, the aim is to calculate the standard errors comprising the variability due to unit non-response and item non-response.

a) Name and brief description of the applied estimation approach
 

Estimations of the variables of interest were performed with the calibrated weights.

For a sufficiently large sample size, the calibration estimator is equivalent to the linear regression estimator and its bias tends to be minor. Consequently, the variance of the estimation is based on the residuals resulting from the relationship between the variable of interest and the ancillary variables which have been used for the calibration.

The standard error for a given survey stratum (which is normally also the stratum used for grossing-up) is calculated based on these properties. If necessary, these standard errors are then aggregated to the breakdowns requested by Eurostat.

 

b) Basic formula
  See Annex 'basic formula Luxembourg'

 

c) Main reference in the literature
  “Techniques de sondage” by Pascal Ardilly (ISBN 10: 2-7108-0847-1)

 

d) How has the stratification been taken into account? 
 See Annex 'stratification Luxembourg'

 

e) Which strata have been considered? 
 

The strata used were the ones used for grossing-up. In cases where several strata had to be combined to allow calibration, these regroupings were also taken into account for the calculation of standard errors.

Generally, strata were defined by crossing the following size classes and NACE groupings (exceptions in parenthesis):

 

Size classes:

S - 10-49 employees

M - 50-249 employees

L - 250 employees

 

NACE groupings:

C10_18

C19_23

C24_25

C26_33

D35_39 (S+M combined)

F41_43

G45_46

G47

H49_53

I55 (M + L combined)

I56

J58_63

L68 (S+M combined)

M69_75 (M + L combined)

N77_82

S951

13.3. Non-sampling error

See detailed sections below.

13.3.1. Coverage error

See concept 18.1.1. A) Description of  frame population.

13.3.1.1. Over-coverage - rate

0.4%

13.3.1.2. Common units - proportion

Not requested

13.3.2. Measurement error

No measurement errors detected.

13.3.3. Non response error

See detailed sections below.

13.3.3.1. Unit non-response - rate

See detailed sub-concepts below.

13.3.3.1.1. Unit response

The following table contains the number of units (i.e. enterprises), by type of response to the survey and by the percentage of these values in relation to the gross sample size.

 

Type of response Enterprises
0-9 employees and self-employed persons 10 or more employees and self-employed persons
Number % Number %
Gross sample size (as in section 3.1 C)   100%  2493 100%
1. Response (questionnaires returned by the enterprise)     2087  83.7%
1.1 Used for tabulation and grossing up (Net sample or Final Sample; as in section 3.1 D)     2076  83.3%
1.2 Not used for tabulation     11  0.4%
1.2.1 Out of scope (deaths, misclassified originally in the target population, etc.)      11  0.4%
1.2.2 Other reasons (e.g. unusable questionnaire)        
2. Non-response (e.g. non returned mail, returned mail by post office)     407  16.3%

 

Comments on unit response, if unit response is below 60%
 
13.3.3.1.2. Methods used for minimizing unit non-response

In order to reduce the non-response, 3 reminders are sent (with a registered letter to high-impact enterprises, on the 3rd reminder).

13.3.3.1.3. Methods used for unit non-response treatment
1. No treatment for unit non-response  
2. Treatment by re-weighting
2.1 Re-weighting by the sampling design strata considering that non-response is ignorable inside each stratum (the naïve model)  
2.2 Re-weighting by identified response homogeneity groups (created using sample-level information)  
2.3 Re-weighting through calibration/post-stratification (performed using population information) by the groups used for calibration/post-stratification X
3. Treatment by imputation (done distinctly for each variable/item) X
4. Method(s) and the model(s) corresponding to the above or other method(s) used for the treatment of unit non-response. (e.g. Re-weighting using Horvitz-Thompson estimator, ratio estimator or regression estimator, auxiliary variables )
 
To treat non-response, the initial sampling weight is first adjusted using the response rate for each stratum. Strata are defined by crossing the following size classes and NACE groupings.
In order to obtain reliable results for quantitative variables (that are in line with SBS totals), the corrected weights are calibrated to the number of units, using the total turnover and the total employment per stratum as auxiliary information. Calibration is carried out in R, using the “calib” method of the “sampling” package with a “logit” distance function.
13.3.3.1.4. Assessment of unit non-response bias

Not available

13.3.3.2. Item non-response - rate

There are no questions with response rates lower than 90%.

13.3.3.2.1. Methods used for item non-response treatment
1. No treatment for item non-response  
2. Deductive imputation
An exact value can be derived as a known function of other characteristics.
 X
3. Deterministic imputation (e.g. mean/median, mean/median by class, ratio-based, regression-based, single donor nearest-neighbour)
Deterministic imputation leads to estimators with no random component, that is, if the imputation were to be re-conducted, the outcome would be the same
 
4. Random imputation (e.g. hot-deck, cold-deck)
Random imputation leads to estimators with a random component, that is, if the imputation were re-conducted, it would have led to a different result
 X
5. Re-weighting  
6. Multiple imputation
In multiple imputation each missing value is replaced (instead of a single value) with a set of plausible values that represent the uncertainty of the right value to impute. Multiple imputation methods offer the possibility of deriving variance estimators by taking imputation into account. The incorporation of imputation into the variance can be easily derived based on variability of estimates among the multiply imputed data sets.
 
7. Method(s) and the model(s) corresponding to the above or other method(s) used for the treatment of item non-response.
 
Deductive imputation was used in cases when filters were not respected.
Cold-deck imputation, based on 2022 data, was used for variables that were present in last year's survey and which do not change much over time.
 
Hot-deck imputation based on random sampling without replacement was performed by predefined strata for other variables.
13.3.3.2.2. Questions or items with item response rates below 90% and other comments

Other comments relating to the item non-response

Additional issues concerning "non-response" calculation (e.g. method used in national publications).
 Not available

 

Questions and items with low response rates (cut-off value is 90% ) and item non-response rate.
 Not available
13.3.4. Processing error

No processing errors were detected.

13.3.5. Model assumption error

Not requested


14. Timeliness and punctuality Top
14.1. Timeliness

See detailed section below.

14.1.1. Time lag - first result

Not applicable

14.1.2. Time lag - final result

European level : 

Data are to be delivered to Eurostat in the fourth quarter of the reference year (due date for the finalised dataset is 5th October). European results are released before the end of the survey year or in the beginning of the year following the survey year (T=reference year, T+0 for indicators referring to the current year, T+12 months for other indicators referring to the previous year e.g. e-commerce).

At national level : 

A first complete set of data has been sent to Eurostat on 05/10/2023. Final data, including a complete SDC pattern, has been sent to Eurostat on 27/10/2023.

14.2. Punctuality

 See detailed section below.

14.2.1. Punctuality - delivery and publication

A first complete set of data has been sent to Eurostat on 05/10/2023. Final data, including a complete SDC pattern, has been sent to Eurostat on 27/10/2023.


15. Coherence and comparability Top
15.1. Comparability - geographical

The model questionnaire is generally used by the countries that conduct the survey on ICT usage and e-commerce in enterprises. Due to (small) differences in translation, in the used survey vehicle, in non-response treatment or different routing through the questionnaire, some results for some countries may be of reduced comparability. In these cases, notes are added in the data.

Detailed information on differences in the wording of the questions in the national questionnaires is available in “ Annex I _ Completeness “ excel file - related to questionnaire, coverage, additional questions.

15.1.1. Asymmetry for mirror flow statistics - coefficient

Not applicable

15.2. Comparability - over time

See section below.

15.2.1. Length of comparable time series

The length of comparable time series depends on the module and the variable considered within each survey module. Additional information is available in annexes attached to the European metadata.

No methodological changes occurred compared to the 2022 survey.

15.3. Coherence - cross domain

Not applicable

15.3.1. Coherence - sub annual and annual statistics

 Not applicable

15.3.2. Coherence - National Accounts

Not applicable

15.4. Coherence - internal

Not applicable


16. Cost and Burden Top
Restricted from publication


17. Data revision Top
17.1. Data revision - policy

Currently, no revision of the ICT data is foreseen.

17.2. Data revision - practice

ICT data are currently not revised.

17.2.1. Data revision - average size

 Not requested


18. Statistical processing Top
18.1. Source data

A) Frame population description and distribution

For more information see concept 18.1.1.

 

B) Sampling design - Sampling method

Description of the sampling method used (e.g. stratified random sample, quota sampling, cluster sampling; one-stage or two-stage sampling) and information which variables were used to stratify, the categories of those variables, in particular for the NACE Rev. 2 categories related to the "possible calculation of European aggregates", and the final number of strata: 

 

This section includes a description of the sampling method used (e.g. stratified random sample, quota sampling, cluster sampling; one-stage or two-stage sampling) and information which variables were used to stratify, the categories of those variables, in particular for the NACE categories related to the "possible calculation of European aggregates", and the final number of strata.

 

 The method used for sampling was a stratified random sample, with varying sampling rates depending on size class:
• For the two size classes 50-249 and 250+, the sampling rate was 100% (i.e. a census);
• For the size class 10-49, the sampling rate was generally fixed at 60%, the only exceptions being F41_43, G45-46, G47, and I56 with a sampling rate of 35%. For small strata where the sample would have consisted of 3 units or less, a minimum of 3 units were sampled.

 

 

 

 

C) Gross sample distribution

More detailed information is available in “ Sample and standard error tables 2023  “ excel file (Worksheet: GROSS SAMPLE)

 

D) Net sample distribution

More detailed information is available in “ Sample and standard error tables 2023  “ excel file (Worksheet: NET SAMPLE)

18.1.1. Population frame

A) Description of frame population

a) When was the sample for the ICT usage and e-commerce in enterprise survey drawn?             06/02/2023                  
b) When was the last update of the Business register that was used for drawing the sample of enterprises for the survey?    06/02/2023
c) Please indicate if the frame population is the same as, or is in some way coordinated with, the one used for the Structural Business Statistics (different snapshots)  The frame population used for the sample differs from the one used in SBS. However, the frame population for the final data production uses the same snapshot as the SBS preliminary results.
d) Please describe if different frames are used during different stages of the statistical process (e.g. frame used for sampling vs. frame used for grossing up):  The frame population used for sampling is based on an early version of the business register, where not all information on turnover or employment is complete. Therefore, this frame will be updated in September for the grossing up procedure.
e) Please indicate shortcomings in terms of timeliness (e.g. time lag between last update of the sampling frame and the moment of the actual sampling), geographical coverage, coverage of different subpopulations, data available etc., and any measures taken to correct it, for this survey.  Given that the sampling frame only contains 9 months’ worth of employment data for the reference year, we use the average employment over the 12 previous months as a criteria for the size classes used in sampling. These size classes are recalculated when the final frame population is drawn for grossing up and production of final results. 

 

 B) Frame population distribution

More detailed information is available in “ Sample and standard error tables 2023  “ excel file (Worksheet: FRAME POPULATION)

18.2. Frequency of data collection

Annual

18.3. Data collection

See detailed sections below.

18.3.1. Survey period
Survey / Collection Date of sending out questionnaires Date of reception of the last questionnaire treated
General survey  06/03/2023  05/10/2023
Micro-enterprises  Not applicable  Not applicable
18.3.2. Survey vehicle – general survey
General survey - Stand-alone survey
18.3.3. Survey vehicle – micro-enterprises
The collection of micro-enterprises was integrated with the general survey
18.3.4. Survey type

Possibility for respondents to choose between an online questionnaire or printing out and filling in a PDF version of the questionnaire.

18.3.5. Survey participation
Mandatory
18.4. Data validation

 

Data has been validated by server-based EDIT validation.

 

Additional checks are performed on the microdata and aggregated levels.

 

18.5. Data compilation

Grossing-up procedures

 

To treat non-response, the initial sampling weight is first adjusted using the response rate for each stratum. Strata are defined by crossing the following size classes and NACE groupings.

 

In order to obtain reliable results for quantitative variables (that are in line with SBS totals) the corrected weights are calibrated using to the number of units, the total turnover and the total employment per stratum as auxiliary information.

 

Calibration is carried out in R, using the “calib” method of the “sampling” package with a “logit” distance function.

 

 Please note that due to the small number of observations leading to co-linearity problems, some strata cannot be calibrated over all size classes. For these strata, several size classes were combined.

 

 The strata used for calibration consist of the strata listed in 13.2.1.1.  (e).

 

18.5.1. Imputation - rate

The imputation rate ranges from 0,05% to 1.50% depending on the question.

18.6. Adjustment

Not applicable

18.6.1. Seasonal adjustment

Not applicable


19. Comment Top

Problems encountered and lessons to be learnt: 

19.1. Documents
Questionnaire in national language  FR/DE
Questionnaire in English (if available)  X
National reports on methodology (if available)  
Analysis of key results, backed up by tables and graphs in English (if available)  
Other Annexes  


Related metadata Top


Annexes Top
Annex I._Completeness 2023
Annex II._ Accuracy 2023
Sample and standard error tables 2023
Annex I
Annex III
Questionnaire DE
Questionnaire EN
Questionnaire FR
Accuracy QR
Sample and standard errors QR