|
For any question on data and metadata, please contact: Eurostat user support |
|
|||
1.1. Contact organisation | Institut national de la statistique et des études économiques (Insee) |
||
1.2. Contact organisation unit | Département Répertoires, Infrastructures et Statistiques Structurelles Division élaboration des statistiques annuelles d'entreprise |
||
1.5. Contact mail address | 88 avenue Verdier 92120 Montrouge Bureau 3-A-044 |
|
|||
2.1. Metadata last certified | 31/03/2023 | ||
2.2. Metadata last posted | 31/03/2023 | ||
2.3. Metadata last update | 31/03/2023 |
|
|||
3.1. Data description | |||
Structural business statistics (SBS) describes the structure, conduct and performance of economic activities, down to the most detailed activity level (several hundred economic sectors). SBS covers all activities of the non-financial business economy with the exception of agricultural activities and personal services. Limited information is available on banking, insurance and pension funds. Main characteristics (variables) of the SBS data category:
|
|||
3.2. Classification system | |||
Statistical Classification of Economic Activities in the European Community (NACE): NACE Rev.1 was used until 2001, NACE Rev. 1.1 since 2002, and NACE Rev 2 is used from 2008 onwards. Key data were double reported in NACE Rev.1.1 and NACE Rev.2 for 2008. From 2009 onwards, only NACE Rev.2 data are available. The regional breakdown of the EU Member States is based on the Nomenclature of Territorial Units for Statistics (NUTS). Detailed information about the consecutive NUTS Regulations can be found at Eurostat's website The product breakdown is based on the Classification of Products by Activity (CPA) as stated in the Regulation establishing CPA 2008 and its amending Commission Regulation (EU) No 1209/2014 (from reference year 2015 onwards) |
|||
3.3. Coverage - sector | |||
The SBS coverage was limited to Sections C to K of NACE Rev.1.1 until 2007. Starting from the reference year 2008 data is available for Sections B to N and Division S95 of NACE Rev.2. With 2013 as the first reference year information is published on NACE codes K6411, K6419 and K65 and its breakdown. |
|||
3.4. Statistical concepts and definitions | |||
The statistical characteristics are defined in Annex I of Commission Regulation (EC) No 250/2009 |
|||
3.5. Statistical unit | |||
Since the reference year 2017, the statistical unit is the enterprise, as defined in the statistical unit Regulation N° 696/93 : « The enterprise is the smallest combination of legal units that is an organizational unit producing goods or services, which benefits from a certain degree of autonomy in decision-making, especially for the allocation of its current resources. An enterprise carries out one or more activities at one or more locations. An enterprise may be a sole legal unit ».
Since the reference year 2017, the French SBS results are produced in enterprise, and are fully complient with the statistical unit Regulation N° 696/93. However, the data transmitted by the French National Bank for class 64.11, 64.19 and division 65 are in legal units.
The data collection units are mainly the legal units. For the biggest groups, which are manually profiled, some data are collected at the enterprise level. For the others, automatic profiling is carried out using legal unit data only. |
|||
3.6. Statistical population | |||
The French SBS system Esane covers all market and economically consistent units, without any threshold of size, excluding financial units (save for class 64.20 and division 66) and agriculture. The statistics produced by the Esane process are supplemented by results produced by the French National Bank for class 64.11, 64.19 and division 65. |
|||
3.7. Reference area | |||
Only activities conducted on the French territory including overseas departments are considered. Branches of foreign units – that is local units not constituting separate legal entities but dependent on foreign enterprises and classified as quasi-corporations in accordance with ESA –, were, until 2014, with some exceptions, automatically considered as out of scope. They are now potentially in the scope, on the condition that they have declared a fiscal form to the French tax administration for the reference year or the previous year and secondly that they have at least one employee. |
|||
3.8. Coverage - Time | |||
1996-2020 |
|||
3.9. Base period | |||
Not applicable. |
|
|||
Ratios are expressed in percentages. |
|
|||
2020 |
|
|||
6.1. Institutional Mandate - legal acts and other agreements | |||
Year 1995 was the first year for the implementation of the Council Regulation No 58/97 (SBS Regulation). The Council Regulation No 58/97 has been amended three times: by Council Regulation No 410/98, Commission Regulation No 1614/2002 and European Parliament and Council Regulation No 2056/2002. As a new amendment of the basic Regulation it was decided to recast the Regulation No 58/97 in order to obtain a new "clean" legal text. The European Parliament and Council Regulation No 295/2008 was adopted on 14/02/2008 and the provisions of this Regulation are applicable from the reference year 2008. Regulation No 295/2008 has been amended by Commission Regulation (EU) No 446/2014. |
|||
6.2. Institutional Mandate - data sharing | |||
Not applicable. |
|
|||
7.1. Confidentiality - policy | |||
Statistical confidentiality is defined in Act no. 51-711 of 7 June 1951 (amended) on legal obligation, coordination and confidentiality in the field of statistics. Economic or financial information may not be communicated to anyone for twenty-five years, unless special dispensation is given, after an opinion from the statistical confidentiality committee, and the purpose must not be to use this information for tax inspection or punitive measures. For businesses, no results are published that concern fewer than three enterprises, nor any data where a single enterprise represents 85% or more of the total value obtained. The secondary confidential cells are also taken into account. However, the dissemination of lists extracted from the register of enterprises and establishments may mention economic activity, number of employees category and turnover size bracket. |
|||
7.2. Confidentiality - data treatment | |||
Primary suppression:
Secondary suppression:
To reduce the number of confidential cells, we have used a minimization algorithm (Modular) and the cost function for secondary suppression is equal to the response variable (except in case of negative values). |
|
|||
8.1. Release calendar | |||
Business data have been published on our website between August and September. Data are announced in a general calendar of INSEE publications. |
|||
8.2. Release calendar access | |||
8.3. Release policy - user access | |||
We release each year aggregated business data on our website. Moreover, we provide to the public authorities and a limited set of registered main users information which is not publicly available . |
|
|||
Annual |
|
|||
10.1. Dissemination format - News release | |||
There are no specific press releases directly linked to SBS data |
|||
10.2. Dissemination format - Publications | |||
Electronic publications are available on our website: - Synthetic data named "chiffres clés" are disseminated in about fifty data with possibly a graph. These data are available in French and in English, but only for one year, as they are replaced with data from the following year. - More detailed data: main aggregates, profit and loss account, balance sheet (liabilities, gross assets, accounting depreciation) , employment, investment, rates and branches data are released: |
|||
10.3. Dissemination format - online database | |||
No online database available |
|||
10.4. Dissemination format - microdata access | |||
non-anomnymized microdata are available on restricted access to researchers only, after examination by the French Secret Committee, which vouches for the respect by the researchers of the statistical confidentiality. |
|||
10.5. Dissemination format - other | |||
The data are sent to Eurostat, to be used in European aggregates and to be released also as national data |
|||
10.6. Documentation on methodology | |||
Methodology is available on the Website: Documentation about the statistical process Esane, definitions of variables: https://www.insee.fr/fr/information/4226820
https://www.insee.fr/fr/statistiques/6482871#documentation-sommaire https://www.insee.fr/fr/information/3056089
Documentation about the survey: http://www.insee.fr/fr/metadonnees/source/s1269 Methodology is available in English: |
|||
10.7. Quality management - documentation | |||
A national quality report on 2020 Esane results will be available by the end of 2023 . This report will be available in French. |
|
|||
11.1. Quality assurance | |||
A continuous quality control process is performed on the Esane system (French SBS system). Each year, the different sub-processes – the handling of restructuring, the data collection of the survey, the data editing process of the survey data, the data editing process of the administrative data, the consistency monitoring process between the different sources, and the final output editing step – are reviewed with a view toward ongoing improvement. The documentation and the training related to each sub-process are adjusted accordingly. |
|||
11.2. Quality management - assessment | |||
In addition to the quality report mentionned in §10.7 and to the process review mentioned in §11.1, a quality approach involving our main internal users and coordinated by the Quality Unit of the French NSI is conducted. Quality is evaluated as medium. |
|
|||
12.1. Relevance - User Needs | |||
We have regular consultations with some of our main users: National accounts, technical ministries for quantifying policy measures, professional organizations. The national data are more detailed (level of nomenclature, and additional variables) compared to SBS. The data available in the fiscal source are implemented with more details than asked for by SBS; results are also given at the NAF level (French version of the NACE), which is more detailed. |
|||
12.2. Relevance - User Satisfaction | |||
We have not organised a punctual or a regular survey related to the public users' satisfaction. The users are very interested by the results, especially at detailed levels. The satisfaction of our main internal users is regularly assessed, by regular meetings and through the quality approach previously mentioned. An internal mission of the General Inspectorate has been conducted to estimate the needs satisfaction degree of early users. |
|||
12.3. Completeness | |||
We provided the characteristic 15110 on Gross investment in tangible good but we are not able to provide the breakdown between 15120-15130-15140-15150 which are unavailable (nor in tax data, nor in the scope of our annual structural survey). The characteristic 16150, Number of hours worked by employees, is not available because we have no labor market aggregates for the class size and class level Eurostat asks for. The characteristic 13131, Amounts paid to temporary work agencies is not available, in our annual structural survey, for most of the requested sectors. Therefore it is not transmitted. Until 2018, variables 13320, Wages and Salaries and 16110, Number of person employed, were not available for the geographic area NUTS FRY5 (Mayotte). |
|
|||
13.1. Accuracy - overall | |||
As in all statistical process involving a survey, the results of the Esane system can be affected by sampling errors, coverage errors, non-response and measurement errors. However, as we will see in the following paragraphs, in principle, none of these potential sources of errors leads to a significant bias on the results. The French business register has no under coverage problems, and the use of external administrative information allows us to deal with the problem of over coverage. A complex data editing process – based on selective editing combined with micro-editing for the data editing carried out on a continuous flow basis during the production process supplemented by a final output editing process – aims to reduce as much as possible the significant measurement errors. Imputation procedures – imputation by historical information, failing that by class imputation means or medians – and reweighting procedures are used to deal with missing units/items, both in survey and administrative data. We try also to minimise the non-response in the survey, thanks to the mandatory character of the survey and to a strict follow-up procedure. Finally, the use of a complex adjustment procedure – involving the non-response adjustment steps already mentioned, a winsorization procedure and a calibration on margins based on administrative data – and of a specific estimator using jointly survey and administrative data tends to reduce as much as possible the non-response bias and to improve the accuracy of the statistics on the main variables of interest. |
|||
13.2. Sampling error | |||
Starting with the reference year 2016, the survey is drawn by stratified simple sample random sampling of enterprises. Except for the biggest groups, which are manually profiled and for which data are directly collected at the enterprise level, data are still collected on legal units, with the following rule: when an enterprise is selected, then all legal units within this enterprise will be surveyed (so, from the legal unit point of view, the sample design can be seen as a two-stage cluster sampling). Automatic algorithms allow then to obtain consolidated data at the enterprise level, based on the data collected at the legal unit level. A Neyman allocation under local precision constraints is applied (in the sampling frame of enterprises) in order to optimise the accuracy of the estimation of the total turnover, both at the global level and by activities (with enterprises concepts). Few months after the drawing of the sample, the delineation of the enterprises are updated thanks to the application of the weight share method (See Fizzala EESW 2019) Sample survey is combined with administrative sources. The use of a complex adjustment procedure – involving the non-response adjustment steps detailed in §13.3, a winsorization procedure to deal with outliers and a calibration on margins based on administrative data – and of a specific estimator using jointly survey and administrative data tends to reduce as much as possible the non-response bias and to improve the accuracy of the statistics on the main variables of interest. The computed CVs take the following aspects into account :
Taking the first two points into account, we use a self‑made R function which analytically computes variance under this context. The use of calibration techniques is taken into account by applying our self‑made R function to the residuals derived from the weighted least squares regression of the variable of interest on calibration variables. Outlier treatment is taken into account by adjusting these residuals for detected and treated outliers. Consequently, neither the unit non‑response adjustment by imputation in the take‑all part of the survey nor the unit non‑response adjustment by imputation in the administrative data is taken into account for computation of Cvs. |
|||
13.3. Non-sampling error | |||
Methods for taking into account the unit non-responses: - In the survey, the unit non response is taken into account by two different ways:
- Unit non‑response is treated by an imputation procedure for the administrative data. This imputation procedure (presented at the EESW 2013) is mostly based on the use of historical information. When this is not possible, the imputed values are implemented by strata medians. - Moreover, a calibration procedure is applied on the take‑some part of the sample, using turnover and number of enterprise – available in administrative data – as auxiliary variables. For minimising the unit non-response:
Weighted unit non-response rate: The sampling weights of legal units was used for weighting non-response. Groups 531, 559, 721, 722, 750 and 813 are not in the scope of the survey. Consequently there is no non-response rate for these groups. The recorded unit non-response rate in the overall context is evaluated as low. Bias: Bias resulting from non-response or from the estimation method: the estimates are unbiased under the RHG model (for the take‑some part of the sample) and under the assumptions of the imputation models (for the take‑all part of the sample and for the administrative data). Non response rate is higher this year because of the Covid crisis, so there should be more residual bias after non response treatments than usual. The bias of the estimate: small bias (with a limited effect in the overall accuracy of the estimate). The impact of imputation: Not important. Imputation of administrative data concerns only small or very small businesses and represent a very limited share of the aggregated fiscal turnover. Imputations of survey responses for businesses in the take-all strata mainly use past responses of the firm. Coverage errors - Frame: Almost none, because of the statistical inference that is made Out-of-scope units: The business register has no under coverage problems. External information (VAT files, fiscal data, registers of compulsory liquidations) is used to identify and discard out-of-scope units, which do not participate in the ex-post treatments and in the estimates. |
|
|||
14.1. Timeliness | |||
For the statistical survey, data collection spreads from February to December N+1. For the administrative data: we receive different files between April N+1 and January N+2, considered for SBS data of reference year N. Preliminary data (built from the administrative source) are sent to Eurostat in October N+1. First dissemination in France is in January N+2 (only in legal unit and for internal use by the national accounts). Definitive dissemination – both in legal units for internal users and in enterprises – is in July N+2. |
|||
14.2. Punctuality | |||
Data with 2020 as reference year has been delivered on June 29th 2022 |
|
|||
15.1. Comparability - geographical | |||
The same statistical concepts are applied in the entire national territory. |
|||
15.2. Comparability - over time | |||
Series are comparable from 1996 to 2007. There was a change of system for the production of results 2008. Moreover, the results of 2010 and years after include a new category of small enterprises created in 2009 (“autoentrepreneurs” in French); this category, if not very important considering macro-economic aggregates (as turnover), is composed of numerous small units. Moreover, the results from 2012 include for the first time some categories of small units without employees, kinds of French civil property companies and private management leasers, that are composed of numerous and in general small units, for most of them in sections F and L. In addition, in 2015, the variable that defines the productive character of the statistical units belonging to the field has changed, which improves the way we define the SBS statistical population. Furthermore, branches of foreign units – that is local units not constituting separate legal entities but dependent on foreign enterprises and classified as quasi-corporations in accordance with ESA – which until 2014 were, with some exceptions, automatically considered as out of scope, are now potentially part of the scope, on the condition that they have declared a fiscal form to the French tax adminstration for the 2015 year or the previous year, and secondly that they have at least one employee. Finally, since the reference year 2017, the French SBS results are produced in enterprise, which leads to a major break in time series compared to the previous years, when the SBS results were produced mainly in legal units, except for some of the biggest groups which were manually profiled. |
|||
15.3. Coherence - cross domain | |||
1. The number of enterprises of the register has not to be considered as the reference for the number of enterprises (since problems of over coverage do exist). 2. The SBS results, in terms of number of enterprises, number of persons employed or number of employees – are not consistent with the ones of the Business demography, for many reasons: the business demography statistics don’t deal with the problem of over coverage as in the SBS statistics, the business demography statistics are in legal units and not in enterprise, the statistical and inference procedures are not the same, etc. The consistency between SBS & BD will be improved in the next years, mainly by dealing with the first two causes mentioned above. 3. The results of the national accounts take different sources into account, not only SBS: there is a “final synthesis” - with also different concepts - that may be different from the SBS results. |
|||
15.4. Coherence - internal | |||
The aggregates are always consistent with their main sub-aggregates (e.g. the total for different size classes, NACE, CPA, NUTS...). The modality FRZ (extra region) of NUTS is not taken into account for EDIT controls. That explains messages appearing as "error" which are not relevant. |
|
|||
Esane was especially implemented to reduce businesses response burden. The use of administrative fiscal data enabled to drastically reduce the size of surveys questionnaire. An optimization of sampling design was carried on that enabled to reduce the size of the sample by half in take-some strata while keeping the same precision for estimates. Finally, a new coordination method has been introduced in 2013 : it enables coordination of all business surveys together, so as to smooth response burden between small and medium businesses while respecting each survey' sampling design. The method is described in more details in "Sampling coordination of business surveys conducted by Insee" a paper by Fabien Guggemos and Olivier Sautory, presented at the fourth International Conference on Establishment Survey (Montreal, 2012). |
|
|||
17.1. Data revision - policy | |||
No revision policy has been put into place. However, revisions occur at least once a year when provisional data published in November is replaced by definitive data. Revisions also occur when methodological or scope changes are made. We estimate these revisions to approximately represent 0.2 point of the value added growth rate. |
|||
17.2. Data revision - practice | |||
The data used for compiling the preliminary results comes from a first delivery of the fiscal files (arriving in Insee in July N+1). Definitive data are elaborated with both supplementary files from the fiscal source and data coming from the surveys. |
|
|||
18.1. Source data | |||
The sources are statistical survey combined with administrative source. The sample survey is conducted on a sample of legal units, and the administrative information is exhaustive The survey:
Administrative source:
- data source, basic data for some characteristics - data source for imputation, for strata not covered by the survey - data source for calibration to optimize inference from survey results - access to administrative micro-data
Frame: The register from which the sample is drawn is named "Ocsane"
|
|||
18.2. Frequency of data collection | |||
Annual data collection |
|||
18.3. Data collection | |||
It was initially an exclusively postal survey by questionnaire. From the reference year 2012, it was possible to answer via internet by filling out a form identical to the paper questionnaire, with a specific file format. Web questionnaire is available since the reference year 2018. The number of units surveyed is about 160 000. Dashboards are used to follow-up non-response. Enterprises are contacted again by mail twice if necessary and the fourth time with the threat of a financial sanction. Specific e-mail are sent to very large companies if they did not answer the first time.
We use an extraction from the fiscal database, transmitted by fiscal administration |
|||
18.4. Data validation | |||
Checks consist in:
The editing system is based on selective editing. For a description, see: https://www.unece.org/fileadmin/DAM/stats/documents/ece/ces/ge.44/2009/wp.41.e.pdf https://www.unece.org/fileadmin/DAM/stats/documents/ece/ces/ge.44/2012/25_France.pdf Moreover, this selective editing procedure carried out on a continuous flow basis during the production process is supplemented by a final output editing process. Finally, Edit tool controls are computed before sending data to Eurostat (format and file structure checks, Intra-dataset checks, Inter-dataset checks, times series checks). Inference (grossing-up): Statistical combined estimates are used, described in detail in: |
|||
18.5. Data compilation | |||
Imputation is used to treat item non‑response for all take‑some units in the survey. For the take‑all units as well as the administrative data, imputation is used to treat both item and unit non‑response. Imputation by historical data – using the previous reference period – is carried out as far as possible, otherwise, the mean, median or ratio imputation by class method is used. |
|||
18.6. Adjustment | |||
In case the enterprise has a fiscal year different from the calendar year, the data of this fiscal year are affected to the calendar year including the most important part of the fiscal year |
|
|||
Further information on employment data concept: Concerning employment data our concepts differ from those of Eurostat. For variable V16130 (number of employees) we take number of employees at 31/12, while Eurostat expects an average number of employees over the year. At the moment we don’t have the possibility to do so. These differences in concepts generate, for only few sectors, warning messages in the Eurostat control tool because the number of employees at 31/12 may be lower than the number of employees who worked on average over the year and even the number of employees in full-time equivalent units ==> so we can sometimes have V16130 lower than V16140 and therefore a warning in the controls.
From the year 2020, the number of employees provided in the Structural Business Statistics is estimated from a new source, which significantly improves the accuracy of the estimates. This change is being made in coordination with Business Demography Statistics as well as with the Business Register.
The new source led to an increase in the number of employees by approximately 6% for a given year. As a result of this change, the numbers of employees provided for the year 2020 (and any statistics broken down by employee size classes) are not comparable to those provided for the year 2019.
|
|
|||
|
|||
Methodological note |