1.1. Contact organisation
CZECH STATISTICAL OFFICE
1.2. Contact organisation unit
Research, Development and Information Society Statistics Unit
1.3. Contact name
Confidential because of GDPR
1.4. Contact person function
Confidential because of GDPR
1.5. Contact mail address
Czech Statistical Office
Society Development Statistics Department
Na padesatem 81
Praha 10
100 82
1.6. Contact email address
Confidential because of GDPR
1.7. Contact phone number
Confidential because of GDPR
1.8. Contact fax number
2.1. Metadata last certified
16 September 2024
2.2. Metadata last posted
16 September 2024
2.3. Metadata last update
16 September 2024
3.1. Data description
The Community Innovation Survey (CIS) is a survey about innovation activities in enterprises. The survey is designed to collect the information on types of innovation, processes of development of innovation like cooperation patterns, financing and expenditure, objectives of innovation activities or barriers for initiating or implementing innovation.
The CIS provides statistics by type of innovators, economic activity and size class of enterprises. The survey is currently carried out every two years across the EU Member States, EFTA countries and EU candidate countries.
In order to ensure comparability across countries, Eurostat together with the countries develops a Harmonised Data Collection (HDC) questionnaire and drafts the methodological recommendations for implementation of each survey round.
The CIS 2022 implements the concepts and methodology of the Oslo Manual 4th Edition revised in 2018. The changes that the CIS has undergone due to the revision of the manual and their impact on the indicators collected are described in the Statistics Explained article: Community Innovation Survey – new features.
The legal framework for CIS 2022 is the Commission Implementing Regulation (EU) 2022/1092, which sets out the quality conditions and identifies the obligatory cross-coverage of economic sectors, size class of enterprises and innovation indicators. The target population is enterprises with at least 10 employed persons (sum of employees and self-employed persons) classified in the core NACE economic sectors (see 3.3). Further activities may be covered on a voluntary basis in national datasets. Most statistics are based on the 3-year reference period (t, t-1, t-2), but some use only one calendar year (t or t-2).
3.2. Classification system
Indicators related to the enterprises are classified by country, economic activity (NACE Rev. 2), size class of enterprises and type of innovation.
The main typology of classification of enterprises in reference to innovation is the distinction between innovation-active enterprises (INN) and not innovation-active enterprises (NINN).
The enterprise is considered as innovative (INN) if during the reference period it successfully introduced a a) product or a) business process innovation, c) completed but not yet implemented the innovation, d) had ongoing innovation activities, e) abandoned innovation activities or was f) engaged in in-house R&D or R&D contracted out. Non-innovative (NINN) enterprises had no innovation activity mentioned above whatsoever during the reference period.
3.3. Coverage - sector
3.3.1. Main economic sectors covered - NACE Rev.2
In accordance with the Commission Implementing Regulation (EU) 2022/1092 on innovation statistics, the following sectors of the economic activity are included in the core target population: NACE Sections B, C, D, E, H, J, K, and Divisions 46, 71, 72 and 73.
3.3.1.1. Main economic sectors covered - NACE Rev.2 - national particularities
No national particularities.
3.3.2. Sector coverage - size class
In accordance with Commission Implementing Regulation (EU) 2022/1092 on innovation statistics, only the enterprises with 10 or more employed persons (sum of employees and self-employed persons) are included in the core target population.
3.3.2.1. Sector coverage - size class - national particularities
No national particularities.
3.4. Statistical concepts and definitions
The description of concepts, definitions and main statistical variables will be available in CIS 2022 European metadata file (ESMS) Results of the community innovation survey 2022 (CIS2022) (inn_cis13) in Eurostat database.
3.5. Statistical unit
The statistical unit is an enterprise which corresponds to the legal unit with the identification number given by the Business Register.
The unit of observation was the legal unit. Sampling was carried out at the level of the legal units.
Process for obtaining results at Statistical Unit Enterprise level:
- For qualitative variables:
For the few complex enterprises in the CZ-CIS, data were collected for the leading legal unit as designated by the statistical business register, which is treated as the representative of the enterprise.
- For quantitative variables:
No aggregation method was used. The value of the leading unit is used as an approximation.
3.6. Statistical population
Core target population is all enterprises in CORE NACE activities (see 3.3.1) with 10 or more employed persons (sum of employees and self-employed persons).
3.7. Reference area
The geographic area to which the measured statistical phenomenon relates is Czechia (NUTS1) for all variables (data) and NUTS2 for selected variables (data).
NUTS2 was used as a geographical stratification dimension.
The CIS2022 regional data are calibrated at the level of the legal units.
3.8. Coverage - Time
Several rounds of Community Innovation Survey have been conducted so far at two-year interval since the end of the 90’s.
3.8.1. Participation in the CIS waves
| CIS wave | Reference period | Participation (Yes/No) | Comment (deviation from reference period) |
|---|---|---|---|
| CIS2 | 1994-1996 | No | |
| CIS3 | 1998-2000 | Yes | 1999-2001 |
| CIS light | 2002-2003* | Yes | |
| CIS4 | 2002-2004 | Yes | 2003-2005 |
| CIS2006 | 2004-2006 | Yes | |
| CIS2008 | 2006-2008 | Yes | |
| CIS2010 | 2008-2010 | Yes | |
| CIS2012 | 2010-2012 | Yes | |
| CIS2014 | 2012-2014 | Yes | |
| CIS2016 | 2014-2016 | Yes | |
| CIS2018 | 2016-2018 | Yes | |
| CIS2020 | 2018-2020 | Yes | |
| CIS2022 | 2020-2022 | Yes |
*two reference periods can be distinguished for CIS light: 2000-2002 and 2001-2003
3.9. Base period
Not relevant.
CIS indicators are available according to 3 units of measure:
NR: Number for number of enterprises and number of persons employed.
THS_EUR: Thousands of euros. All financial variables are provided in thousands of euros, i.e. Turnover or Innovation expenditure.
PC: Percentage. The percentage is the ratio between the selected combinations of indicators.
For CIS 2022, the time covered by the survey is the 3-year period from the beginning of 2020 to the end of 2022.
Some questions and indicators refer to one year — 2022.
The list of indicators specifying whether they cover the 3-year period or refer to one year according to the HDC will be available in the Annex section of the European metadata (ESMS).
6.1. Institutional Mandate - legal acts and other agreements
The CIS is based on the Commission Implementing Regulation (EU) 2022/1092, implementing Regulation (EU) 2019/2152 of the European Parliament and of the Council on the production and development of Community statistics on science and technology.
This Regulation establishes innovation statistics on a statutory basis and makes the delivery of certain variables compulsory e.g. innovation activities, cooperation, development, expenditures and turnover (see the Regulation). Each survey wave may additionally include further variables.
In addition, the Regulation defines the obligatory cross-coverage of economic sectors and size class of enterprises.
6.1.1. National legislation
There is no specific Czech legislation for the production of CIS statistics. The European Legislation applies.
The Law on Statistics of the Czech Republic is the main national legal act regarding official statistics. Production of CIS statistics as well as other official statistics is included in the annual Official Statistics Work Programme.
6.2. Institutional Mandate - data sharing
Not requested.
7.1. Confidentiality - policy
See the Framework Security Policy in the attached document.
Annexes:
the_czso_framework_security_policy.pdf
7.2. Confidentiality - data treatment
In producing the standard statistical outputs, a number of procedures were implemented to prevent the release of information that identified characteristics about an enterprise.
1) Restricting the number of output categories into which a variable may be classified.
2) Where the number of enterprises in a group falls below a minimum threshold, the statistical output is not published.
The rule concerning the confidentiality applied for treating the tabular data is following: less than 3 enterprises or at least 85% of turnover for one enterprise.
8.1. Release calendar
The main indicators were published in a press release in June 2024.
All indicators (data) were published in the regular publication Innovation activities of enterprises during 2020-2022 in June 2024.
8.2. Release calendar access
8.3. Release policy - user access
The CIS data are disseminated to all users through the Official Statistics Portal. The data are released simultaneously to all interested parties by issuing a news release. At the same time the news release is also e-mailed to the media. The news release is issued in Czech only. All published CIS data are available for users for free. Special datasets made ad hoc for researches/students are chargeable. Anonymized microdata are available at the Safe Centre of CZSO. The access to these microdata is paid.
Annexes:
diseminace csu_aj.pdf
CIS is conducted and disseminated at two-year interval in pair years.
Accessibility and clarity refer to the simplicity and ease for users to access statistics using simple and user-friendly procedure, obtaining them in an expected form and within an acceptable time period, with the appropriate user information and assistance: a global context which finally enables them to make optimum use of the statistics.
10.1. Dissemination format - News release
See below.
10.1.1. Availability of the releases
| Dissemination and access | Availability | Comments, links, ... |
|---|---|---|
| Press release | Yes | Only Czech version. Podíl inovujících firem klesá | Produkty (gov.cz) |
| Access to public free of charge | Yes | Only Czech version. Inovační aktivity podniků - 2020 až 2022 | Produkty (gov.cz) |
| Access to public restricted (membership/password/part of data provided, etc) | Yes | Only Czech version. |
10.2. Dissemination format - Publications
- Online database (containing all/most results): No.
- Analytical publication (referring to all/most results): Yes.
- Analytical publication (referring to specific results, e.g. only for one sector or one specific aspect): No.
10.3. Dissemination format - online database
Not available.
10.3.1. Data tables - consultations
Not requested.
10.4. Dissemination format - microdata access
Microdata are accessible only at the National SAFE centre and Eurostat SAFE centre.
10.4.1. Dissemination of microdata
| Mean of dissemination | Availability of microdata | Comments, links, ... |
|---|---|---|
| Eurostat SAFE centre | Yes | |
| National SAFE centre | Yes | Statistical Data for Scientific Research Purposes | Statistics (gov.cz) |
| Eurostat: partially anonymised data (SUF) | Yes | |
| National: partially anonymised data | No |
10.5. Dissemination format - other
No other means of dissemination.
10.5.1. Metadata - consultations
Not requested.
10.6. Documentation on methodology
The complete meta-information (methodology of survey, definition of variables) on the CIS is available for users on the website of CZSO.
10.6.1. Metadata completeness - rate
Not requested.
10.7. Quality management - documentation
Quality related documents (quality reports, studies, etc.) are not publicly available. Only selected indicators are published (e.g. overall non-response rate).
11.1. Quality assurance
See attached annex.
Annexes:
CZSO_quality_commitment.pdf
11.2. Quality management - assessment
Assessment of the quality uses the following components:
- Standard STS (Simple, Transparent, and Standardised) compliance assessments carried out by Eurostat. This assessment is used internally as well.
- Regular response rate measurement.
- Regular methodological update and ad hoc methodological audit carried out every 10 years.
The overall quality of the national methodology is good. Improving of the national methodology regarding CIS and data quality is ongoing process in line with ICT development.
12.1. Relevance - User Needs
The CZSO primarily fulfils its commitments towards the European Commission, which represents the high priority user. However, depending on the capacity available the CZSO treats all the users equally.
The relevance of an instrument has to be assessed in the light of the needs of its users. As for EU Statistics on innovations in enterprises (CIS) the main users are the following:
- Institutional users having access to aggregative data for policy-making;
- Statistical users in Eurostat or in National Statistical Institutes to feed sectorial or transversal publications;
- Researchers having access to microdata;
- End users – including the media - interested in innovations in the EU area.
12.1.1. Needs at national level
| User group | Short description of user group | Main needs for CIS data of the user group Users’ needs |
|---|---|---|
| 1. Institutions - European level | The European Commission (DG ENTR) | Data used for the European Innovation Scoreboard and its further development |
| 1. Institutions - International organisations | OECD | Data used for analyses |
| 1. Institutions - National level | Ministry of Industry and Trade | Data used for analyses (micro-data) |
| 1. Institutions - National level | R&D&I Council | Data used for analyses and policy making |
| 1. Institutions – Regional level | Regional Innovation Agencies | Data used for analyses and policy making (regional innovation strategies) |
| 2. Social actors | Association of Innovative Entrepreneurship CR, Technology Center Prague | Data used for analyses and research (using micro-data) |
| 3. Media | National and regional media | Data used for analyses |
| 4. Researchers and students | Center for Economic Research and Graduate Education – Economics Institute, Charles University, students | Data used for analyses and research |
12.2. Relevance - User Satisfaction
No user satisfaction survey is carried out exclusively for CIS indicators on the national level. However, CZSO is constantly in contact with different groups of users, e.g. Ministry of Industry and Trade, Czech National Bank, different associations, internal users etc. Formal meeting with users is organized regularly by the General Methodology Department. Data users are asked for a potential feedback if they receive data, publications or special outputs.
Every 5 years, CZSO organises the national user satisfaction survey. Last one was carried out in 2020.
12.3. Completeness
All mandatory CIS 2022 questions were implemented, except for the option "Protection of bio-diversity" in the question 6.1.
12.3.1. Data completeness - rate
Not requested.
13.1. Accuracy - overall
Accuracy in the statistical sense denotes the closeness of computations or estimates to the exact or true values. Statistics are not equal with the true values because of variability (the statistics change from implementation to implementation of the survey due to random effects) and bias (the average of the possible values of the statistics from implementation to implementation is not equal to the true value due to systematic effects).
13.2. Sampling error
Restricted from publication
13.2.1. Sampling error - indicators
The main indicator used to measure sampling errors for CIS data is the coefficient of variation (CV).
CV= Coefficient of variation (%) = 100 * (Square root of the estimate of the sampling variance) / (Estimated value)
Formula:

where

and

13.2.1.1. Coefficient of variations for key variables
Coefficient of variation (%) for key variables by NACE categories and for enterprises with 10 or more employed persons
| NACE | Size class |
(1) |
(2) |
(3) |
|---|---|---|---|---|
| Core NACE (B-C-D-E-46-H-J-K-71-72-73) | Total |
0.960 | 0.281 | 1.180 |
| Core industry (B_C_D_E - excluding construction) | Total |
0.880 |
0.326 | 1.126 |
| Core Services (46-H-J-K-71-72-73) | Total |
1.772 |
0.404 |
2.223 |
(1) = Coefficient of variation for the percentage of innovative enterprises (INN) in the total population of enterprises (ENT)
(2) = Coefficient of variation for the turnover of product innovative enterprises with new or improved products (TUR_PRD_NEW_MKT), as a percentage of total turnover of product innovative enterprises [TOVT,INNO_PRD].
(3) = Coefficient of variation for percentage of product and/or process innovative enterprises (incl. enterprises with abandoned and or on-going activities) involved in any innovation co-operation arrangement [COOP_ALL,INN], as a percentage of innovative enterprises (INN).
13.2.1.2. Variance estimation method
See the annex: 13.2.1.2. Variance estimation method.pdf.
Annexes:
13.2.1.2. Variance estimation method.pdf
13.3. Non-sampling error
Non-sampling errors occur in all phases of a survey. They add to the sampling errors (if present) and contribute to decreasing overall accuracy. It is important to assess their relative weight in the total error and devote appropriate resources for their control and assessment.
13.3.1. Coverage error
Coverage errors (or frame errors) are due to divergences between the target population and the frame population. The frame population is the set of target population members that have a chance to be selected into the survey sample. It is a listing of all items in the population from which the sample is drawn that contains contact details as well as sufficient information to perform stratification and sampling.
13.3.1.1. Over-coverage - rate
Not requested.
13.3.1.2. Common units - proportion
Not requested.
13.3.1.3. Under covered groups of the target population
Because the corresponding list of units with the number social of insurance policyholders, is handed over with monthly periodicity with about a two-month delay after the end of the reference period, some coverage errors can occur at time of sampling.
13.3.1.4. Coverage errors in coefficient variation
The coefficient of variation reported under 13.2.1.1 incorporates the effects of coverage errors.
13.3.2. Measurement error
Measurement errors occur during data collection and generate bias by recording values different than the true ones. The survey questionnaire used for data collection may have led to the recording of wrong values, or there may be respondent or interviewer bias.
13.3.2.1. Measures for reducing measurement errors
Regarding overvalued innovation expenditure the following ratio is calculated as innovation expenditures in the given year/ turnover in the given year which should be lower than 1. Innovation expenditures on in-house research and development is also compared with research and development expenditures in Annual Research and Development Survey. A special training is organized for staff that process the questionnaires and transfer data from printed questionnaires to databases (data entry).
13.3.3. Non response error
Non response occurs when a survey fails to collect data on all survey variables from all the population units designated for data collection in a sample or complete enumeration.
There are two types of non-response:
1) Unit non-response, which occurs when no data (or so little as to be unusable) are collected about a population unit designated for data collection.
- Un-weighted unit non-response rate (%) = 100*(Number of units with no response or not usable response) / (Total number of in-scope (eligible) units in the sample).
- Weighted unit non-response rate (%) = 100*(Number of weighted units with no response or not usable response) / (Total number of in-scope (eligible) units in the sample).
2) Item non-response, which occurs when only data on some, but not all survey data items are collected about a population unit designated for data collection.
- Un-weighted item non-response rate (%) = 100*(Number of units with no response at all for the item) / (Total number of eligible, for the item, units in the sample i.e. filters have to be taken into account).
The overall unit non-response rate is very good with low values. The weighted unit non-response rate and un-weighted unit non-response rate are both around 11 %.
No imputation regarding a non-responding unit is applied.
13.3.3.1. Unit non-response - rate
See below.
13.3.3.1.1. Un-weighted and weighted unit non-response rate by NACE categories and for enterprises with 10 or more employed persons
Un-weighted and weighted unit non-response rate by NACE categories and for enterprises with 10 or more employed persons
| NACE | Number of eligible units with no response | Total number of eligible units in the sample | Un-weighted unit non-response rate (%) | Weighted unit non-response rate (%) |
| Core NACE (B-C-D-E-46-H-J-K-71-72-73) | 576 | 7933 | 7.3 | 8.1 |
| Core industry (B_C_D_E - excluding construction) | 381 | 5079 | 7.5 | 8.0 |
| Core Services (46-H-J-K-71-72-73) | 195 | 2854 | 6.8 | 8.3 |
The number of eligible units is the number of sample units, that ultimately indeed belong to the target population.
13.3.3.1.2. Maximum number of recalls/reminders before coding
Two reminders are sent and also telephone follow-ups are carried out if it is needed.
13.3.3.2. Item non-response - rate
See below.
13.3.3.2.1. Item non-response rate for Turnover (in Core NACE: B-C-D-E-46-H-J-K-71-72-73 enterprises with 10 or more employed persons)
Item non-response rate for Turnover (in Core NACE: B-C-D-E-46-H-J-K-71-72-73 enterprises with 10 or more employed persons).
| Item non-response rate (un-weighted) (%) |
Imputation (Yes/No) |
If imputed, describe method used, mentioning which auxiliary information or stratification is used | |
| Turnover | 0 | No |
13.3.3.2.2. Item non response rate for new questions
Item non-response rate for new questions in CIS t (in Core NACE: B-C-D-E-46-H-J-K-71-72-73 enterprises with 10 or more employed persons)
| NEW QUESTIONS IN CIS 2022 | Inclusion in national questionnaire (Yes/No) | Item non response rate (un-weighted) (%) | Comments |
|---|---|---|---|
| 3.9 -- Reasons for not having more innovation activities | Yes | 9.5 | |
| 3.10 -- Reasons for having no innovation activities | Yes | 23.5 |
13.3.4. Processing error
Main data entry methods are: data scanning or keying for the printed questionnaire (1/3 of all) and CASI/e-PDF for the electronic questionnaire (2/3 of all). The errors’ catching is maintained by programming tools which identify hard and soft errors. Programming tools find incomplete answers and warn of the given type of error. Errors which were found during this checking process are corrected manually.
Processing errors are not significant.
13.3.5. Model assumption error
Not requested.
Timeliness and punctuality refer to time and dates, but in a different manner.
14.1. Timeliness
The timeliness of statistics reflects the length of time between data availability and the event or phenomenon they describe.
14.1.1. Time lag - first result
Timeliness of national data – date of first release of national level: 18 months after the reference year.
14.1.2. Time lag - final result
Not requested.
14.2. Punctuality
Punctuality refers to the time lag between the release date of data and the target date on which they were scheduled for release as announced officially.
14.2.1. Punctuality - delivery and publication
Date of transmission of complete and validated data to Eurostat (Number of days between that date and 30 June 2024) : 2 days.
15.1. Comparability - geographical
Comparability aims at measuring the impact of differences in applied statistical concepts and definitions on the comparison of statistics between geographical areas, non-geographical domains, or over time.
The coherence of statistical outputs refers to the degree to which the statistical processes by which they were generated used the same concepts (classifications, definitions, and target populations) and harmonised methods. Coherent statistical outputs have the potential to be validly combined and used jointly.
No significant problems of comparability between regions at the level NUTS 2 of Czechia. The stratification of the sample is set to eliminate possible problems regarding comparability between regions at the NUTS 2.
15.1.1. Asymmetry for mirror flow statistics - coefficient
Not requested.
15.1.2. National questionnaire – compliance with Eurostat model questionnaire
Methodological deviations from the CIS Harmonised Data Collection (HDC)
| Questions not included in national questionnaire compared to HDC | Comment |
| 2.1, 5.1, 5.3, 5.4, 6.2, 7.2, 7.6, 7.8, 7.9 | |
| 6.1 - Protection of bio-diversity |
| Changes in the filtering compared to HDC | Comment |
15.1.3. National questionnaire – additional questions
Methodological deviations from the CIS Harmonised Data Collection (HDC)
| Additional questions in national questionnaire (not included in HDC) | Comment |
|---|---|
| Geographic markets where an enterprise sells goods and/or services | Yes/No options |
| Results/impacts of implemented product innovations | Yes/No options |
| Design activities | Yes/No options, expenditure on design activities |
15.2. Comparability - over time
Due to important methodological changes introduced by Oslo Manual 2018, the data from 2018 onwards cannot be directly compared with CIS waves prior to 2018.
15.2.1. Length of comparable time series
Not requested.
15.3. Coherence - cross domain
See the comparison between SBS and CIS data in the section 15.3.3 below.
15.3.1. Coherence - sub annual and annual statistics
Not requested.
15.3.2. Coherence - National Accounts
Not requested.
15.3.3. Coherence – Structural Business Statistics (SBS)
This part compares key variables for aggregated CIS data with SBS data
Definition of relative difference between CIS and SBS data: DIFF = (SBS/CIS)*100
Comparison between SBS and CIS data (relative difference) by NACE categories and for enterprises with 10 or more employed persons
| NACE | Size class | Number of enterprises (SBS/CIS)* | Number of employed persons (SBS/CIS)* | Total Turnover (SBS/CIS)* |
|---|---|---|---|---|
| Core NACE (B-C-D-E-46-H-J-K-71-72-73) | Total | 98.6 | 95.4 | 104.2 |
| Core industry (B_C_D_E - excluding construction) | Total | 100.1 | 100.2 | 115.3 |
| Core Services (46-H-J-K-71-72-73) | Total | 96.9 | 87.1 | 88.9 |
* Numbers are to be provided for the last year of the reference period (t)
15.4. Coherence - internal
Not requested.
17.1. Data revision - policy
Not requested.
17.2. Data revision - practice
Not requested.
17.2.1. Data revision - average size
Not requested.
18.1. Source data
See below:
18.1.1. Sampling frame (or census frame)
The frame population is the same as the one for the Structural Business Survey (SBS). Following ESA 2010 sectors are included: S.11+S.12+S.141.
18.1.2. Sampling design
Sampling design is a combination of stratified random sampling (enterprises with 10-249 employees of selected two-digit NACE industries) and census (see 12.3.3). Two-digit NACE, NUTS2 and size-class according to the Business Register were used for stratification. The sample was designed with no reference to any other survey. Following ESA 2010 sectors are covered: S.11+S.12+S.141. The total number of strata is 1320 including special strata for units with unknown NACE.
18.1.3. Target population and sample size
| Sample/census indicator | Number of enterprises |
| Target population (A) (*) | 25228 |
| Sample (B = C+D) | 8367 |
| In case of combination sample/census: | |
| Sampled units (C) | 6932 |
| Enumerated units/census (D) | 1435 |
| Overall sample rate (E = 100*B/A) | 33.2 % |
(*) CIS core population, i.e. NACE Rev.2 B-C-D-E-46-H-J-K-71-72-73 enterprises with 10 or more employed persons.
18.1.4. Data source for pre-filled variables
Variables and indicators filled or prefilled from other sources.
| Variables/Indicators | Source | Reference year |
|---|---|---|
| TAX_CRED_RNDINN | General Financial Directorate (GFD) - data from Corporate Income Tax returns | 2020-2022 |
18.1.5. Data source and variables used for derivation and weighting
| Item | Response |
|---|---|
| Data source used for deriving population totals | Number of persons employed - SBS and other yearly business surveys Total turnover - SBS and administrative tax data |
| Variables used for weighting | Number of persons employed - SBS and other yearly business surveys Total turnover - SBS and administrative tax data Number of enterprises in the sampling frame Number of social security policyholder - from social security administrative data |
18.2. Frequency of data collection
According to the Commission Implementing Regulation (EU) 2022/1092, the innovation statistics shall be provided to Eurostat every two years in each even year. The data collection takes place every second year in year t-2 preceding the data provision.
18.3. Data collection
The CZ-CIS survey in the Czech Republic is a combination of computer assisted filling in (on-line web) and e-PDF questionnaires. In limited cases, the paper questionnaire can be also used. Respondents receive instructions (via a message in their data mailbox) describing how to get the online questionnaire (computer assisted) or have an opportunity to download the online questionnaire from the CZSO website. Paper questionnaires are sent to enterprises that do not own data mailboxes (limited number of enterprises).
During the data collection, checking and processing period all received data are checked primarily by junior statisticians (secondary by the Project manager). Online questionnaires (computer assisted) includes all necessary consistency checks (controls). There are approximately 150 of them every year. Inconsistencies in controls notify junior statisticians of necessity to check answers. Phone calls are used for validation of the data (corrections) and quality improvement if necessary. The year-to-year checks in the aggregate data (before data delivery to Eurostat) are made.
No imputation for item non-response is used, except for imputation of some background information that will not be available in the SBS database.
18.3.1. Survey participation
The survey is mandatory.
18.3.2. Survey type
Data are collected through the combination of a census and sample survey.
18.3.3. Combination of sample survey and census data
Covered by Census:
- Number of employees according to the Business Register >=250 of ALL CORE NACE.
- Number of employees according to the Business Register =50-249 of NACE=5-9, 12, 19, 37, 39, 50-51, 53, 59-60.
Covered by sample survey:
- Number of employees according to the Business Register =50-249 of NACE=10-11, 13-18, 20-36 , 38, 46, 49, 52, 58, 61-66, 71-73.
- Number of employees according to the Business Register =10-49 of all NACE.
18.3.4. Census criteria
Covered by Census:
- Number of employees according to the Business Register >=250 of ALL CORE NACE.
- Number of employees according to the Business Register =50-249 of NACE=5-9, 12, 19, 37, 39, 50-51, 53, 59-60.
18.3.5. Data collection method
Data collection method
| Survey method | Yes/No | Comment |
|---|---|---|
| Face-to-face interview | No | |
| Telephone interview | Yes | The telephone interview and email were used for checking and editing primary data. |
| Postal questionnaire | Yes | |
| Electronic questionnaire (format Word or PDF to send back by email) | Yes | e-PDF |
| Web survey (online survey available on the platform via URL) | Yes | |
| Other | - |
18.4. Data validation
Not requested.
18.5. Data compilation
Operations performed on data to derive new information according to a given set of rules.
18.5.1. Imputation - rate
Imputation is the method of creating plausible (but artificial) substitute values for all those missing.
Definition of imputation rate: Imputation rate (for the variable x) (%) = 100 * (Number of replaced values) / (Total number of values for a given variable).
Definition of weighted imputation rate: Weighted imputation rate= 100 * (Number of total weighted replaced values) / (Total number of weighted values for a given variable).
18.5.1.1. Imputation rate for metric variables
Imputation rate (%) for metric variables by NACE categories and for enterprises with 10 or more employed persons:
| NACE | Size class | Total Turnover (1) | Turnover from products new to the market (2) | R&D expenditure in-house (3) | |||
|---|---|---|---|---|---|---|---|
| Unweighted | Weighted | Unweighted | Weighted | Unweighted | Weighted | ||
| Core NACE (B-C-D-E-46-H-J-K-71-72-73) | Total | 0 | 0 | 0 | 0 | 0 | 0 |
| Core industry (B_C_D_E - excluding construction) | Total | 0 | 0 | 0 | 0 | 0 | 0 |
| Core Services (46-H-J-K-71-72-73) | Total | 0 | 0 | 0 | 0 | 0 | 0 |
(1) = Imputation rate (%) for the total turnover in the last year of the reference period (t) (TUR)
(2) = Imputation rate (%) for the share of the turnover in the last year of the reference period (t) due to new or improved product new to the market in the total turnover for product innovative enterprises TUR_PRD_NEW_MKT/TOVT(INNO_PRD)
(3) = Imputation rate (%) for the R&D expenditure performed in-house (EXP_INNO_RND_IH)
18.5.2. Weights calculation
Weights calculation method for sample surveys
| Method | Selected applied method | Comments |
|---|---|---|
| Inverse sampling fraction | ||
| Non-respondent adjustments | ||
| Other | GREG estimators (calibration method) | The starting weights are calibrated in 3 stages to treat non-response and provide more accurate and coherent estimates: + stage 1: weight are adjusted to preserve estimated totals of units and number of social security policyholders in the strata. + stage 2: weight from the previous are adjusted to preserve estimated totals of units and number of employees in the strata. + stage 3: weight from the previous are adjusted to preserve estimated totals of units and turnover in the strata |
18.6. Adjustment
No procedures employed to modify statistical data are applied.
18.6.1. Seasonal adjustment
Not requested.
No further comments.
The Community Innovation Survey (CIS) is a survey about innovation activities in enterprises. The survey is designed to collect the information on types of innovation, processes of development of innovation like cooperation patterns, financing and expenditure, objectives of innovation activities or barriers for initiating or implementing innovation.
The CIS provides statistics by type of innovators, economic activity and size class of enterprises. The survey is currently carried out every two years across the EU Member States, EFTA countries and EU candidate countries.
In order to ensure comparability across countries, Eurostat together with the countries develops a Harmonised Data Collection (HDC) questionnaire and drafts the methodological recommendations for implementation of each survey round.
The CIS 2022 implements the concepts and methodology of the Oslo Manual 4th Edition revised in 2018. The changes that the CIS has undergone due to the revision of the manual and their impact on the indicators collected are described in the Statistics Explained article: Community Innovation Survey – new features.
The legal framework for CIS 2022 is the Commission Implementing Regulation (EU) 2022/1092, which sets out the quality conditions and identifies the obligatory cross-coverage of economic sectors, size class of enterprises and innovation indicators. The target population is enterprises with at least 10 employed persons (sum of employees and self-employed persons) classified in the core NACE economic sectors (see 3.3). Further activities may be covered on a voluntary basis in national datasets. Most statistics are based on the 3-year reference period (t, t-1, t-2), but some use only one calendar year (t or t-2).
16 September 2024
The description of concepts, definitions and main statistical variables will be available in CIS 2022 European metadata file (ESMS) Results of the community innovation survey 2022 (CIS2022) (inn_cis13) in Eurostat database.
The statistical unit is an enterprise which corresponds to the legal unit with the identification number given by the Business Register.
The unit of observation was the legal unit. Sampling was carried out at the level of the legal units.
Process for obtaining results at Statistical Unit Enterprise level:
- For qualitative variables:
For the few complex enterprises in the CZ-CIS, data were collected for the leading legal unit as designated by the statistical business register, which is treated as the representative of the enterprise.
- For quantitative variables:
No aggregation method was used. The value of the leading unit is used as an approximation.
Core target population is all enterprises in CORE NACE activities (see 3.3.1) with 10 or more employed persons (sum of employees and self-employed persons).
The geographic area to which the measured statistical phenomenon relates is Czechia (NUTS1) for all variables (data) and NUTS2 for selected variables (data).
NUTS2 was used as a geographical stratification dimension.
The CIS2022 regional data are calibrated at the level of the legal units.
For CIS 2022, the time covered by the survey is the 3-year period from the beginning of 2020 to the end of 2022.
Some questions and indicators refer to one year — 2022.
The list of indicators specifying whether they cover the 3-year period or refer to one year according to the HDC will be available in the Annex section of the European metadata (ESMS).
Accuracy in the statistical sense denotes the closeness of computations or estimates to the exact or true values. Statistics are not equal with the true values because of variability (the statistics change from implementation to implementation of the survey due to random effects) and bias (the average of the possible values of the statistics from implementation to implementation is not equal to the true value due to systematic effects).
CIS indicators are available according to 3 units of measure:
NR: Number for number of enterprises and number of persons employed.
THS_EUR: Thousands of euros. All financial variables are provided in thousands of euros, i.e. Turnover or Innovation expenditure.
PC: Percentage. The percentage is the ratio between the selected combinations of indicators.
Operations performed on data to derive new information according to a given set of rules.
See below:
CIS is conducted and disseminated at two-year interval in pair years.
The timeliness of statistics reflects the length of time between data availability and the event or phenomenon they describe.
Comparability aims at measuring the impact of differences in applied statistical concepts and definitions on the comparison of statistics between geographical areas, non-geographical domains, or over time.
The coherence of statistical outputs refers to the degree to which the statistical processes by which they were generated used the same concepts (classifications, definitions, and target populations) and harmonised methods. Coherent statistical outputs have the potential to be validly combined and used jointly.
No significant problems of comparability between regions at the level NUTS 2 of Czechia. The stratification of the sample is set to eliminate possible problems regarding comparability between regions at the NUTS 2.
Due to important methodological changes introduced by Oslo Manual 2018, the data from 2018 onwards cannot be directly compared with CIS waves prior to 2018.


