|
![]() |
| For any question on data and metadata, please contact: Eurostat user support |
|
|||
| 1.1. Contact organisation | Statictics Poland |
||
| 1.2. Contact organisation unit | Social Surveys and Labour Market Department |
||
| 1.5. Contact mail address | kancelariaogolnaGUS@stat.gov.pl
|
||
|
|||
| 2.1. Metadata last certified | 15 April 2025 |
||
| 2.2. Metadata last posted | 15 April 2025 |
||
| 2.3. Metadata last update | 15 April 2025 |
||
|
||||||
| 3.1. Data description | ||||||
The European Union Statistics on Income and Living Conditions (EU-SILC) is an instrument aiming at collecting timely and comparable cross-sectional and longitudinal multidimensional microdata on income, poverty, social exclusion and living conditions. In addition, are collected module variables every three years, six years or ad-hoc new policy needs modules. The EU-SILC instrument provides two types of data:
Social exclusion and housing condition information is collected mainly at household level while labour, education and health information is obtained for persons aged 16 and over. The core of the instrument, income at very detailed component level, is mainly collected at personal level. This instrument is anchored in the European Statistical System (ESS).
|
||||||
| 3.2. Classification system | ||||||
For more details on the classification used please, see EU Vocabularies, Eurostat's metadata server or CIRCABC . |
||||||
| 3.3. Coverage - sector | ||||||
Data refer to all private households and individuals living in the private households in the national territory at the time of data collection. The EU-SILC survey is a key instrument for providing information required by the European Semester and the European Pillar of Social Rights, in particular for income distribution, poverty and social exclusion, as well as various related living conditions and poverty EU policies, such as on child poverty, access to health care and other services, housing, over indebtedness and quality of life. It is also the main source of data for microsimulation purposes and flash estimates of income distribution and poverty rates. |
||||||
| 3.4. Statistical concepts and definitions | ||||||
Statistical concepts and definitions for EU-SILC are specified in Regulation (EU) 2019/1700, Commission Implementing Regulation (EU) 2019/2181, and Commission Implementing Regulation (EU) 2019/2242. Additional information is available in the EU statistics on income and living conditions (EU-SILC) methodology and in the methodological guidelines and description of EU-SILC target variables (see CIRCABC). Further details are provided in items 5, 15.1, 15.2.2 and 18.3. |
||||||
| 3.5. Statistical unit | ||||||
Statistical units are private households and all persons living in these households who have usual residence in the Member State. Annex II of the Commission implementing regulation (EU) 2019/2242 defines specific statistical units per variable and specifies the, content of the quality reports on the organization of a sample survey in the income and living conditions domain pursuant to Regulation (EU) 2019/1700 of the European Parliament and of the Council. |
||||||
| 3.6. Statistical population | ||||||
The target population is private households and all persons composing these households having their usual residence in the Member State. |
||||||
| 3.6.1. Reference population | ||||||
Definitions of reference population, household and household membership The survey unit was a household and all the household members at least 16 years old at the end of the income reference period. The survey did not cover collective households or institutions
|
||||||
| 3.6.2. Population not covered by the data collection | ||||||
The sub-populations that are not covered by the data collection: persons living in collective accommodation establishments. |
||||||
| 3.7. Reference area | ||||||
The whole area of Poland. |
||||||
| 3.8. Coverage - Time | ||||||
Reference year 2024. The SILC data are available for the period 2005-2024. The reference period used for income and non-income variables: In EU-SILC different reference periods are used. The income reference period is the last calendar year preceding the survey, while for other variables presented in the tables the reference period is the current situation as well as the twelve-month or one week period before interview. |
||||||
| 3.9. Base period | ||||||
Not applicable. |
||||||
|
|||
The data involves several units of measure depending upon the variables. Income variables are transmitted to Eurostat in national currency. For more information, see methodological guidelines and description of EU-SILC target variables available on CIRCABC |
|||
|
||||||||
Description of reference period used for incomes
|
||||||||
|
|||
| 6.1. Institutional Mandate - legal acts and other agreements | |||
Regulation (EU) 2019/1700 was publish in OJ on 10 October 2019, establishing a common framework for European statistics relating to persons and households, based on data at individual level collected from samples (IESS). The Annex to the Commission implementing regulation (EU) 2019/2180 of 16 December 2019 specifies the detailed arrangements and content for the quality reports pursuant to Regulation (EU) 2019/1700 of the European Parliament and of the Council and Regulation (EU) 2019/2242. |
|||
| 6.2. Institutional Mandate - data sharing | |||
Confidential microdata are not disclosed by Eurostat. Access to confidential microdata for scientific purposes may be granted on the basis of Commission Regulation 557/2013 and Regulation 223/2009 of the European Parliament and the Council on European statistics. |
|||
|
|||
| 7.1. Confidentiality - policy | |||
The basic document is the Act on official statistics with its amendments. In addition, the CSO prepared a document: PERSONAL DATA PROTECTION POLICY (PODO) - a document describing the internal Personal Data Protection Policy regulating the principles of data processing in units of official statistics services. |
|||
| 7.2. Confidentiality - data treatment | |||
Confidentiality – data treatment: Rules for handling statistical data - appendix to the internal regulation of the President of the Statictics Poland. |
|||
|
|||
| 8.1. Release calendar | |||
Please refer to the publication calendar - Polish Public Statistics publicly available on the website of the Central Statistical Office. Tytułowy plan wydawniczy Głównego Urzędu Statystycznego i Urzędów Statystycznych na rok 2024 |
|||
| 8.2. Release calendar access | |||
Please refer to the Release calendar - Eurostat (europa.eu) publicly available on the Eurostat’s website. |
|||
| 8.3. Release policy - user access | |||
In line with the Community legal framework and the European Statistics Code of Practice, Eurostat disseminates European statistics on Eurostat's website (see section 10 - 'Accessibility and clarity'), respecting professional independence and in an objective, professional and transparent manner in which all users are treated equitably. The detailed arrangements are governed by the Eurostat protocol on impartial access to Eurostat data for users. Additional information about microdata access is available in EU statistics on income and living conditions - Microdata - Eurostat. |
|||
|
|||
Annual |
|||
|
||||||||
| 10.1. Dissemination format - News release | ||||||||
Did not occur |
||||||||
| 10.2. Dissemination format - Publications | ||||||||
Annual bilingual publication are available on the Poland statistical website. |
||||||||
| 10.3. Dissemination format - online database | ||||||||
Data from the EU-SILC study are published in publicly available databases:
Warunki życia ludności | Dashboard | DBW. The Knowledge Database focuses on presenting detailed thematic information (according to classifications, nomenclatures and code lists) for 31 domain areas: Construction, Prices, Demography, Education, Public finances, Maritime and Inland Economy, Energy, Social economy, Municipal and housing infrastructure, Business and consumer tendency, Culture, Forestry, Science and technology, Non-financial enterprises, Industry, National and regional accounts, Family, Agriculture, Labour Market, Internal Market, Information Society, State and protection of environment, Social benefits and assistance, Telecommunication and post, Transport, Tourism and Sport, Living conditions of the population, International exchange, Justice, Wages and salaries, labour costs, Health and healthcare. Data are generally available for Poland in total, and for some indicators also by voivodships. In the case of demographic data and of local government unit budgets data are available also for lower levels of territorial division. The length of the time series is due to the availability and consistency of information within each category. In the database information is presented in tables, whereas in the "Dashboards" module information is available in the form of interactive charts and masp. The Knowledge Database is a publicly available and free of charge. The access to the Knowledge Database and the use of its data is based on the open license (Attribution 4.0 International - CC BY 4.0).
The STRATEG system is a publicly accessible system, which is updated quarterly and designed to facilitate the process of monitoring the development and evaluating the effects of actions undertaken to strengthen social cohesion. The database contains a comprehensive set of key measures to monitor (mainly annual) development at the national level, as well as at lower levels of territorial division. To ensure international comparability, the database also contains the main indicators for the EU, its member states and regions at NUTS 2 level. The system is also used as a repository of indicators relating to various strategies – starting from the Europe 2020 Strategy of the EU and the most general Long-term National Development Strategy, as well as the Medium-term National Development Strategy, through 9 integrated strategies concerning economic efficiency and innovation, transport, energy security and environment, regional development, human capital, social capital, sustainable development of rural areas, agriculture and fishing industry, efficient state and national security. In addition, the system stores information on indicators for regional strategies, Partnership Agreement, National and Regional Operational Programmes. Analysis and perception of information is facilitated by data visualisation tools in the form maps and charts, as well as a comprehensive set of metadata describing the indicators. The system resources also provide additional information, such as links to most important documents of strategic importance, reports and other publications. Data sources Data about indicators available in the system come from official statistics and several dozen other sources, such as scientific institutes, national and regional centres and agencies, databases of international organizations and institutions. Additional information Relative figures (indexes, percentages) are generally calculated based on absolute data, expressed with a higher precision than presented in the tables. Owing to rounding, totals may not always correspond to the sum of all figures shown. |
||||||||
| 10.3.1. Data tables - consultations | ||||||||
KNOWLEDGE DATABASES - DOMAIN LIVING CONDITIONS: Due to the change of the DBW system and the mechanism for counting visits to the website, the table has changed. The data given below concerns the period from May 15, 2023 to December 31, 2023.
STRATEG - INDICATORS OF DOMAIN LIVING CONDITIONS:
|
||||||||
| 10.4. Dissemination format - microdata access | ||||||||
Information is disseminated through:
Link to the data request form: Statistical data request form Link to the Act on Public Statistics: Obwieszczenie Marszałka Sejmu Rzeczypospolitej Polskiej z dnia 13 lutego 2020 r. w sprawie ogłoszenia jednolitego tekstu ustawy o statystyce publicznej |
||||||||
| 10.5. Dissemination format - other | ||||||||
They are not available. |
||||||||
| 10.5.1. Metadata - consultations | ||||||||
They are not available. |
||||||||
| 10.6. Documentation on methodology | ||||||||
The survey documentation (for each year) consists of:
Currently, a national metadata system made available to external users is in preparation. The methodological description available on the website of the Central Statistical Office is available in the annual publication.
|
||||||||
| 10.6.1. Metadata completeness - rate | ||||||||
All required concepts are provided. |
||||||||
| 10.7. Quality management - documentation | ||||||||
Standards according to ISO 9000 are used in Polish official statistics. |
||||||||
|
|||
| 11.1. Quality assurance | |||
The EU-SILC survey applies the following quality management system procedures:
|
|||
| 11.2. Quality management - assessment | |||
The EU-SILC survey is well aligned with the methodology contained in DocSILC065 (Methodological guidelines and description of eu-silc target variables). This ensures high comparability of data at the European level. During the project realised under the Action plan for EU-SILC improvements Objective 1: Regional dimension of the EU-SILC data at NUTS2 level, the survey sample was increased, which resulted in an improvement in the precision of not only the main indicators but also the data in general. The problem, however, are substitute (proxy) interviews, which are high in Poland (in recent years, it is about 27% of all individual interviews). Unfortunately, it is a compromise between obtaining data from a person close to the respondent and the lack of an interview. Any information coming from people in the respondent's household is a better solution than imputating all the data. Concerns, however, are raised by proxy interviews in the case of questions about the assessment of various phenomena. Therefore, in Poland, proxy interviews are not allowed for some issues and additional weights are introduced. We are also looking for other solutions. We have introduced a self-administration (the paper form for people aged 16 and more is left for people who cannot be found or do not have time for the interviewer). Thanks to this measure, the interview rate in 2019 decreased by approx. 2 percentage points (to 25%). In 2024, the CAWI method was used for the first time in the entire country for an individual form. Unfortunately, this method was not introduced by interviewers very widely. The analysis of the situation conducted on the basis of voivodeship reports showed that this was due to interviewers' concerns about technical problems with the CAWI application and inability to obtain an interview from the respondent (fear of the respondent changing his or her mind regarding the choice of the CWAI method). Interestingly, interviewers who carried out (ine 2023) the pilot of the method in 2024 used this method more widely than other interviewers. This means that work (especially during training) is needed to convince interviewers to implement this method. Data from EU-SILC 2023 were transmitted to Eurostat in February 2024. |
|||
|
|||
| 12.1. Relevance - User Needs | |||
The main users of EU-SILC statistical data are policy makers, research institutes, media, and students. |
|||
| 12.2. Relevance - User Satisfaction | |||
Eurostat carried out an online general User Satisfaction Survey (USS) in the period between April and July 2019 to obtain a better knowledge about users, considering their needs and satisfaction with the services provided by Eurostat. The survey has shown that EU-SILC is of very high relevance for users. For the majority, both aggregates and micro-data were important or essential in their work irrespective of the purpose of their use. The use of the ad-hoc modules was less widespread than the use of the nucleus variables. Nevertheless, there was high interest to repeat these modules in order to have the possibility of comparing data over time. Users emphasized their strong need for more detailed micro-data, which is currently not possible. Under the new legal framework implemented from 2021, the NUTS 2 division will be available for the main indicators. Finally, users were satisfied with overall quality of the service delivered by Eurostat, which encompasses data quality and the supporting service provided to them. For more information, please consult the User Satisfaction Survey. |
|||
| 12.3. Completeness | |||
Yearly datasets contain all variables. No optional modular variables introduced:
|
|||
| 12.3.1. Data completeness - rate | |||
In 2024, Poland provided all variables in accordance with the D065 documentation: 100% |
|||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.1. Accuracy - overall | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
According to Reg. (EU) 2019/1700 Annex II, precision requirements for all data sets are expressed in standard errors and are defined as continuous functions of the actual estimates and of the size of the statistical population in a country or in a NUTS 2 region. For the income and living conditions domain, the estimated standard errors of the following indicators are examined according to certain parameters set:
Further information is provided in section 13.2 Sampling error. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.2. Sampling error | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
EU-SILC is a complex survey involving different sampling designs in different countries. In order to harmonize and make sampling errors comparable among countries, Eurostat (with the substantial methodological support of Net-SILC2) has chosen to apply the "linearization" technique coupled with the “ultimate cluster” approach for variance estimation. Linearization is a technique based on the use of linear approximation to reduce non-linear statistics to a linear form, justified by asymptotic properties of the estimator. This technique can encompass a wide variety of indicators, including EU-SILC indicators. The "ultimate cluster" approach is a simplification consisting in calculating the variance taking into account only variation among Primary Sampling Unit (PSU) totals. This method requires first stage sampling fractions to be small which is nearly always the case. This method allows a great flexibility and simplifies the calculations of variances. It can also be generalized to calculate variance of the differences of one year to another. The main hypothesis on which the calculations are based is that the "at risk of poverty" threshold is fixed. According to the characteristics and availability of data for different countries, we have used different variables to specify strata and cluster information. In particular, countries have been split into 3 groups: 1) BE, BG, CZ, IE, EL, ES, FR, HR, IT, LV, HU, PL, PT, RO, SI, UK and AL, whose sampling design could be assimilated to a two-stage stratified type we used DB050 (primary strata) for strata specification and DB060 (Primary Sampling Unit) for cluster specification; 2) DK, DE, EE, CY, LT, LU, NL, AT, SK, FI, CH whose sampling design could be assimilated to a one stage stratified type we used DB050 for strata specification and DB030 (household ID) for cluster specification; 3) MT, SE, IS, NO, whose sampling design could be assimilated to a simple random sampling, we used DB030 for cluster specification and no strata. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.2.1. Sampling error - indicators | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The concept of accuracy refers to the precision of estimates computed from a sample rather than from the entire population. Accuracy depends on sample size, sampling design effects and structure of the population under study. In addition to that, sampling errors and non-sampling errors need to be taken into account. Sampling error refers to the variability that occurs at random because of the use of a sample rather than a census and non-sampling errors are errors that occur in all phases of the data collection and production process. Sampling errors of indicators for the quality report were estimated using ultimate cluster method and linearization. Calibration of the weights was also taken into account. The R package vardpoor was used in the calculations. Annexes: PL_2024_Annex 3-Sampling_errors_13.2.1 PL_2024_Annex A EU-SILC - content tables |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3. Non-sampling error | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Non-sampling errors are basically of 4 types:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3.1. Coverage error | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Coverage errors include over-coverage, under-coverage and misclassification:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3.1.1. Over-coverage - rate | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Coverage error
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3.1.2. Common units - proportion | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The EU-SILC survey does not use data from administrative sources. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3.2. Measurement error | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Measurement error for cross-sectional data
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3.3. Non response error | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Non-response errors are errors due to an unsuccessful attempt to obtain the desired information from an eligible unit. Two main types of non-response errors are considered: 1) Unit non-response which refers to the absence of information of the whole units (households and/or persons) selected into the sample. According to Annex VI of the Reg.(EU) 2019/2242
NRh=(1-(Ra * Rh)) * 100 Where Ra is the address contact rate defined as: Ra= Number of address/selected person (including phone, mail if applicable) successfully contacted/Number of valid addresses/selected person (including phone, mail if applicable) selected and Rh is the proportion of complete household interviews accepted for the database Rh=Number of household interviews completed and accepted for database/Number of eligible households at contacted addresses (including phone, mail if applicable) • Individual non-response rates (NRp) is computed as follows: NRp=(1-(Rp)) * 100 Where Rp is the proportion of complete personal interviews within the households accepted for the database Rp= Number of personal interview completed/Number of eligible individuals in the households whose interviews were completed and accepted for the database • Overall individual non-response rates (*NRp) is computed as follows: *NRp=(1-(Ra * Rh * Rp)) * 100 For those Members States where a sample of persons rather than a sample of households (addresses, phones, mails etc.) was selected, the individual non-response rates will be calculated for ‘the selected respondent. 2) Item non-response which refers to the situation where a sample unit has been successfully enumerated, but not all the required information has been obtained.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3.3.1. Unit non-response - rate | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Unit non-response rate for cross-sectional
where A=total (cross-sectional) sample, B =New sub-sample (new rotational group) introduced for first time in the survey this year, C= Sub-sample (rotational group) surveyed for last time in the survey this year.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3.3.2. Item non-response - rate | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The computation of item non-response is essential to fulfil the precision requirements. Item non-response rate is provided for the main income variables both at household and personal level. Item non-response which refers to the situation where a sample unit has been successfully enumerated, but not all the required information has been obtained. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3.3.2.1. Item non-response rate by indicator | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Calculations for Item non-response rate are included in the Annex. Annexes: PL_2024_Annex 2-Item_non_response_13.3.3.2.1 |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3.4. Processing error | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Description of data entry, coding controls and the editing system
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 13.3.5. Model assumption error | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Not applicable as error modeling has not been applied. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|||
| 14.1. Timeliness | |||
The EU-SILC content team has been working for several years to accelerate the publication of data compared to the reference period of the data. |
|||
| 14.1.1. Time lag - first result | |||
Data sets for 2024 were sent to Eurostat in December 2024. Unfortunately, it was necessary to submit a correction to the data sets in February and March 2025.
|
|||
| 14.1.2. Time lag - final result | |||
Final data: the number of months from the last day of the reference period to the day of publication of first results.:
|
|||
| 14.2. Punctuality | |||
The first publication of EU-SILC 2024 data took place in January 2025. |
|||
| 14.2.1. Punctuality - delivery and publication | |||
The publication "Incomes and living conditions of the population of Poland (report from the EU-SILC survey of 2024)" is scheduled to be published on the website of the Central Statistical Office on December 31, 2025. At the moment there is no risk of meeting this deadline. |
|||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 15.1. Comparability - geographical | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
In Poland, the same definitions and forms for the EU-SILC survey apply throughout the country. The methodological instruction for the survey is also prepared centrally. Training for people carrying out the survey is conducted by the EU-SILC substantive team. In case of doubt, voivodship coordinators consult directly with the members of the substantive team. All of this reduces the possibility of regional errors. Any discrepancies in comparing data at the international level are limited to a minimum by adapting the methodology according to the guidelines prepared by Eurostat. Any doubts are consulted with the Unit F-4: Income and living conditions - Quality of life team. In the case of income data, minor differences along with the level of comparability are described in section 3.4. Statistical concepts and definitions. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 15.1.1. Asymmetry for mirror flow statistics - coefficient | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Not applicable. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 15.2. Comparability - over time | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
See the annex on Break in series. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 15.2.1. Length of comparable time series | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Methodological changes affecting the comparability of data are described in the Annex Annexes: PL_2024_Annex 8-Breaks in series_15.2-updated |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 15.2.2. Comparability and deviation from definition for each income variable | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Comparability and deviation from definition for each income variable
F= Fully comparable; L= Largely comparable; P= Partly comparable and NC= Not collected.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 15.3. Coherence - cross domain | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The coherence of two or more statistical outputs refers to the degree to which the statistical processes, by which they were generated, used the same concepts and harmonised methods. A comparison with external sources for all income target variables and the number of persons who receive income from each ‘income component’ will be provided, where the Member States concerned consider such external data to be sufficiently reliable.
Comparison of EU-SILC and HBS results
The objective of this section is to compare HBS (Household Budget Survey) and EU-SILC results. When comparing these two sources we must take into account the discrepancies. The differences are to great extent brought about by the methodological diversity. Here are the main diverging points:
Comparison of selected income data in the Annex Annexes: Comparison of EU-SILC and HBS results_2024 |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 15.3.1. Coherence - sub annual and annual statistics | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Not applicable. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 15.3.2. Coherence - National Accounts | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Currently, we do not yet have data to compare the results between EU-SILC and RN. Availability of data from the RN probably in August 2025. The study will be supplemented. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 15.4. Coherence - internal | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
In 2024, there were no lack of coherency in the collection. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|||
Mean (average) interview duration per household = 31 minutes. Mean (average) interview duration per person = 28 minutes. Mean (average) interview duration for selected respondents (if applicable) = minutes. - PL - Not applicable |
|||
|
|||
| 17.1. Data revision - policy | |||
The most important results from the EU-SILC survey together with the methodological description are presented in the publication Income and living conditions of the population of Polish - report from the EU-SILC. The methodological description contains information about the changes that were introduced to the survey in relation to previous years. In 2005-2024, there were no changes in the results after their publication. If this had happened, the following measures would have been applied:
In the case of making available data sets, the following is practiced: informing persons ordering data sets during the process of agreeing the scope of the contract about methodological changes in the survey (within the ordered thematic scope) and about possible lack of comparability of some data resulting from these changes. |
|||
| 17.2. Data revision - practice | |||
In 2005-2024, there were no changes in the results after their publication. |
|||
| 17.2.1. Data revision - average size | |||
In 2024, there are no revisions to report for the statistical process. |
|||
|
|||||||||||||||||||||||||||||||
Detailed information concerning sampling frame, sampling design, sampling units, sampling size, weightings and mode of data collection can be found in this section (please see below). Such information is mainly used for the computation of the accuracy measures. |
|||||||||||||||||||||||||||||||
| 18.1. Source data | |||||||||||||||||||||||||||||||
The new subsample for EU-SILC 2024 was selected in November 2023 from the sampling frame updated as of June 30, 2023.
|
|||||||||||||||||||||||||||||||
| 18.1.1. Sampling Design | |||||||||||||||||||||||||||||||
Sampling frame The sample for EU-SILC 2024 consisted of four panel subsamples. The samples for EU-SILC 2005 and for the next years were selected from the sampling frame based on the TERYT system, i.e. National Official Register of Territorial Division of the Country. Two kinds of primary sampling units (PSU) were distinguished in the sampling frame:
The whole territory of Poland is divided into enumeration statistical districts and census enumeration areas. The TERYT system is updated annually with respect to the territorial division into statistical districts and census enumeration areas. The lists of dwellings, names of towns, villages and streets are updated. Other changes due to new construction, demolition of buildings and administrative division modifications are also introduced. The new subsample for EU-SILC 2024 was selected in November 2023 from the sampling frame updated as of June 30, 2023. Sample design Type of sampling design A two-stage sampling scheme with different selection probabilities at the first stage was used. Primary sampling units (PSU) were enumeration census areas. At the second stage dwellings were selected. All the households from the selected dwellings were supposed to enter the survey. Prior to selection, primary sampling units were stratified. Stratification and sub stratification criteria The strata were the voivodships (NUTS2) and within the voivodships primary sampling units were classified by class of locality. In urban areas census areas were grouped by size of town. Big cities formed independent strata, but in the five largest cities districts were treated as strata. In rural areas strata were represented by rural gminas (NUTS5) of a subregion (NUTS3) or of a few neighbouring powiats (NUTS4). Altogether, 211 strata were distinguished for the first year of the survey; this amount in subsequent editions was subject to certain modifications resulting from changes in the administrative division. Sample selection schemes It was estimated that in the first year of the survey (2005) the sample should comprise about 24 000 dwellings. Proportional allocation of dwellings to particular strata was applied. In the following years, the allocation of newly drawn subsamples proportionally between voivodships was modified due to the necessity of obtaining reliable data (compliant with Eurostat recommendations) at the NUTS 2 level. As a consequence, this allocation has approximately become proportional to the square root of the number of dwellings in the population.
The number of dwellings selected from a particular stratum (in every NUTS 2 level) was in proportion to the number of dwellings in the stratum. Furthermore, the number of the first-stage units selected from the strata was obtained by dividing the number of dwellings in the sample by the number of dwellings determined for a given class of locality to be selected from the first-stage unit. In towns with at least 100 000 inhabitants 3 dwellings per PSU were selected, in towns with 20-100 thousand inhabitants – 4 dwellings per PSU, in towns with less than 20 000 inhabitants – 5 dwellings per PSU, respectively. In rural areas 6 dwellings were selected from each PSU. In the first year of the survey 5912 census areas and 24044 dwellings were selected for the sample. Census areas were selected according to the Hartley-Rao scheme. Prior to selection, census areas were put in random order for each stratum separately and then the determined number of PSUs was selected with probabilities proportionate to the number of dwellings. Then, from each of the selected census areas dwellings were selected using the simple random selection without replacement procedure. The selected sample of primary sampling units was divided into four subsamples, equal in size. Starting from 2006 one of the subsamples is eliminated and replaced with a new one, selected independently as described above. In 2024 subsample 2 was replaced by subsample 6 consisting of 2606 census areas and 9201 dwellings. In 2024, a sample of reserve dwellings was scheduled for the new sample (as in the previous years), which will allow to obtain, in subsequent editions of the survey, an increase in the number of completed surveys within regions (NUTS 2). The larger sample carried out at the level of NUTS2 classification results from the need to meet the precision requirements for selected indicators, which are analyzed by Eurostat [1]. After the analysis of historical data, it was assumed that in the class of locality "over 20 thousand. inhabitants ", 12 reserve dwellings will be drawn to each address from the main sample; for the class of locality "less than 20 thousand. inhabitants ", 10 reserve dwellings will be drawn; for the remaining class of rural areas a random selection of 6 reserve addresses was established. In determining the size of the new subsample in the regions (NUTS 2 level), a mathematical model was used, which included the following elements: • limitations for standard errors of AROPE indicator (people at risk of poverty or social exclusion) from Eurostat regulation, which should be met in 2024 year • the model of dependence of the estimated value of standard errors of the AROPE indicator from the number of households with completed interviews in each region • historical data on the completeness rates for the subsamples surveyed in previous years • expected impact of the planned use of the reserve dwellings.
When drawing the new subsample 6 in 2024 year, the following additional elements were used to modify the sampling scheme used in previous years: • the strata for the first stage sampling units were defined by regions (NUTS 2), i.e., voivodships, taking into account the division of the Mazowieckie voivodship into two regions: the Warsaw Capital Region and the Mazowieckie Regional Region; then in regions by class of locality. Large cities generally constituted independent strata. In Warsaw, Krakow, Lodz, Poznan and Wroclaw, several strata each were created by combining neighbouring districts. Small cities and rural areas, on the other hand, were stratified by sub-region (NUTS 3) with consideration of classes of locality. In defining the strata in rural areas, account was taken of their diverse nature, as defined in the delimitation of rural areas (DOW) introduced by the Statistics Poland, which divides rural areas into 4 classes taking into account population density and distance to urban agglomerations. In addition, part of the “agricultural” strata, defined based on the percentage of dwellings with a user of an individual farm, was distinguished. In addition, specific “rich” strata were established based on the highest values of average tax income per capita of the municipality (according to PIT tax bases). A total of 238 strata were established, including 101 rural strata; 53 “agricultural” strata and 36 “rich” strata were created; • Income ranks were assigned to each dwelling address in the frame (the so-called Social Surveys Frame (OBS)) thanks to the Statistics Poland’s access to individual tax data from the Ministry of Finance, allowing for the identification of persons with the PESEL ID; This made it possible to assign information to OBS databases at the level of people and addresses. The provided administrative data processed by the Statistics Poland covered the years 2016-2019 and made it possible to obtain a total annual income for people. On the basis of unit tax data from PIT databases for 2019, a set was created in the OBS in which a code with a value from 1 to 10 was assigned to the apartment address identifier, i.e. a rank based on the deciles of the equivalent income distribution; the equivalent income was calculated by first summing up the total income from PIT for people assigned to a given address according to OBS and then dividing the total income by the square root of the number of people in the dwelling; • the allocation of the sample size between the strata determined earlier for each region was determined using the algorithm described in the article Wesołowski, Wieczorkowski (2017) [Wesolowski J., Wieczorkowski R. (2017), An eigenproblem approach to optimal equal-precision sample allocation in subpopulations, Communications in Statistics - Theory and Methods, 46: 5, 2212-2231.], Which solves the problem of optimal allocation in a two-stage sampling scheme that theoretically obtains minimal estimates of the relative standard error for the estimator of the mean value of a fixed feature; the new allocation algorithm requires the availability of a certain variable for each elementary sampling unit (i.e. a dwelling) in the frame; the selected variable should be well correlated with the key variables of the study; in the case of the EU-SILC survey, the 'income rank' feature described above was used; The new allocation algorithm also requires that the required ratio of the number of randomly drawn second-stage units to the number of first-stage units be specified as a parameter
Sample distribution over time In the first year of the survey the selected sample of primary sampling units were divided into four subsamples, equal in size. Starting from 2006 one of the subsamples is eliminated and replaced with a new one, selected independently. Substitution If the household from the selected dwelling refused to enter the survey substitution from reserve sample was applied (only for new subsample). The survey from 2018 introduces the sorting of addresses from the reserve list due to the distance between the reserve address and address from main sample. This solution was introduced due to a decrease in the interviewer burden because of the travelling time between the addresses (in particular in rural areas) and travel costs in the case of the need for multiple visits at the same address (no contact with the respondent or completion of the interview).
Concerning the SILC instrument, three different sample size definitions can be applied:
Given that the effective sample size has been already treated in the section dealing with sampling errors, in this section the attention focuses mainly on the achieved sample size.
In total 20 022 households were interviewed and included in the dataset. 40 329 persons at the age of 16 years and more completed an individual interview. 48 040 is the number of persons who are members of the households surveyed.
The following graphs show comparison of distributions of realized units from new subsample (DB075=2 and DB135=1) according to selected variable (available in the frame from administrative registers), by original and substituted dwellings. Substituted units accounted for about 50 percent of all realized new subsample units. Graphical analysis leads to a general conclusion that substituted sample did not make a significant difference compared to original sample. Proper calibration of weights (described in Annex 5) is an additional guarantee of the appropriate quality of estimation.
Fig.1. Comparison of distributions of realized units from new subsample according to number of employed persons, by original and substituted dwellings
Fig.2. Comparison of distributions of realized units from new subsample according to number of employed persons, by original and substituted dwellings, for NUTS 2 regions
[1] Annex II to the Regulation (EU) 2019/1700 of the European Parliament and of the Council establishing a common framework for European statistics relating to persons and households, based on data at individual level collected from samples. |
|||||||||||||||||||||||||||||||
| 18.1.2. Sampling unit | |||||||||||||||||||||||||||||||
The first-stage sampling units (primary sampling units - PSUs) were enumeration census areas, while at the second stage dwellings were selected. All the households from the selected dwellings are supposed to enter the survey. |
|||||||||||||||||||||||||||||||
| 18.1.3. Sampling frame | |||||||||||||||||||||||||||||||
The sample for EU-SILC 2024 consisted of four panel subsamples. The samples for EU-SILC 2005 and for the next years were selected from the sampling frame based on the TERYT system, i.e. National Official Register of Territorial Division of the Country. Two kinds of primary sampling units (PSU) were distinguished in the sampling frame:
The whole territory of Poland is divided into enumeration statistical districts and census enumeration areas. The TERYT system is updated annually with respect to the territorial division into statistical districts and census enumeration areas. The lists of dwellings, names of towns, villages and streets are updated. Other changes due to new construction, demolition of buildings and administrative division modifications are also introduced. The new subsample for EU-SILC 2024 was selected in November 2023 from the sampling frame updated as of June 30, 2023. |
|||||||||||||||||||||||||||||||
| 18.2. Frequency of data collection | |||||||||||||||||||||||||||||||
Data is collected once a year. In Poland, the EU-SILC survey was conducted throughout the country from April 22 to June 28, 2024. |
|||||||||||||||||||||||||||||||
| 18.3. Data collection | |||||||||||||||||||||||||||||||
Mode of data collection
100% are all PAPI or CAPI or CATI interviews
Annexes: PL_2024_Annex 4-Data_collection_18.3 |
|||||||||||||||||||||||||||||||
| 18.4. Data validation | |||||||||||||||||||||||||||||||
Validation is based on prepared assumptions: scope and logical. The first stage of validation takes place during the interview, the next one during the data collection preparation. The assumptions are developed for both cross and panel data sets. Each signaled situation is analyzed and if it is considered a mistake, it is corrected based on information from the respondent. The number of individual errors is monitored. If an error occurs too often, the reason is analyzed. The reason may be:
|
|||||||||||||||||||||||||||||||
| 18.5. Data compilation | |||||||||||||||||||||||||||||||
The research uses the following processes: data weighting and imputation of missing income data. These processes are described in detail in sections 3.5.1 and 3.5.2. |
|||||||||||||||||||||||||||||||
| 18.5.1. Imputation - rate | |||||||||||||||||||||||||||||||
In the case of PL, the imputation and Item non-response rates have the same values (all item nonresponses occurred have been imputed). Data and annex from point 13.3.3.2.1. |
|||||||||||||||||||||||||||||||
| 18.5.2. Weighting methods | |||||||||||||||||||||||||||||||
Detailed description in the annex. Annexes: PL_2024_Annex 5-Weights |
|||||||||||||||||||||||||||||||
| 18.5.3. Estimation and imputation | |||||||||||||||||||||||||||||||
The methodology of EU-SILC requires for the imputation of the missing income data. The complete file is obtained through the imputation of the missing data. Imputation is a procedure aimed at ensuring the completeness of a data set by replacing the data which are missing due to the respondent’s refusal to give answers with values that are correct from the formal point of view (imputation values). The imputation values are received by the means of a formalised procedure (an algorithm) designed in such a way that the generated values reflect, as precisely as possible, the probable values of missing data in terms of information included in the data set. There are several methods of income variable imputation. They can be classified as deterministic and stochastic methods. In the case of deterministic methods, for a particular set of data the selected method and the set of explanatory variables (imputation algorithm) clearly determine the imputation values for each record. In stochastic methods the imputation value is determined with the use of an error term and that is why with the same algorithm and the same data file, each realisation of the algorithm may give slightly different imputation values. Although the stochastic methods slightly increase estimator variance (introducing an additional random error component), they do not distort variance or original data distribution characteristics allowing for the correct estimation of random error. Deterministic imputation brings about variable variance reduction in the file and random error underestimation; it also distorts to a greater extent the correlation structure and variable distribution. In the income data imputation applied in the EU-SILC survey, the preferable methods are those which preserve the distribution characteristics (thus favouring the stochastic methods).
The following stochastic methods were used:
It involves the replacement of missing data in a record with gaps (the recipient record) with the data collected from a different record (the donor record) randomly selected out of complete (from the point of view of imputed variable) records which meet the specified conditions for similarity with the recipient record. Auxiliary qualitative categorising variables (explanatory variables) , used for grouping records, may be used in the hot-deck method. In this case, a random representative is selected out of the records showing adequate values of auxiliary variables. If it is not possible to find a donor with the equivalent values for all the auxiliary variables, the so called sequence approach is adopted. The categorising variables are ranked from the most to the least significant ones. If there are no donors, the categorisation is carried out with the subsequent explanatory variables being left out, starting from the least significant ones, so as to obtain a subset containing donors. In the case of applying a quantitative categorising variable in the hot-deck method, a breakdown into deciles is used as a categorisation criterion.
Auxiliary variables are the explanatory variables of the regression model. The model takes either a linear or power exponential form. It is fitted on the basis of the records which are complete from the point of view of the imputed variable. The imputed value (or its logarithm in the case of transformed models) is a sum of the theoretical value derived from the model and a randomly selected model residual. The set of records, out of which the residual is selected, is restricted to those which are nearest to the record imputed for the theoretical value derived from the model.
Out of the deterministic methods the following were applied:
The application of stochastic regression imputation requires a model which describes well the formation of a variable with relatively small variance of an error term and good statistical qualities. With high variance of a random component, there is a danger of getting accidental values which are not typical of the correct part of the dataset. That is why in the cases where in accordance with the assumption referred to above, stochastic imputation is required, the hot-deck method is preferred to regression imputation. This is particularly justified when the number of records for imputation is rather low, or when the number of correct records is too small for a suitable model fitting. Stochastic regression imputation is most commonly used for incomes from hired employment, when:
It is also widely used for income categories other than income from hired work if income of a given person/household from the previous year is known. In such a case, the stochastic regression imputation is treated as the basic method, however, the hot-deck method is also applied when it is difficult to adjust an appropriate model. In view of a relatively wide scope of applications of the stochastic regression imputation, an additional protection against possible effects of insufficient model adequacy was introduced. The residuals are not generated from the distribution of residuals for the whole sample, but they are selected from a restricted subset. Although in an ideal model residuals should be in the form of white noise, showing no trend whatsoever, in reality there may be some trends (systematic elements) retained in the distribution of residuals, which are not detected by the model, e.g. those related to non-linearity of relationships which cannot be removed by any known transformations. In such a case the use of residuals from a restricted range reduces the risk of generating values diverging from the real variable distribution by combining the theoretical value and the residual which would be utterly improbable (in combination with this theoretical value). Deterministic imputation is applied where missing data concern less significant components of income variables (taxes, social and health insurance fees, additions, etc.) in the situation when the main component is known. In such cases deterministic regression imputation is usually applied. The conversion of a gross value into a net value and vice versa is performed by the use of the regression deterministic imputation method, if it proves necessary due to missing data. Deduction imputation is employed in rare cases of obvious relationships and can be treated as a supplementary stage of data editing. The explanatory variables in the models and the grouping ones in the case of hot-deck method have been selected so as to represent the relationships which, according to logics and knowledge about the phenomena studied, should occur in the data set, taking into account the accessibility of potential variables in the questionnaire. The relationships have been tested on the file of correct data and in the majority of cases they proved to be significant. Some of the explanatory variables have been retained, even if their impact on the imputed variable has not been statistically confirmed, if they express an economically important relationship or provide a grouping condition (interpretation criterion) in the calculation algorithm for variables. For the persons and households not surveyed in the previous year (a new sample, new household members, persons who could not be interviewed previously) or for those who did not gain a particular type of income in the previous year, explanatory variables derived from the current data file are applied. Wherever the same type of income is found in the data for the previous year, its value is treated as the main explanatory (categorizing) variable, both in the case of variables subjected to regression imputation and the hot-deck method. The current variables may be treated as additional explanatory variables. Since the 2023 edition of EU-SILC, deductive imputation was introduced. It is used for benefits in the case of which the amount of the benefit can be determined with very high reliability on the basis of information provided by the respondent regarding the fact of receiving the benefit and other qualitative characteristics from the survey describing him/herself and his/her situation. In such cases, the amount obtained as a result of deductive imputation is not marked as imputed but as a value obtained from the study. This approach was developed as part of a project implemented with the support of EC funds under the grant agreement 101052514-2021-PL-ILC-SILC. Imputed rent - estimated using regression model. The first step consists in the estimation of a hedonic price function according to which actual rents paid by tenants depend on the main characteristics of dwellings. In the second step, the imputed rent is calculated using the model for all households that do not pay rent at the market price. Monthly rent per 1 m sq. of the usable dwelling area is the variable of interest in the model. An exponential formula of model specification is used (estimation on logarithms).
|
|||||||||||||||||||||||||||||||
| 18.6. Adjustment | |||||||||||||||||||||||||||||||
Not applicable. |
|||||||||||||||||||||||||||||||
| 18.6.1. Seasonal adjustment | |||||||||||||||||||||||||||||||
Not applicable. |
|||||||||||||||||||||||||||||||
|
|||
No comment. Annexes: PL_2024_Annex A EU-SILC - content tables PL_2024_Annex 9-Rolling module PL_2024_Annex 8-Breaks in series_15.2-updated HOUSEHOLD QUESTIONNAIRE_2024 PERSONAL QUESTIONNAIRE_2024 |
|||
|
|||
|
|||