Income and living conditions (ilc)

National Reference Metadata in Single Integrated Metadata Structure (SIMS)

Compiling agency: Hungarian Central Statistical Office


Eurostat metadata
Reference metadata
1. Contact
2. Metadata update
3. Statistical presentation
4. Unit of measure
5. Reference Period
6. Institutional Mandate
7. Confidentiality
8. Release policy
9. Frequency of dissemination
10. Accessibility and clarity
11. Quality management
12. Relevance
13. Accuracy
14. Timeliness and punctuality
15. Coherence and comparability
16. Cost and Burden
17. Data revision
18. Statistical processing
19. Comment
Related Metadata
Annexes (including footnotes)



For any question on data and metadata, please contact: Eurostat user support

Download


1. Contact Top
1.1. Contact organisation

Hungarian Central Statistical Office

1.2. Contact organisation unit

Quality of Life Statistics Department

Living Standard Statistics Section

1.5. Contact mail address

HU-1024 Budapest Keleti Károly u. 5-7.


2. Metadata update Top
2.1. Metadata last certified

30 May 2025

2.2. Metadata last posted

30 May 2025

2.3. Metadata last update

10 October 2025


3. Statistical presentation Top
3.1. Data description

The European Union Statistics on Income and Living Conditions (EU-SILC) is a survey-based instrument aiming at collecting timely and comparable cross-sectional and longitudinal multidimensional microdata on income, poverty, social exclusion and living conditions. In addition, it collects module variables every three years, six years or ad-hoc new policy needs modules.

The EU-SILC instrument provides two types of data:

  1. Cross-sectional data pertaining to a given time or a certain time period with variables on income, poverty, social exclusion and other living conditions;
  2. Longitudinal data pertaining to individual-level changes over time, observed periodically over four‐or more year rotation scheme (Annex III (2) of 2019/1700).

Social exclusion and housing condition information is collected mainly at household level while labour, education and health information is obtained for persons aged 16 and over. The core of the instrument is income information at very detailed component level and mainly collected at personal level.

3.2. Classification system
  • International Standard Classification of Education (ISCED'2011);
  • International Standard Classification of Occupations (ISCO-08);
  • Classification of Economic Activities (NACE Rev.2-2008);
  • Common classification of territorial units for statistics (NUTS 2);
  • Geographical code list (SCL Geo Code);
  • The recommendations made by the United Nations in the Canberra Group Handbook on Household Income Statistics should also be taken into account.

For more details on the classifications used, please see Eurostat code list or Statistics explained glossary on classifications.

3.3. Coverage - sector

Data refer to all private households and individuals living in the private households in the national territory at the time of data collection.

The EU-SILC survey is a key instrument for the European Semester and the European Pillar of Social Rights, providing information on income distribution, poverty and social exclusion, as well as various related living conditions and poverty EU policies, such as on child poverty, access to health care and other services, housing, over indebtedness and quality of life. It is also the main source of data for microsimulation purposes and flash estimates of income distribution and poverty rates.

3.4. Statistical concepts and definitions

Statistical concepts and definitions for EU-SILC are specified in EU Regulation (EU) 2019/1700, EU Regulation 2019/2181, and EU Regulation 2019/2242. Additional information about microdata access is available in Statistics on Income and Living Conditions - Access to microdata - Eurostat and in the methodological guidelines and description of EU-SILC target variables (see CIRCABC).

Further details are provided in items 5, 15.1.1.1, 15.2.2 and 18.3.

3.5. Statistical unit

Statistical units are private households and all persons living in these households who have usual residence in Hungary. Annex II of the EU regulation 2019/2242 defines specific statistical units per variable and specifies the content of the quality reports on the organization of a sample survey in the income and living conditions domain pursuant to EU regulation 2019/1700 of the European Parliament and of the Council.

3.6. Statistical population

The target population is private households and all persons composing these households having their usual residence in Hungary. A private household means a person living alone or a group of persons who live together, providing oneself or themselves with the essentials of living.

3.6.1. Reference population

Definitions of reference population, household and household membership (specify any deviation from Eurostat definition in your country)

Reference population

Private household definition

Household membership

the same concet used as in the corresponding regulation

the same as in the corresponding regulation

the same as in the corresponding regulation

3.6.2. Population not covered by the data collection

The sub-populations that are not covered by the data collection includes: those who moved out of the country’s territory; or those with no usual residence; or those living in institutions or who have moved to an institution compared to the previous year.

3.7. Reference area

Reference area is the territory of Hungary.

3.8. Coverage - Time

HU-SILC was introduced into the statistical system in Hungary in 2005.

Datasets are available from 2005 till the current year covered in this report namely 2024.

3.9. Base period

Not applicable.


4. Unit of measure Top

The data involves several units of measure depending upon the variables. Income variables are transmitted to Eurostat in national currency. For more information, see methodological guidelines and description of EU-SILC target variables available on CIRCABC.


5. Reference Period Top

Description of reference period used for incomes

Period for taxes on income and social insurance contributions

Income reference periods used

Reference period for taxes on wealth

Lag between the income ref period and current variables

 T-1

 T-1

 T-1

The fieldwork was between February 1 and April 15, 2024. The lag between the income reference period and current variables is min 2 to 4 months.

According to the regulation:

  • Survey year indicated with T
  • Income reference period is the previous calendar year: T-1

There was not any changes compared to the corresponding regulation.


6. Institutional Mandate Top
6.1. Institutional Mandate - legal acts and other agreements

EU regulation (EU) 2019/1700 was publish in the OJ on October 10, 2019, establishing a common framework for European statistics relating to persons and households, based on data at individual level collected from samples (IESS). The Annex to the Commission Implementing EU regulation (EU) 2019/2180 of December 16, 2019 specifies the detailed arrangements and content for the quality reports pursuant to EU regulation (EU) 2019/1700 and EU regulation 2019/2242.

6.2. Institutional Mandate - data sharing

Confidential microdata are not disclosed by Eurostat. Access to confidential microdata for scientific purposes may be granted on the  basis of EU regulation 557/2013 and EU regulation 223/2009 on European statistics.


7. Confidentiality Top
7.1. Confidentiality - policy

We follow the correcponding EU regulation on confidentiality.

7.2. Confidentiality - data treatment

Our national regulations and any corresponding information are available on our website: Központi Statisztikai Hivatal (ksh.hu).


8. Release policy Top
8.1. Release calendar

HU-SILC data collection has a standard data publication policy.

The fieldwork is carried out from February to April. The first dissemination of the results are in a form of publication on Income, Living Condition, Poverty and Social Exclusion in October year T.

The data tables are uploaded to STADAT database which is available on our website at the same time. Untill the official validation by Eurostat, all data are treated as preliminary, while after the official validation the data treated as final.

Information on the release available on this link: Központi Statisztikai Hivatal (ksh.hu).

8.2. Release calendar access

European release of EU-SILC data is done according to predifined calendar which is publicly available on this link: 

Release calendar - Eurostat (europa.eu)

8.3. Release policy - user access

In line with the Community legal framework and the European Statistics Code of Practice, Eurostat disseminates European statistics on Eurostat's website (see section 10 - 'Accessibility and clarity'), respecting professional independence and in an objective, professional and transparent manner in which all users are treated equitably. The detailed arrangements are governed by the Eurostat protocol on impartial access to Eurostat data for users. Additional information about microdata access is available in Statistics on Income and Living Conditions - Access to microdata.


9. Frequency of dissemination Top

Annual


10. Accessibility and clarity Top
10.1. Dissemination format - News release

Standard data and publication is in October of data collection year. 

That time a comprehensive study and a STADAT database is available for users.

10.2. Dissemination format - Publications

Helyzetkép | 2023 (ksh.hu)

A comprsensive study is avalaible on our website on Living Standars of households including income situation and poverty and social exclusion indicators.

Please note that according to the Hungarian data dissemination policy data are marked with refence year instead of EU standard of data collection year.

10.3. Dissemination format - online database

 The same time of the study publication data are available for users in STADAT database in our website.

 KSH Statinfo v39 | Témakör választó.

10.3.1. Data tables - consultations

HCSO Dissemination Directorate continously collects information on number of visits on our website. This is available by statistcal themes, by month and annual breakdowns. 

10.4. Dissemination format - microdata access

Detailed information is available for researcher on the access to dataset for reserach purposes on our website.

In order to support scientific research, HCSO provides access to microdata files for scientific purposes only, subject to researcher accreditation, as follows.

  1. Public used files: Publicly accessible micro-data files with strong anti-disclosure protection, of which two groups are available on our website. The sample files, which support research work, and educational micro-data files.
  2. Release of anonymised mocrodata: Micro-data files containing data from statistical observation units with protection against disclosure that minimises the possibility of identifying the statistical unit concerned. Only for scientific research purposes, if there is a research institutional background.
  3. Safe centre: Access to individual-level statistical data that cannot be directly identified, subject to a high level of data protection and strict compliance with data protection regulations. Access in a secure, camera-monitored environment. Access to data files requested for scientific research purposes only.
  4. Remote Execution: This means the use of data files that can only be requested for scientific research. The researcher does not have direct access to the requested micro-data files, the query is run by the HCSO based on the program code/specification.
10.5. Dissemination format - other

Dissemination uses the channels described in 10.2 - 10.4.

10.5.1. Metadata - consultations

Not available.

10.6. Documentation on methodology

HU-SILC follows the standard methodoly and meta information of HCSO.

Details and publications are avaliable on this link: Központi Statisztikai Hivatal (ksh.hu)

Annex 10 - provide information on metadata on Benefits.

For annexes, see Annexes section.

10.6.1. Metadata completeness - rate

All required concepts are provided, 100%.

10.7. Quality management - documentation

See details in 10.6.


11. Quality management Top
11.1. Quality assurance

All social surveys follows the general quality assurance policy in HCSO.

One of the basic documents of HCSO's commitment to quality is the Quality Policy (the relevant document is available on the link below). The Quality Policy issued in February 2023 follows the directions set out in the Office's strategy, the guidelines of the European Statistics Code of Practice and the National Statistics Code of Practice, and the requirements of the ISO 9001:2015 Quality Management System.



Annexes:
Quality policy of HCSO - in HUngarian only
11.2. Quality management - assessment

Quality management has unified concept in HCSO. It is based on GSBPM 5.0 model.

Quality assurance guidelines is avalabale on the link, in Hungarian only. 



Annexes:
Quality assurance guidelines - in Hungarian only


12. Relevance Top
12.1. Relevance - User Needs

The main users of EU-SILC statistical data are general publicy, policy makers, research institutes, the media, and students.

12.2. Relevance - User Satisfaction

Eurostat carried out a general User Satisfaction Survey (USS) in 2024 (User Satisfaction Survey by years) to obtain a better understanding of users’ needs and satisfaction with the services provided by Eurostat. The survey showed that EU-SILC is highly relevant to users. For the majority, both aggregates and microdata are considered important or essential in their work, irrespective of the purpose of their use. The use of the ad-hoc modules was less widespread than the use of the annual variables. Users emphasized their strong need for more detailed micro-data.

For more information, please consult the User Satisfaction Survey.

12.3. Completeness

HU-SILC follows the corresponding regulations in the context of collected and transmitted data to Eurostat.

All the variables in the EU-SILC guidelines for 2024 operations are collected.

HY145N variable is not collected since our national taxation system has different concepts. Monthy deducted income have to be summarised in December of the given calendar year. If there is a overpayment or underpayment than it is corrected in December. 

PY035 variable was not collected during the 2024 data collection, however it is planned to be included in the 2026 data collection.

12.3.1. Data completeness - rate

Not available.


13. Accuracy Top
13.1. Accuracy - overall

According to Reg. (EU) 2019/1700 Annex II, precision requirements for all data sets are expressed in standard errors and are defined as continuous functions of the actual estimates and of the size of the statistical population in a country or in a NUTS 2 region. For the income and living conditions domain, the estimated standard errors of the following indicators are examined according to certain parameters set:

  • Ratio at‐risk‐of‐poverty or social exclusion to population
  • Ratio of at‐persistent‐risk‐of‐poverty over four years to population
  • Ratio at‐risk‐of‐poverty or social exclusion to population in each NUTS 2 region

Further information is provided in section 13.2 Sampling error.

13.2. Sampling error

EU-SILC is a complex survey involving different sampling designs in different countries. In order to harmonize and make sampling errors comparable among countries, Eurostat has chosen to apply the "linearization" technique coupled with the “ultimate cluster” approach for variance estimation.

Linearization is a technique based on the use of linear approximation to reduce non-linear statistics to a linear form, justified by asymptotic properties of the estimator. This technique can encompass a wide variety of indicators, including EU-SILC indicators. The "ultimate cluster" approach is a simplification consisting in calculating the variance taking into account only variation among Primary Sampling Unit (PSU) totals. This method requires first-stage sampling fractions to be small, which is nearly always the case. This method allows a great flexibility and simplifies the calculations of variances. It can also be generalized to calculate the variance of the differences of one year to another.

The main hypothesis on which the calculations are based is that the "at risk of poverty" threshold is fixed. According to the characteristics and availability of data for different countries, we have used different variables to specify strata and cluster information. 

In particular, countries have been split into 3 groups:

  1. BE, BG, CZ, IE, EL, ES, FR, HR, IT, LV, HU, PL, PT, RO, SI, UK and AL, whose sampling design could be assimilated to a two-stage stratified type we used DB050 (primary strata) for strata specification and DB060 (Primary Sampling Unit) for cluster specification.
  2. DK, DE, EE, CY, LT, LU, NL, AT, SK, FI, CH whose sampling designs could be assimilated to a one-stage stratified type, we used DB050 for strata specification and DB030 (household ID) for cluster specification.
  3. MT, SE, IS, NO, whose sampling design could be assimilated to a simple random sampling, we used DB030 for cluster specification and no strata.
13.2.1. Sampling error - indicators

The concept of accuracy refers to the precision of estimates computed from a sample rather than from the entire population. Accuracy depends on sample size, sampling design effects and structure of the population under study. In addition to that, sampling errors and non-sampling errors need to be taken into account. Sampling error refers to the variability that occurs at random because of the use of a sample rather than a census and non-sampling errors are errors that occur in all phases of the data collection and production process.

Main indicators, standard error and CI at country level

  AROPE At risk of poverty Severe Material and Social Deprivation Very low work intensity
  Ind. Stand. errors 95% CI Ind. Stand. errors 95% CI Ind. Stand. errors 95% CI Ind. Stand. errors 95% CI
  value L U value   L U value L U value L U
Total 19.3 0.51 18.3 20.3 14.3 0.47 13.4 15.2 9.3 0.40 8.5 10.0 4.7 0.28 4.2 5.3
Male 18.2 0.59 17.1 19.4 13.4 0.54 12.4 14.5 8.8 0.44 8.0 9.7 4.9 0.30 4.3 5.5
Female 20.2 0.54 19.2 21.3 15.1 0.50 14.2 16.1 9.6 0.43 8.8 10.5 4.5 0.36 3.8 5.2
Age 0-17 22.9 1.11 20.7 25.1 17.4 1.07 15.3 19.5 12.9 0.95 11.0 14.7 4.1 0.58 3.0 5.2
Age18-64 17.5 0.53 16.5 18.6 12.8 0.50 11.9 13.8 8.4 0.37 7.7 9.1 4.9 0.26 4.4 5.4
Age 65+ 21.3 0.76 19.8 22.8 16.1 0.67 14.8 17.4 8.6 0.55 7.6 9.7 - - - -

 

Main indicators, standard error and CI at NUTS 2 level

DB040 AROPE At risk of poverty Severe Material and Social Deprivation Very low work intensity
Ind. Stand. errors 95% CI Ind. Stand. errors 95% CI Ind. Stand. errors 95% CI Ind. Stand. errors 95% CI
value L U value L U value L U value L U
HU11 11.1 0.73 9.6 12.5 7.8 0.64 6.5 9.0 4.0 0.50 3.0 4.9 3.0 0.42 2.2 3.8
HU12 20.0 2.06 16.0 24.1 12.4 1.73 9.0 15.8 11.4 1.56 8.4 14.5 3.9 1.02 1.9 5.9
HU21 10.2 1.42 7.4 13.0 7.1 1.40 4.4 9.9 4.0 0.54 3.0 5.1 1.6 0.44 0.7 2.5
HU22 11.6 1.19 9.3 13.9 8.9 1.06 6.8 10.9 3.8 0.51 2.8 4.8 3.1 0.63 1.9 4.4
HU23 23.7 2.35 19.1 28.3 17.8 2.25 13.4 22.2 11.9 1.43 9.1 14.8 6.2 1.49 3.3 9.1
HU31 28.8 1.66 25.5 32.0 21.3 1.57 18.2 24.4 18.1 1.63 14.9 21.3 6.9 1.14 4.7 9.2
HU32 28.0 1.56 25.0 31.1 23.0 1.41 20.2 25.8 13.6 1.66 10.4 16.9 7.2 1.06 5.1 9.3
HU33 22.0 1.88 18.3 25.7 17.4 1.69 14.1 20.7 8.1 1.05 6.0 10.2 6.0 0.87 4.3 7.7

 

Persistent-risk-of-poverty ratio over four years to the population, standart error and CI

  Persistent‐risk‐of‐poverty
  Ind. Stand. Errors 95% CI
  value   L U
Total 6.7 0.8 5.1 8.3
13.3. Non-sampling error

Non-sampling errors are basically of 4 types:

  • Coverage errors: errors due to divergences existing between the target population and the sampling frame.
  • Measurement errors: errors that occur at the time of data collection. There are a number of sources for these errors, such as the survey instrument, the information system, the interviewer and the mode of collection.
  • Processing errors: errors in post-data-collection processes such as data entry, keying, editing and weighting.
  • Non-response errors: errors due to an unsuccessful attempt to obtain the desired information from an eligible unit. Two main types of non-response errors are considered:
    • Unit non-response: refers to absence of information for the whole units (households and/or persons) selected into the sample.
    • Item non-response: refers to the situation where a sample unit has been successfully enumerated, but not all required information has been obtained.
13.3.1. Coverage error

Coverage errors include over-coverage, under-coverage and misclassification:

  • Over-coverage: relates either to wrongly classified units that are in fact out of scope, or to units that do not exist in practice. - Since 2022 the sampling frame of HU-SILC is the Register of Dwellings. It may have 15-20% over-coverage due to empty dwellings, non-residential buildings etc.
  • Under-coverage: refers to units not included in the sampling frame. - Our sampling frame may have non-negligible under-coverage, but its' extent is not yet quantified.
  • Misclassification: refers to the incorrect classification of units that belong to the target population - Misclassification may affect the stratification variables (especially that used for the stratification of adresses) but its' excent is not yet quantified.
13.3.1.1. Over-coverage - rate

Coverage error

Main problems

Population (sub-population)

Size of error

Comments

Over-coverage

empty dwellings, non-residential buildings 

15-20% 

 

Under-coverage

 

not yet quantified 

 

Misclassification

 

not yet quantified  

 

13.3.1.2. Common units - proportion

Not appliacble. We use data only from the survey. 

13.3.2. Measurement error

Measurement error for cross-sectional data

Cross-sectional data

Source of measurement errors

Building process of questionnaire 

Interview training

Quality control

 

Based on the experiences of the previous waves (HU-SILC2005 -2023) the following steps were done:

Since the core part  of the survey is standard according to the regulation no cognitive testing was done regarding the core part.

Module sections, namely 3-year rolling module (Children) and 6-year rolling module (Access to services):

The 3-year rolling module was already included in SILC, so we did not introduce cognitive testing for it in the 2024 wave. The 6-year rolling module had not been included in the SILC survey so far, therefore it was necessary to examine the questions through cognitive testing.

The questionnarie was built up on the basis of the corresponding Methodological guideline (docSILC065) and and our previous wave experiences.

After the finalization of the questionnaries the IT develeopment of CAPI and CATI questionnarie were done. 

 

 

 

After the finalization of the questionnaire a detailed Interviewer’s manual was prepared to help the interviewers during and even before the data collection.

As a preparation for data collection - interviewers received the questionnaire and Interviewer’s manual by the end of  January 2024 on their tablets.  After a week of individual preparation, a comprehensive training was held for interviewers in small groups via online platform Webex. The training covered the data collection process, the detailed topics of the questionnaire by chapters. All the interviewers participating in the data collection had to pass a test after the training with 80% compliance rate. After passing the test they had to fill two mock-up questionnaires as a test and 2 mock-up non-response questionnaires and if those were accepted by the monitoring team than the interviews were allowed to work and start the real data collection interviews.  The data collection started on 1 February and lasted till 15 April. If during the data collection period a new interviewer is employed than he/she had to go through this training process before going to the field.  During the data collection interviewers can report problems and ask clarification from the monitoring head. The monitoring colleagues share the problems and solutions thus provide unified responses to queries from interviewers. 

First version of CAPI and CATI  questionnraie programs were tested by the colleagues of Living Standard Statistics Section and then by the colleagues of Department of Household Surveys Data Collection. After the collection of any bugs or problems the IT programs are finalized.

During the fieldwork 1 person is the head of any actions and problems related to the data collection in the Department of Household Surveys Data Collection. He/she coordinated the work of the regional outlets and provided answers for any questions from the interviewers by phone and email. 

During the data collection field work a systematic quality controll was used. The monitoring colleagues monitored the progress of the work, regularly reported the number of visited addresses, completed interviews, non-response rate, etc. 

There was a continuous quality check on the completed questionnaires and interviewer's performance as well by the monitoring colleagues.

The supervisors randomly called the households with completed interviews by phone and asked about the interviewer (whether the interviewer visited the households, was he/she polite, etc.). In case any problem detected the interviewers' work went trough strict controll.

 

13.3.3. Non response error

Non-response errors are errors due to an unsuccessful attempt to obtain the desired information from an eligible unit. Two main types of non-response errors are considered: unit non-response and item-non response.

1) Unit non-response which refers to the absence of information of the whole units (households and/or persons) selected into the sample. According to Annex VI of the EU regulation 2019/2242

  • Household non-response rates (NRh) is computed as follows:

NRh=(1-(Ra * Rh)) * 100

Where Ra is the address contact rate defined as:

Ra= Number of address/selected person (including phone, mail if applicable) successfully contacted/number of valid addresses/selected person (including phone, mail if applicable) selected

and Rh is the proportion of complete household interviews accepted for the database

Rh=Number of household interviews completed and accepted for database/Number of eligible households at contacted addresses (including phone, mail if applicable)

  • Individual non-response rates (NRp) is computed as follows:

NRp=(1-(Rp)) * 100

Where Rp is the proportion of complete personal interviews within the households accepted for the database

Rp= Number of personal interview completed/Number of eligible individuals in the households whose interviews were completed and accepted for the database.

  • Overall individual non-response rates (*NRp) is computed as follows:

*NRp=(1-(Ra * Rh * Rp)) * 100

For those Members States where a sample of persons rather than a sample of households (addresses, phones, mails etc.) was selected, the individual non-response rates will be calculated for the selected respondent.

2) Item non-response which refers to the situation where a sample unit has been successfully enumerated, but not all the required information has been obtained.

13.3.3.1. Unit non-response - rate

Unit non-response rate for cross-sectional

Address (including phone, mail if applicable) contact rate

Complete household interviews

Complete personal interviews

Household Non-response rate

Individual non-response rate

Overall individual non-response rate

(Ra)

(Rh)

(Rp)

(NRh)

(NRp)

(NRp)*

A

B

C

A

B

C

A

B

C

A

B

C

A

B

C

A

B

C

 65.98

75.84

53.36 

37.01

48.36

27.08

 97.24

99.20

96.80

75.58

63.33

85.55

2.76

0.80

3.20

76.25

 63.62

86.01

where

A=total (cross-sectional) sample,

B =New sub-sample (new rotational group) introduced for first time in the survey this year,

C= Sub-sample (rotational group) surveyed for last time in the survey this year.

 

Some explanation to the data: The most influential and controversial factor in the nonresponse rates is the denominator of "Ra": "number of valid addresses selected". SILC is a panel survey so we select the sample households when they first enter the survey. After that there is no sample selection, only panel attrition. The number of selected valid addresses for every rotational groups are the number of eligible addresses (existing addresses for living purpose) selected in the initial period of the rotational group. (We propose to avoid the usage of "response rate"/"non-response rate" definitions for t/(t-1) type ratios, because these do not reflect the possible effects of non-response on panel survey quality.) In our case there is an additional temporary factor which somewhat spoils our response rates. Until 2021 the SILC inherited the respondents' sample of HBS so only HBS had a selected samlpe. From 2022 the SILC has its own sample what is going to improve the response rates.

13.3.3.2. Item non-response - rate

The computation of item non-response is essential to fulfil the precision requirements. The item non-response rate is provided for the main income variables, both at household and personal level.

Item non-response refers to the situation where a sample unit has been successfully enumerated, but not all the required information has been obtained.

13.3.3.2.1. Item non-response rate by indicator

See Annex 2.

13.3.4. Processing error

 Description of data entry, coding controls and the editing system

Data entry and coding

(if any used)

Editing controls

We follow the standard processes for Social Data collection in HCSO and it is valid for SILC as well. Unified IT program is used for any data collection method (CAPI, CATI). The program has built in checks for controlling data entry based on data received during filling the questinnarie. The controll are based on the information related to the socio-rdemographic information of the respondents, age, educ level, etc) and information they had given to sorting questions or based on variable related to each other.  Consistency controll between varaibles. Coding is done by IT programs as well. 

After the data entry and coding a raw dataset is built. During the control we check the household structure the number of persons and wether all the persons are grouped to the correct households. The number of records in the achieved sample and then we select the succesfull interviews and the corresponding persons and households. There is a text content and misspelling control, value controll for the numeric variables. Extreme value checks and further consistency controll between variables.  ID control, Outlier control, data consistency checking (e.g. basic demographic data – highest education level attained; basic demographic data – economic status;  economic status under the income reference period – the income components), controlling of the amount of social transfers.

Sample persons, co-residents and sample households are observed in the survey over the duration of the  four year panel period.

Sample persons moving to a private household within the national territory are covered in the survey and followed to the location of the household.

Sample persons who are no longer members of a private household, or who have moved outside the national territory are dropped from the survey.

Co-residents living in a household containing at least one sample person is followed.

A sample household shall be dropped from the survey in the following situations:

(a) the household was not enumerated for a single year due to either of the following reasons: (1) the address was impossible to locate; (2) the address was non-residential or unoccupied; (3) there was no information on what happened to the household (the household was lost); (4) the household refused to co-operate;

(b) the household was not contacted in the first year of the panel or in two consecutive years of the panel due to either of the following reasons: (1) it was not possible to access the address; (2) the whole household was temporarily away or unable to respond due to incapacity or illness or for other serious reasons.

 

Re-interview rates by wave

Re-interview rates Wave 2 Wave 3 Wave 4
(a) individuals in interviewed households % 69.8 60.2 54.3
(b) individuals out of scope % 7.5 10.3 12.1
(c) individuals not interviewed for reasons other than their being out of scope % 22.7 29.6 33.6

 

13.3.5. Model assumption error

Not appliacable.


14. Timeliness and punctuality Top
14.1. Timeliness

Timeliness of information reflects the length of time between its availability and the event or phenomenon it describes.

This section refers to the timing of dataset sending to Eurostat and first publication of results from the data collection.

HU-SILC 2024 dataset was set sent to Eurostat in 20 December 2024. At the time of compliling the report, the database has already been validated.

The first results of the data collection were published in 22 October 2024 in a form of a comprehensive study and diagram set. At the same time the data on the STADAT database was also avalable in the section Living conditions.  All data are considered as preliminary untill we receive the validation certificate from Eurostat.

14.1.1. Time lag - first result

Time lag between the reference period (income year 2023) and first results publication (October 22, 2024) is 9 months.

14.1.2. Time lag - final result

The final result will be published in the STADAT database in June 2024.

Time lag between the reference period (income year 2023) and final results publication is 17 months.

14.2. Punctuality

This section describes the lag between the actual time of the data delivery and the target delivery date.

14.2.1. Punctuality - delivery and publication

The actual devilery date was 20 December 2024 to Eurostat. The target date was 31 December 2024. There was a difference of 11 days between the actual date and the target date.

The actual publication date was 22 October 2024 on our website. It was done according to the predefined internal schedule. No lag between the dates. 


15. Coherence and comparability Top
15.1. Comparability - geographical

The result are comparable in NUTS2 level. The key indicators are avalable by NUTS regions on our website in a time series from 2014 (see 5.1.2. chapters on the link). The key indicators are calculated according to the new AROPE concept.

15.1.1. Asymmetry for mirror flow statistics - coefficient

Not applicable.

15.2. Comparability - over time

Due to the revision in 2025, there was a break in the series starting from the 2019 recording.

See Annex 8.

15.2.1. Length of comparable time series

The last break in the series occurred at the 2019 recording, therefore the comparable period is 6 years from 2019 to 2024.

15.2.2. Comparability and deviation from definition for each income variable

Comparability and deviation from definition for each income variable

Income

Identifier

Comparability

Deviation from definition if any

Total hh gross income

(HY010)

 F

 none

Total disposable hh income

(HY020)

 F

 none

Total disposable hh income before social transfers other than old-age and survivors' benefits

(HY022)

 F

 none

Total disposable hh income before all social transfers

(HY023)

 F

 none

Income from rental of property or land

(HY040)

 F

 none

Family/ Children related allowances

(HY050)

 F

 none

Social exclusion payments not elsewhere classified

(HY060)

 F

 none

Housing allowances

(HY070)

 F

 none

Regular inter-hh cash transfers received

(HY080)

 F

 none

Alimonies received

(HY081)

 F

 none

Interest, dividends, profit from capital investments in incorporated businesses

(HY090)

 F

 none

Interest paid on mortgage

(HY100)

 F

 none

Income received by people aged under 16

(HY110)

 F

 none

Regular taxes on wealth

(HY120)

 F

 none

Taxes paid on ownership of household main dwelling

(HY121)

 F

 none

Regular inter-hh transfers paid

(HY130)

 F

 none

Alimonies paid

(HY131)

 F

 none

Tax on income and social contributions

(HY140)

 F

 none

Repayments/receipts for tax adjustment

(HY145)

 NC

 none

Value of goods produced for own consumption

(HY170)

 F

 none

Cash or near-cash employee income

(PY010)

 F

 none

Other non-cash employee income

(PY020)

 F

 none

Income from private use of company car

(PY021)

 F

 none

Employers social insurance contributions

(PY030)

 F

 none

Contributions to individual private pension plans

(PY035)

 NC

 none

Cash profits or losses from self-employment

(PY050)

 F

 none

Pension from individual private plans

(PY080)

 F

 none

Unemployment benefits

(PY090)

 F

 none

Old-age benefits

(PY100)

 F

 none

Survivors benefits

(PY110)

 F

 none

Sickness benefits

(PY120)

 F

 none

Disability benefits

(PY130)

 F

 none

Education-related allowances

(PY140)

 F

 none

F= Fully comparable; L= Largely comparable; P= Partly comparable and NC= Not collected.

15.3. Coherence - cross domain

The coherence of two or more statistical outputs refers to the degree to which the statistical processes by which they were generated, used the same concepts and harmonised methods. A comparison with external sources for all income target variables and the number of persons who receive income from each ‘income component’ will be provided where the Member States concerned consider such external data to be sufficiently reliable.

15.3.1. Coherence - sub annual and annual statistics

Not applicable.

15.3.2. Coherence - National Accounts

Coherence with National Accounts for income variables

EU-SILC variables National Accounts item (S14) Coverage rate (calculated as EU-SILC and NA ratio) EU-SILC growth rate (nominal, year to year) National accounts growth rate (nominal, year to year)
Employee income:  D11/rec Wages and salaries 72% 13% 16%
PY010G Employee cash or near cash income+ PY021G Company car  
Income from self-employment: B3g Mixed income, gross 51% 62% 13%
PY050G Cash benefits or losses from self-employment  
Social benefits other than social transfers in kind:    77% 13% 17%
HY050G Family/children related allowances  
+HY060G Social exclusion not elsewhere classified  
+PY090G  Unemployment benefits  
+ PY100G  Old-age benefits   
+ PY110G Survivor’ benefits + PY120G Sickness benefits   
+ PY130G Disability benefits +PY140G Education-related allowances   
+ HY070G Housing allowances   
   
Social contributions and taxes on income paid:  D61/use: net social contributions 59% 30% 16%
HY140G  Tax on income and social contributions + D51/use: taxes on income
Total disposable household income HY020 B6 Gross disposable income 52% 13% 17%
15.4. Coherence - internal

There are no known internal logical inconsistencies.


16. Cost and Burden Top
Obs  country duration_HH_24  duration_Per16plus_24 

 HU

45.9 

14.2 

 

Mean (average) interview duration for selected respondents not applicable.


17. Data revision Top
17.1. Data revision - policy

The revision of the annual datasets from the 2019–2024 data collections was carried out in the second quarter of 2025, in accordance with the pre-defined schedule.

The primary objective of the data revision was to reweight the datasets using benchmark figures derived from the results of the 2022 census. An additional aim was to improve methodological procedures, with a particular focus on fine-tuning the grossing-up, imputation, and correction methods for income data. These improvements were mainly intended to eliminate previously observed clustering in the income distribution, reduce excessive volatility in the at-risk-of-poverty gap, and resolve earlier inconsistencies between gross and net incomes. As a result of the revision, the accuracy of poverty-related indicators and the internal consistency of the data have significantly improved.

The new weights affected all indicators. The methodological adjustments and refinements in data processing concerned the income variables; therefore, the indicators that rely on income data were subject to more substantial changes.

17.2. Data revision - practice

Main Steps of the Revision Procedure

The revision process included verification of income classifications, logical and content consistency checks, determination of gross/net incomes, correction and imputation of income data, weighting, and quality checks with macro-level validation.

Detailed Description of the Revision Procedure

Income classifications were reviewed and corrected according to Eurostat definitions. Logical and consistency checks ensured demographic plausibility and coherence between employment status and income. Gross and net incomes were determined based on tax and social contribution rules for the relevant years. Missing income data were supplemented using aggregated administrative data as well as stratified averages and medians. Employee incomes were imputed through individual-level linkage with data from the tax authority. From 2023 onwards, individual-level administrative linkage for old-age pensions was also initiated. Survey weights were updated using the 2022 Population Census as a reference, and calibration ensured alignment with control totals. Quality checks included internal consistency analyses and reconciliation with macro-level aggregates to ensure plausibility and reliability of the estimates.

See more data about the revision in annex 11.

17.2.1. Data revision - average size

The revision of the 2019-2024 data collections was implemented in the second quarter of 2025.


18. Statistical processing Top

Detailed information concerning sampling frame, sampling design, sampling units, sampling size, weightings and mode of data collection can be found in this section (please see below). Such information is mainly used for the computation of the accuracy measures.

18.1. Source data

HU-SILC is direct data collection. All the information are derived from the data collected from the respondent during the field work. 

18.1.1. Sampling Design

The concept of accuracy refers to the precision of estimates computed from a sample rather than from the entire population. Accuracy depends on sample size, sampling design effects and structure of the population under study. In addition to that, sampling errors and non sampling errors need to be taken into account. Sampling error refers to the variability that occurs at random because of the use of a sample rather than a census and non-sampling errors are errors that occur in all phases of the data collection and production process.
We use two-staged stratified sampling.
At stage 1 we have a stratified sample of localities with pps selection. At stage 2 (in the sampled localities) we have a stratified simple random sample of addresses.
At stage 1 the population of localities are stratified. Each of the larger localities is a stratum of its own. These are the self-representing localities, the number of which is 126. Smaller localities are stratified by NUTS2 regions, personal income tax base per capita, the size of locality and the prevalence of internet usage.
At stage 2 the adresses are stratified by the estimated value of dwelling.
At stage 1 localities were selected with pps without replacement.
At stage 2 addresses within each strata were selected with srs without replacement.

Actual and achieved sample size:

 

 Obs

 DB020

Actual_SSize 

Achieved_SSize 

 HU

16105 

8943 

 

 

 

 

Achieved sample size:

 

Obs 
DB020 HU
number_of_hh2024 8943
persons_16_over2024 16425
selected_respondents2024 .
NewRG  3
 newRG_number_of_hh2024  2705
 percent2  30.25
 OldRG  4
 OldRG_number_of_hh2024  1951
 percent3  21.82

Number and percentage of Proxy interview:

Obs 

PB010 

 PB020

proxy 

total 

proxy_rate 

2024 

 HU

5552 

16425 

33.8 

 

 

 

 

 

Longitudinal information 2021-2024:

Achieved household sample size

Obs 

 country

 DB020

wave2324 

wave222324 

wave21222324 

 HU

 HU

6204 

3825 

1935 

 

 

 

 

Achieved individual sample size 

Obs 
RB020  HU
TOTAL2324  13367
SAMPLEPER2324  11103
CORES2324 2264
TOTAL222324 7913
SAMPLEPER222324 6714
CORES222324 1199
TOTAL21222324 3946
SAMPLEPER21222324 3375
CORES21222324 569
18.1.2. Sampling unit

At stage 1 the sampling units are localities.

At stage 2 the sampling units are addresses.

18.1.3. Sampling frame

Since 2022 the sampling frame of HU-SILC is the Register of Dwellings. 

18.2. Frequency of data collection

HU-SILC is an annual data collection. It is carried out in every year in a fixed schedule time frame for the data collection.

Fieldwork timing and sample development over time HB050.

18.3. Data collection

Mode of data collection

 

1-PAPI

2-CAPI

3-CATI

4-CAWI

5-PAPI proxy

6-CAPI-proxy

7-CATI-proxy

8-CAWI proxy

9-other

% of total

 

37.0

63.0

 

 

 

 

 

 

Description of collecting income variables

The source or procedure used for the collection of income variables

The form (gross, net) in which income variables at component level have been obtained

The method used for obtaining target variables in the required form

All the income variables were collected from the respondents. The income target variables were grouped into more detailed sub-components according to Hungarian tax and benefit system.  

For taxable incomes (employment income, business income, certain family benefits), gross amounts are collected. For non-taxable incomes (e.g., pensions), net amounts are collected.

1. Separation of personal and household incomes

Personal incomes: income linked to each household member individually, such as employment income, pensions, and personal benefits.

Household incomes: certain types of income that pertain to the household as a whole or involve multiple members, e.g., joint agricultural income or income from household enterprises.

Household-type incomes are always allocated to the relevant household members, typically based on the type of income.

2. Treatment of gross and net incomes

Employment income, personal income-related benefits, and taxable social benefits: gross values are collected and subsequently converted to net values according to applicable tax and social security regulations.

Tax-exempt benefits, pensions, and certain family allowances: net amounts are collected, as these do not constitute taxable income.

3. Data processing steps

Aggregation of subcomponents: subcategories collected in the questionnaire (e.g., base salary, bonuses, overtime) are summed at the individual level.

Household-level aggregation: all personal and household incomes are combined to obtain total household gross income.

 

Fieldwork duration

Obs

HB010

HB020

start_day

start_month

start_year

end_day

end_month

end_year

1

2024

HU

1

2

2024

15

4

2024

18.4. Data validation

Dataset validation is done by Eurostat.

18.5. Data compilation

Regarding income items we do use imputation. Completing the income variables during data collection is a condition for accepting the questionnaire. For the imputation procedure, the Hungarian Central Statistical Office utilizes administrative data obtained from the National Tax and Customs Administration and the Hungarian State Treasury.

The imputed rent is calculated by the Housing Statistics Section. 

The genaral approach we use for imputed rent determination:

Hungary has got a special housing market situation in the aspect of imputed rental calculation. The share of market rental sector is cca. 5-8 %. Owner occupiers constitute 95-92 % of the total housing market.  Personal attitudes and social circumstances make stronger the role of private property in the housing market. Geographical and physical attributes and foundemantally  the location of the dwelling within the country determines mostly the value of a dwelling, and possibility to let it on the rental market. Comparison of standard of living on the basis of EU-SILC survey between different social groups is not affected by the minor groups of market renters. The calculation of imputed rent is reasoned by international comparison of data within EU.

The imputed rent was calculated based on stratification method. The dwellings in HU-SILC were startified into 30 clusters according to geographical location of the dwelling since it is the key factor which determines the market rent. Then the avarage rents per 1 square meter corresponding to each geographical unit coming from Dwelling statistics were applied. The imputed rent was calculated as average rent per square meter in the given geo strata multiplied by actual square meter of the dwelling.   We consider this approach as the best available to estimate imputed rent for the time being in the context of small rental market.

18.5.1. Imputation - rate

See 18.5. and 13.3.4.

18.5.2. Weighting methods

The weighting of EU-SILC is regulated in detail by chapter 8 of „Methodological Guidelines and Description of EU-SILC Target Variables” so we follow the principles laid down there. Our primary goal is to create cross-sectional weights for the current years’ sample and longitudinal weights for the 2, 3 and 4 years long panel subsamples. We create weights for every rotational groups separately to form the starting points (base weights) of final weights. The process starts with a common cross-sectional weighting of the new rotational groups. The initial weights are the design weights (the reciprocal of the inclusion probabilities) which are adjusted by calibration to get the cross-sectional weights. These weights are updated on personal level by logistic regression models to get the next periods’ base weights.  There are two parts of this modelling: first we classify persons with unknown status into „in scope” and „out of scope” categories to identify every persons who have left the target population (died, moved abroad or to an institution). Then we model the probability of remaining in respondents’ sample for every „in scope” persons (panel attrition). These models use various explanatory variables: geographical area, gender, age, economic activity, employment status, educational attainment, income, health related indicators, dwelling type, tenure status and some indicators of well-being. As we should avoid extreme dispersion of final weigths it is advisable to limit the weight range. The next step is the creation of weights of people arrived to the sample households by the next period.  Then these personal weights are transformed to household weights by generalized weight share method to form the initial weights of a calibration. The calibrated weights are reduced proportionally to the sample size of the rotational groups. The resulting weights are used again as primary weights  of a calibraton which renders the cross-sectional weights of a given year. If it is the final year of the panel then these are the RB050 and DB090 weights. The before mentioned calibrations are implemented by a raking ratio method. We apply a relatively large table of control totals which has two parts. The first part contains the demoghrapic control numbers by regions: 0-14 years old women/men, 15-29 years old women/men, 30-59 years old women/men, 60+ years old women/men (8*8 control numbers). The second part contains control totals related to economic activity and education broken down by 3 settlement types: Budapest – other bigger cities – other settlements. These are the number of employees, enterpreneurs, pensioners, unemployed, students, employed people broken down by schooling, number of households broken down by the number of children and single person households (3*16 control totals).  We calibrate iteratively to the 2 sets of control numbers and apply weight trimming to avoid influential records. The creation of longitudinal weights (RB062-RB064) is based on the initial years’ base weights and the modelling of panel attrition as described before.

18.5.3. Estimation and imputation

See Annex 6.

18.6. Adjustment

Not applicable.

18.6.1. Seasonal adjustment

Not applicable.


19. Comment Top

Household definition: In our case the sample is based of the list of addresses. If there is a person (age 18+) who is studying and living away in a rented flat, other private address, than she/he is not considered as a household member in the selected and responding household.  Regarding students in dormitory - they are out of scope. 


Related metadata Top


Annexes Top
HU_2024_Annex 1 - National questionnaire_EN.docx
HU_2024_Annex 1 - National questionnaire_HU.docx
HU_2024_Annex 2-Item_non_response_13.3.3.2.1_revisionJAV.xlsx
HU_2024_Annex 3-Sampling_errors_13.2_revision
HU_2024_Annex 4-Data_collection_18.3_revision
HU_2024_Annex 5 - Weighting procedures .docx
HU_2024_Annex 6_Estimation_and_imputation_revision.docx
HU_2024_Annex 7-Coherence_15.3-15.3.2_revision
HU_2024_Annex 8-Breaks in series_15.2_revision
HU_2024_Annex 9-Rolling module.docx
HU_2024_Annex A EU-SILC - content tables_revision
HU_Annex 11 EU-SILC Data revision_2019_2024_final