The sample of the Survey of Young People in Egypt 2009 (SYPE) was designed in such a way as to be representative at the national as well as regional levels. The sample size of approximately 17,000 young people between the ages of 10 and 29 was selected to provide estimates of key indicators related to adolescents and youth for the country as a whole and for four administrative regions (Urban governorates, Lower Egypt governorates, Upper Egypt governorates and the Frontier governorates), and, where relevant, for the urban and rural segments of these regions. These indicators include never enrollment rates, dropout rates, the incidence of child labor, and unemployment rates. Based on previous statistics about the incidence of young people in the relevant age and sex groups, we determined that a nationally-representative sample of 11,000 households would be sufficient. To obtain accurate estimates for the Frontier governorates, these governorates had to be oversampled. As a result, the SYPE is not a self-weighted sample and weights are needed to obtain the correct estimates.
The SYPE sample was designed as a multi-stage stratified cluster sample. The primary sampling units (PSUs) were selected from a CAPMAS master sample. The master sample is a stratified cluster sample that contains 2,400 PSUs, divided into 1,080 urban and 1,320 rural PSUs. These PSUs are drawn from a frame of enumeration areas (EAs) covering the entire country prepared by CAPMAS from the 2006 Population Census. Each EA is drawn up in such a way as to contain roughly 1500 dwelling units. The sample is stratified into governorates and each governorate is further stratified into urban and rural segments, where relevant. The distribution of PSUs across strata in the master sample reflects the distribution of the population so as to produce a self-weighted sample.
To achieve a fairly wide geographic dispersion in the SYPE sample and thus minimize the design effect, we set the number of households per cluster to 25. To obtain these 25 households, 25 dwelling units were systematically selected from the roughly 1500 listed in each EA. To get the sample size we needed, we set the number of required PSUs to 455, for a total sample size of 11,375 households. The distribution of PSUs across governorates and urban and rural areas in both the master sample and the SYPE sample are shown in (Table 1 in Appendix C of the final report available among the external resources). The final sample of households interviewed was made up of 11,372 households, which yielded a total of 15,029 young people aged 10-29.
The PSU's in the SYPE sample were drawn from the EA's in the master sample at a rate of roughly 19%-20%. With the exception of the Frontier Governorates and the Luxor administrative area, the sampling rate varies in a relatively narrow range from 14% to 27%. To get good representation from the sparsely populated Frontier Governorates, we increased the sampling rate significantly, in some cases retaining all the PSU's in the master sample. Weights will be derived at the level of the administrative region to account for these varying sampling rates.
Selecting the Urban Slums Sub-Sample
One of the objectives of SYPE is to obtain separate estimates for young people living in urban slums (referred to in the final report chapters as informal urban areas). To make sure we had enough representation of urban slums, we used a study conducted by the Information and Decision Support Center of the Egyptian Cabinet of Ministers (IDSC) to classify urban PSU's in the CAPMAS master sample into slum and non-slum areas. Deciding how to allocate urban PSUs to slum and non-slum areas was not a straightforward exercise given the unreliability of the data on the population of the slum areas.
First, we had to make a decision on how to allocate the 212 urban PSUs to slum and non-slum PSUs. The most reasonable estimate of the share of slums in the urban population was close to 20%, leading us to allocate 44 of the 212 urban PSU's in the sample to slum areas. Second, we had to allocate these 44 slum PSUs to the various governorates. This allocation was done in such a way as to match as closely as possible, the distribution of the number of slum areas across governorates is shown in (Table 2 in Appendix C of the final report available among the external resources).
Ideally, we should have allocated slum PSUs across governorates according to each governorate's share of slum population rather than its share in the number of slum areas. However, given the unreliable information about the population of slum areas, it was impossible to do the allocation in terms of population. This allocation decision is likely to understate the true share of slums in governorates such as Cairo, Giza and Alexandria, where the size of slums is likely to be larger than average, and overstate slum populations in governorates like Damietta, Dakahlia and Sharkia where the size of slums is probably smaller than average. Without reliable data on slum populations, it is unfortunately not possible to use weights to correct for this possible bias in the geographic distribution of slums.
** More information on the sampling procedures is available in Appendix C in the English final report available among the external resources.