Combining Deep Learning and Crowd-sourcing Images to Predict Housing Quality in Rural China

doi:10.21203/rs.3.rs-1973264/v1

Download PDF

Article

Combining Deep Learning and Crowd-sourcing Images to Predict Housing Quality in Rural China

https://doi.org/10.21203/rs.3.rs-1973264/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Housing quality is an essential contributor to human well-being, security and health. Monitoring the housing quality is crucial for unveiling socio-economic development status and providing political proposals. However, it is exceedingly scarce to depict the nationwide housing quality in large-scale and fine-granularity in remote rural areas owing to the high cost of canonical survey methods. Taking rural China as an example, we collect massive rural house images for housing quality assessment by various volunteers and further build up a deep learning model based on the assessed images to realize an automatic prediction for huge raw house images. As a result, the model performance achieves a high R² of 0.76. Afterward, the housing qualities of 10,000 Chinese villages are predicted based on 50,000 unlabeled geo-images, and an apparent spatial heterogeneity is uncovered. Specifically, divided by Qinling Mountains-Huaihe River Line, housing quality in southern China is much higher than in northern China. Our method provides high-resolution estimates of housing quality across the extensive rural area, which could be a complementary tool for automatically monitoring housing change and supporting house-related policy-making.

In daily life, humans carry out a bulk of activities in their household houses; therefore, the housing quality determines their living quality and well-being degree. However, there still are massive people facing a severe housing problem all over the world, especially in the rural area. In the US, more than 6.7 million rural households live in houses lacking necessary domestic facilities while where they spend over 30% of their income¹; in China, although dwelling condition has improved significantly since 1978^2,3, hundred million villages are eagerly looking forward to household houses with high quality^4,5. Following the Sustainable Development Goal proposed, the accessibility of safe and affordable houses has been a significant indicator of eliminating poverty and promoting well-being⁶.

In China, housing is not only associated with the living quality but also the primary carrier containing lifetime laboring achievement of rural residents⁷, which possesses a time-honored history that regards house as the representative of family, wealth and even marriage^{8, 9–11}. Hence, rural housing quality is considered the barometer of rural wealth status¹². Consequently, depicting the global distribution of the housing quality across the rural area at a finer granularity and further unveiling its spatial pattern is of remarkable significance for understanding villagers’ living situations and exploring the association between housing quality and rural wealth.

Currently, researching housing quality in rural areas mainly relies on questionnaires and field investigations, such as Demographic and Health Surveys¹³ and China General Social Survey¹⁴, to require detailed data on the individual house. For instance, leveraging the house area per capita data from China Household Finance Survey to evaluate the housing quality, Wang finds the inequality of house assets in rural China is much higher than in urban area¹². Similarly, housing size, construction materials and domestic facilities are also adopted to assess housing quality and analyze the socio-economic development in rural areas^15–17. However, these traditional data collection approaches are costly, inefficient, and limited to capturing well-rounded and large-scale housing quality data to depict a nationwide spatial distribution and explore the holistic pattern.

Recently, many scholars get remarkable success in the quality estimation of urban building environments by combining massive street view imagery and machine learning, for instance, the street quality, greening proportion, workable level and entertainment service ability^18–22. Generally, they construct a mapping between the objective objectives in street image and subjective street quality assessment, thus realizing an automatic quality prediction using machine learning. Similarly, for rural housing quality evaluation, people evaluate the housing quality principally through observing physical objectives and subjective perception in reality. For instance, a rural house with a tiled façade, broad garden, and rich household facilities is more likely to be regarded as good quality. Therefore, it is viable to evaluate housing quality in rural China at a large-scale and fine-granularity if the nationwide rural house images could be collected.

To fill this gap, we establish a crowed-sourcing platform called Rural Image Clap, as Fig. 1(a) demonstrated. It allows villagers to share their surrounding rural images, including houses, farmland, ponds, gardens, roads, etc. Meanwhile, users had the accessibility of assessing the housing quality manually according to the house images. As a foregoing description, Fig. 1(b) exemplifies typical rural housing with high quality, which possesses luxurious external decoration, clean and broad gardens and a huge residential area. To date, over 1 million rural images have been shared, covered 29 provinces in China. In conclusion, there is an emerging opportunity that use large-scale and massive rural house images to evaluate housing quality throughout China villages.

In this paper, we propose a framework that evaluates rural housing quality by combining deep learning and crowd-sourcing images. As Fig. 2 shown, firstly, massive rural house images and their manual quality assessment are captured to train a deep learning model. Then raw house images are input into the well-trained model to predict their housing quality. Ultimately, the map of rural housing quality in rural China is depicted. Based on the predicted result, we can realize the spatial distribution pattern of the rural house with different quality in capacious and develop-unbalanced China and give a political suggestion for dealing with inequality of living conditions and promoting sustainable development in the rural area.

Mapping based on deep-learning prediction

Leveraging 15,700 rural housing images and their manual quality assessment from Rural Image Clip, we train a deep learning model of DenseNet to automatically predict the housing quality of 50,000 raw rural house images from 10,000 villages. As a result, the DenseNet achieves an R² of 0.76 and a ${MSE}_{avg}$ of 0.13 respectively, that means a good performance for housing quality prediction.

In the image scale, the average, standard deviation, maximum and minimum value of predicted housing quality of 10,000 house images is 5.81, 0.53, 7.81 and 4.02, respectively. To visualize its spatial distribution in China, the predicted housing quality is aggregated to the village scale displayed in Fig. 3.

Spatial pattern of housing quality in rural China with multi-scales

In general, it could be noticed a noticeable spatial variation in housing quality distribution within southern and northern China. With the Qinling Mountain- Huaihe River, as the dividing line between south and north, the rural housing quality in northern China is almost middle and low. In contrast, the high-quality rural houses in the southern villages become intensive and dense (Fig. 4a).

In addition, when we plot the“Hu Huanyong Line” in the map, which is a significant proxy to differentiate the socio-economic development in China²³, rural housing quality in the west is conspicuously lower compared with the east. In specific, high and middle-high quality villages are rare in east-south villages unless junction, but gather in south-eastern China (Fig. 4b).

These two spatial patterns of rural housing quality distribution on a nation scale positively correlate with the socio-economic development status. As we know, the huge economic imbalance in different areas is significant in China. Previously, people usually paid major attention to the development imbalance between urban and rural areas instead of the inherent difference between villages²⁴. The predicted result illustrates that rural villagers in richer rural areas construct better-quality houses than in under-developed areas.

Furthermore, the rural housing quality is aggregated to the province-scale as Fig. 5a shown. We can notice the rural housing quality of the province grows from edge provinces to provinces along the Yangtze River. It’s worth noting that the pattern is partly different from the regional wealth level, such as the two most well-developed metropolitan areas, “Beijing-Tianjin-Hebei” and Pearl River Delta, didn’t construct high-quality villages like the Yangtze River Delta. In conclusion, we don’t think villagers in the richer province will build higher-quality rural houses.

In a finer-granularity view, we likewise find that rural housing quality is affected by other factors, not only regional wealth. For example, in Guangdong province, the wealthiest area is the Pearl River Delta; whereas the high-quality rural houses mainly gather in Leizhou Peninsula, which belongs to the poor area (Fig. 5c). Some literature proposes an opinion that Cantonese has a tradition of building well-decorated and tall houses in their hometown because they commonly have a hometown complex²⁵ and are eager for identity and respect for their household wealth from neighbors and relatives²⁶.

On the contrary, in Jiangsu Province, the rural houses in southern Jiangsu, a famous wealthy district, have a corresponding higher quality, compared with a relatively poor area of northern Jiangsu (Fig. 5b). Consequently, despite the rural house being the paramount representative of wealth, other impact factors such as culture, convention, and territoriality also constantly affect the spatial pattern of rural housing quality distribution.

In this paper, we collect ten thousand rural house images from a crowd-sourcing platform Rural Image Clap and use deep learning to predict the housing quality in Rural China with these images. According to the predicted result, we depict a nationwide map of rural housing distribution and further unveil its spatial pattern on multiple scales.

As precious researches mentioned, rural housing quality is mainly decided by local wealth and socio-economic development level based on either little-sample questionnaires and field surveying or coarse-granularity general statistics, thus lack of a holistic and elaborated cognition. Through the nationwide predicted result, we find out that rural housing quality conforms with the regional wealth on the whole, but other factors affect how much incomes villagers invest is invested in constructing houses, like culture in a different area. This achievement enhances our understanding of the status of villagers’ living condition in capacious rural China, which complement the research gap in rural housing quality.

Otherwise, there are still some limitations in the study. Firstly, the rural image assessment completely depends on subjective perception from users all over the country; thus, if the south users occupy the majority, the north rural house may be under-estimated compared with south houses with the same investment, and vice versa. The bias of crowd-sourcing images likely affects the predicted reliability. Besides, deep learning is a black-box model that we don’t figure out why it derives the housing quality instead of an end-to-end mapping between images and quality scores. In the future, we will discover the interpretability of deep learning for housing quality prediction to ensure which features contribute to the specific housing scores.

Deep Neural Network

As described before, Rural Image Clap generates bulk of rural housing images and partial manual assessment of housing quality from users. Therefore, it provides a precious opportunity to use Deep Learning to automatically and at scale predict housing quality based on these images in rural China. Finally, it reveals the status of rural wealth and development in China.

In brief, Deep Learning could extract the high-dimensional feature from input images using a deep neural network and construct the mapping between features and housing quality using a full-connected layer. Thus, effectively extracting features from images determines the predicting accuracy. Among various deep learning models such as AlexNet²⁷, VGG²⁸, and ResNet²⁹, DenseNet³⁰ gets a striking success in image processing tasks owing to its strong feature extraction ability. Therefore, DenseNet is adopted in rural housing quality prediction in the paper.

The architecture of DenseNet is shown in Fig. 6. It consists of a single convolutional layer, four Dense Blocks, three Transition layers and one full-connected layer in sequence. Specifically, as Fig. 6 shows, the Dense Block comprises several modules with two convolutional kernels of different sizes. Furthermore, these modules are connected by “Dense Connection,” which entirely takes advantage of shallow convolution features to enhance model performance. Otherwise, the Transition layer connects the adjacent Dense Blocks to deliver the extracted features³⁰. Finally, constantly convolved high-dimensional features are input to a full-connected layer to predict the housing quality.

For Deep Neural Network, the predicted accuracy is determined by the model's parameters; therefore, to obtain the optimal value of parameters, the assessed house images and their quality scores are used to train the model to update parameters, namely Back Propagation until the loss function achieves convergence. In this study, we use the loss function of Mean Square Error (MSE) to measure the deviation between predicted and true house quality in each iteration, which is formulated by Eq. 1.

$$MSE=\frac{{\sum }_{j=1}^{10}\frac{{\sum }_{i=1}^{n}({\widehat{y}}_{i}^{j}-{y}_{i}^{j}{)}^{2}}{n}}{10}$$

Where, owing to each house image possess multiple quality scores from 1 to 10, thus ${\widehat{y}}_{i}^{j}$ and ${y}_{i}^{j}$ is respectively the predicted and true normalized frequency of score $j(j\in [\text{1,10}\left]\right)$ of image $i(i\in [1,n\left]\right)$.

After model training, the performance of the trained DenseNet is evaluated by${MSE}_{avg}$ and ${R}^{2}$ formulated in Eq. 2 to Eq. 4.

$${\widehat{y}}_{i}^{avg}=\frac{{\sum }_{j=1}^{10}{\widehat{y}}_{i}^{j}\times j}{10}$$

$${MSE}_{avg}=\frac{{\sum }_{i=1}^{n}({\widehat{y}}_{i}^{avg}-{y}_{i}^{avg}{)}^{2}}{n}$$

$${R}^{2}=1-\frac{\sum _{i=1}^{n}({{\widehat{y}}_{i}^{avg}-{y}_{i}^{avg})}^{2}}{\sum _{i=1}^{n}({{\stackrel{-}{y}}^{avg}-{y}_{i}^{avg})}^{2}}$$

Where, ${\widehat{y}}_{i}^{avg}$ and ${y}_{i}^{avg}$are the predicted and true weighted average of quality scores of images $i$ respectively; ${\stackrel{-}{y}}^{avg}$ is the average of ${y}_{i}^{avg}$.

In detail, ${R}^{2}$ can examine the fitting degree between dependent and independent variables of the model, where the result of 1 demonstrates a perfect fit, and it means a reliable model for predictions. ${MSE}_{avg}$ elucidates the average deviations between the average of predicted and true value.

Experimental data and set

Based on the Rural Image Clap, we collect 15,700 rural house images after filtering among all shared rural images covering 28 provinces in China. Meanwhile, these house images are assessed manually by at least 15 users. Specifically, (1) users subjectively give a quality score from 1(the worst) to 10(the best) for each house image; (2) all scores of each image are calculated to obtain normalized frequencies. Otherwise, 50,000 raw rural house images covering 10000 villages are collected without manual assessment.

The average score for all assessed 15,700 rural house images is 5.7, while the highest and lowest scores are 8.7 and 3.2, respectively. Besides, the whole housing quality scores follow a normal distribution with a standard deviation of 0.85. Taking 9 rural house images as a typical example, as Fig. 7 displayed, the rural houses with a high floor, luxuriant decoration, and wall-embraced gardens are assessed as high quality. On the contrary, the low-quality houses represent a primitive, fragile and unsafe sense without the ability to resist storm, besides, they rarely possess external leisure space like a garden but face the roads directly.

In deep learning, these 15,700 assessed images are divided into training, validation and test sets with 80%, 20%, and 10%, respectively. In addition, some hyper-parameters of the DenseNet model is set as below: batchsize is 32, epochs are 100, and the learning rate is initially 1×10 − 5 and adaptively adjusts with the decreasing degree of 0.1.

Acknowledgments: We thank the anonymous reviewers for their valuable suggestions.

Author contributions: X.L. and W.X conceived of the research question. Y.C. and W.D. organized the volunteer to assess rural images. W.X. and Y.W designed and trained the deep learning model. W.X, and Y.C. processed the data. W.X., Y.G and L.C interpreted the result and drafted the manuscript.

Availability of Data and Materials: The dataset and code for training models during the current study are available in the github repository https://github.com/Tutu-wq/housingquality.git.

Competing interests: The authors declare that they have no competing interests.

Ahrens, K. A., Haley, B. A., Rossen, L. M., Lloyd, P. C. & Aoki, Y. Housing assistance and blood lead levels: children in the United States, 2005–2012. Am. journal public health 106, 2049–2056 (2016).
Park, A. & Wang, S. China’s poverty statistics. China Econ. Rev. 12, 384–398 (2001).
Ravallion, M. & Chen, S. China’s (uneven) progress against poverty. In Governing rapid growth in China, 65–111 (Routledge, 2009).
Long, H., Li, Y. & Liu, Y. Analysis of evolutive characteristics and their driving mechanism of hollowing villages in china. Acta Geogr. Sinica 64, 1203–1213 (2009).
Sutherland, D. & Yao, S. Income inequality in china over 30 years of reforms. Camb. J. Reg. Econ. Soc. 4, 91–105 (2011).
Assembly, G. United nations conference on housing and sustainable urban development (habitat iii) regional report for Africa: transformational housing and sustainable urban development in Africa. In 2015 United Nations Conference on Housing and Sustainable Urban Development (Habitat III) regional report for Africa: transformational housing and sustainable urban development in Africa Search in (2015).
McKinley, T. & Wang, L. N. Housing and wealth in rural China. China Econ. Rev. 3, 195–211 (1992).
Herbers, D. J. & Mulder, C. H. Housing and subjective well-being of older adults in Europe. J. Hous. Built Environ. 32, 533–558 (2017).
Adams, J. S. The meaning of housing in America. Annals Assoc. Am. Geogr. 74, 515–526 (1984).
Lu, M. Determinants of residential satisfaction: Ordered logit vs. regression models. Growth change 30, 264–287 (1999).
Ren, H., Folmer, H. & Van der Vlist, A. J. The impact of home ownership on life satisfaction in urban china: a propensity score matching analysis. J. Happiness Stud. 19, 397–422 (2018).
Wang, Y., Li, Y., Huang, Y., Yi, C. & Ren, J. Housing wealth inequality in china: An urban-rural comparison. Cities 96, 102428 (2020).
Corsi, D. J., Neuman, M., Finlay, J. E. & Subramanian, S. Demographic and health surveys: a profile. Int. journal epidemiology 41, 1602–1613 (2012).
Ren, H., Yuan, N. & Hu, H. Housing quality and its determinants in rural China: a structural equation model analysis. J. Hous. Built Environ. 34, 313–329 (2019).
De Brauw, A., Huang, J., Rozelle, S., Zhang, L. & Zhang, Y. The evolution of China’s rural labor markets during the reforms. Dep. Agric. Resour. Econ. UC Davis Work. Pap. (2002).
Wang, H., Su, F., Wang, L. & Tao, R. Rural housing consumption and social stratification in transitional China: Evidence from a national survey. Hous. Stud. 27, 667–684 (2012).
Tusting, L. S. et al. Mapping changes in housing in sub-Saharan Africa from 2000 to 2015. Nature 568, 391–394 (2019).
Gebru, T. et al. Using deep learning and google street view to estimate the demographic makeup of neighborhoods across the United States. Proc. Natl. Acad. Sci. 114, 13108–13113 (2017).
Suel, E., Polak, J. W., Bennett, J. E. & Ezzati, M. Measuring social, environmental and health inequalities using deep learning and street imagery. Sci. reports 9, 1–10 (2019)
Law, S., Paige, B. & Russell, C. Take a look around: using street view and satellite images to estimate house prices. ACM Transactions on Intell. Syst. Technol. (TIST) 10, 1–19 (2019).
Yao, Y. et al. A human-machine adversarial scoring framework for urban perception assessment using street-view images. Int. J. Geogr. Inf. Sci. 33, 2363–2384 (2019).
Zhang, F., Hu, M., Che, W., Lin, H. & Fang, C. Framework for virtual cognitive experiment in virtual geographic environments. ISPRS Int. J. Geo-Information 7, 36 (2018).
Chen, D. et al. Exploring the spatial differentiation of urbanization on two sides of the hu huanyong line–based on nighttime light data and cellular automata. Appl. Geogr. 112, 102081 (2019).
Tao Yang, D. & Zhou, H. Rural-urban disparity and sectoral labour allocation in china. The J. Dev. Stud. 35, 105–133(1999).
Smith, L. & Mazzucato, V. Constructing homes, building relationships: Migrant investments in houses. Tijdschrift voor economische en sociale geografie 100, 662–673 (2009)
Fei, H.-t., Fei, X., Hamilton, G. G. & Zheng, W. From the soil: The foundations of Chinese society (Univ of CaliforniaPress, 1992).
Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks. Adv. Neural information processing systems 25 (2012).
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, 770–778 (2016).
Huang, G., Liu, Z., Van Der Maaten, L. & Weinberger, K. Q. Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, 4700–4708 (2017).

No competing interests reported.

Download PDF

Editorial decision: Major revision
29 Sep, 2022
Reviews received at journal
19 Sep, 2022
Reviewers agreed at journal
12 Sep, 2022
Reviewers invited by journal
12 Sep, 2022
Editor assigned by journal
12 Sep, 2022
Editor invited by journal
12 Sep, 2022
Submission checks completed at journal
12 Sep, 2022
First submitted to journal
18 Aug, 2022

You are reading this latest preprint version

Combining Deep Learning and Crowd-sourcing Images to Predict Housing Quality in Rural China

Status:

Version 1

Abstract

Figures

1 Introduction

2 Results

3 Discussion

4 Methods

Declarations

References

Additional Declarations

Status:

Version 1