Modeling Urban Air Quality Trend Surface Using Social Media Data

WANG Yandong¹, JING Tong¹, JIANG Wei¹, WANG Teng¹, FU Xiaokang¹

(1. State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China)

【Abstract】Air pollution has worsened with urban development in recent years. At present, urban air quality is monitored mainly by air quality monitoring stations. However, the number of stations is limited and air quality varies across urban areas, so stations alone cannot efficiently capture the distribution of air quality within a city. Using Sina Weibo data with location information, we propose a method for modeling an urban air quality trend surface by analyzing the correlation between microblogs on air pollution-related topics and air quality index (AQI) data from monitoring stations. The study shows that the method not only qualitatively indicates the relative air quality in different regions of the city, but also describes urban air quality in a quantitative and fine-grained way. This study assesses the feasibility of using a new type of large-scale data source to estimate air quality at any location in a city, and its findings are of great significance for reflecting the distribution of air quality and identifying areas with relatively severe air pollution.
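
The abstract does not state the functional form of the trend surface, so the following is only a minimal sketch of the general idea, assuming a second-order polynomial surface fitted by ordinary least squares. The function names (design_matrix, fit_trend_surface, predict), the coordinates, and the sample values are all hypothetical and synthetic; they are not taken from the paper, and the authors' actual fitting procedure may differ.

```python
import numpy as np


def design_matrix(x, y):
    """Build the columns of a second-order polynomial in x and y."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return np.column_stack([np.ones_like(x), x, y, x * x, x * y, y * y])


def fit_trend_surface(x, y, z):
    """Least-squares fit of z ~ b0 + b1*x + b2*y + b3*x^2 + b4*x*y + b5*y^2."""
    coeffs, *_ = np.linalg.lstsq(
        design_matrix(x, y), np.asarray(z, dtype=float), rcond=None
    )
    return coeffs


def predict(coeffs, x, y):
    """Evaluate the fitted surface at arbitrary locations."""
    return design_matrix(x, y) @ coeffs


# Synthetic sample points (made up for illustration, not real AQI data).
rng = np.random.default_rng(0)
xs = rng.uniform(0.0, 10.0, 50)
ys = rng.uniform(0.0, 10.0, 50)
true_coeffs = np.array([80.0, 2.0, -1.5, 0.3, 0.1, -0.2])
zs = design_matrix(xs, ys) @ true_coeffs

coeffs = fit_trend_surface(xs, ys, zs)
estimate = predict(coeffs, [5.0], [5.0])
```

In this sketch the fitted surface can be evaluated at any location, which mirrors the abstract's goal of estimating air quality away from monitoring stations; in practice the inputs would be an air quality indicator derived from geotagged microblogs rather than noise-free synthetic values.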

【Keywords】 social media; Sina Weibo; urban air quality; trend surface


【Funds】 National Natural Science Foundation of China, No. 41271399; China Special Fund for Surveying, Mapping and Geoinformation Research in the Public Interest, No. 201512015; Specialized Research Fund for the Doctoral Program of Higher Education, No. 20120141110036; National Key Technology Research and Development Program of China, No. 2012BAH35B03


(Translated by CHEN Ziyue)



This Article


CN: 42-1676/TN

Vol 42, No. 01, Pages 14-20

January 2017


Article Outline


  • 1 Establishment method of air quality trend surface
  • 2 Extraction of regions with relatively severe air pollution
  • 3 Conclusions
  • References