Articles | Volume 18, issue 4
https://doi.org/10.5194/essd-18-2609-2026
https://doi.org/10.5194/essd-18-2609-2026
Data description article
 | 
13 Apr 2026
Data description article |  | 13 Apr 2026

BuildingSense: a new multimodal building function classification dataset

Pengxiang Su, Runfei Chen, Heng Xu, Wei Huang, Xinling Deng, Songnian Li, Wanglin Yan, Hangbin Wu, and Chun Liu

Related authors

A Multi-Scenario Dataset for Long-Term Indoor Localization and Pedestrian Behavior Analysis in Dynamic Environments
Faezeh Sadat Mortazavi, Junyi Wei, Tim Schimansky, Hangbin Wu, Claus Brenner, and Monika Sester
Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLVIII-1-W6-2025, 155–162, https://doi.org/10.5194/isprs-archives-XLVIII-1-W6-2025-155-2025,https://doi.org/10.5194/isprs-archives-XLVIII-1-W6-2025-155-2025, 2025
Incremental Crowd-Source Data Fusion and Map Update Method Based on Driving Data for Traffic Signs
Haopeng Hu, Hangbin Wu, Shengke Huang, Wei Huang, and Chun Liu
Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLVIII-G-2025, 641–647, https://doi.org/10.5194/isprs-archives-XLVIII-G-2025-641-2025,https://doi.org/10.5194/isprs-archives-XLVIII-G-2025-641-2025, 2025
Smart Building Digital Twin for Interior Water Distribution System Management
Gopika Rajan and Songnian Li
Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLVIII-4-2024, 373–379, https://doi.org/10.5194/isprs-archives-XLVIII-4-2024-373-2024,https://doi.org/10.5194/isprs-archives-XLVIII-4-2024-373-2024, 2024
Exploring Spatiotemporal Interaction Patterns of Dockless Shared Bicycle Networks: A Case Study of Shenzhen, China
Xiaoqian Hu, Yang Chen, Shishuo Xu, and Songnian Li
ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., X-4-2024, 169–174, https://doi.org/10.5194/isprs-annals-X-4-2024-169-2024,https://doi.org/10.5194/isprs-annals-X-4-2024-169-2024, 2024
The Concept of Levels of Detail for 3D Niche Models in CityGML
Su Yang, Jun Chen, Miaole Hou, Hongchao Fan, Songnian Li, and Yingkui Sun
ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., X-4-2024, 419–425, https://doi.org/10.5194/isprs-annals-X-4-2024-419-2024,https://doi.org/10.5194/isprs-annals-X-4-2024-419-2024, 2024

Cited articles

Arribas-Bel, D. and Fleischmann, M.: Spatial signatures – Understanding (urban) spaces through form and function, Habitat Int., 128, 102641, https://doi.org/10.1016/j.habitatint.2022.102641, 2022. a
Azimi, S. M., Henry, C., Sommer, L. W., Schumann, A., and Vig, E.: SkyScapes – Fine-grained semantic understanding of aerial scenes, 2019 IEEE/CVF Int. Conf. Comput. Vis. (ICCV), 7392–7402, https://api.semanticscholar.org/CorpusID:207998372 (last access: 20 February 2026), 2019. a
Bommasani, R., Hudson, D. A., and Ehsan Adeli, E. A.: On the opportunities and risks of foundation models, arXiv [preprint] https://doi.org/10.48550/arXiv.2108.07258, 2022. a
Braun, M., Krebs, S., Flohr, F., and Gavrila, D. M.: EuroCity persons: A novel benchmark for person detection in traffic scenes, IEEE T. Pattern Anal. Mach. Intell., 41, 1844–1861, https://doi.org/10.1109/TPAMI.2019.2897684, 2019. a
Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A., Ziegler, D. M., Wu, J., Winter, C., Hesse, C., Chen, M., Sigler, E., Litwin, M., Gray, S., Chess, B., Clark, J., Berner, C., McCandlish, S., Radford, A., Sutskever, I., and Amodei, D.: Language models are few-shot learners, in: Proceedings of the 34th International Conference on Neural Information Processing Systems, NIPS '20, Curran Associates Inc., Red Hook, NY, USA, ISBN 9781713829546, 2020. a, b
Download
Short summary
The accessibility of building function is essential for urban research. We reviewed the recent work and concluded three limitations: few open-source datasets, coarse building function categories, and poor model interpretability with inadequate multimodal feature fusion. Thus, we created BuildingSense with fine-grained categories and multimodal data, and proved that the large model can be used for improving the interpretability of results, with three directions for enhancing their performance.
Share
Altmetrics
Final-revised paper
Preprint