基于残差神经网络改进的密集人群计数方法__GIS论文

软件名称：基于残差神经网络改进的密集人群计数方法
软件大小： 0.00 B
软件评级： ★★★★★★
开发商：史劲霖, 周良辰, 闾国年, 林冰仙
软件来源：《地球信息科学学报》
解压密码：www.gissky.net

资源简介

摘要：

为避免密集人群踩踏事件发生,从监控图像中准确获取密集人群人数信息非常重要。针对密集人群计数难度大、人群目标小、场景尺度变化大等特点,本文提出一种新型神经网络结构VGG-ResNeXt。本网络使用VGG-16的前10层作粗粒度特征提取器,使用改进的残差神经网络作为细粒度特征提取器。利用改进的残差神经网络“多通道,共激活”的特点,使得单列式人群计数神经网络获得了多列式人群计数网络的优点（即从小目标、多尺度的密集人群图像中提取更多人群特征）,同时避免了多列式人群计数网络训练难度大、结构冗余等缺点。实验结果表明本模型在UCF-CC-50数据集、ShangHaiTech B数据集和UCF-QNRF数据集中取得了最高精度,MAE指标分别优于其他同期模型7.5%、18.8%和2.4%,证明了本模型的在计数精度方面的有效性。本研究成果可以有效帮助城市管理,有效缓解公安疏导压力,保障人民生命财产安全。

关键词: 图像, 密集人群, 人群计数, 特征提取, 神经网络, 单列式神经网络, 改进残差结构

Abstract:

In order to avoid crowd trampling, it is very important to accurately obtain information on the number of crowds from surveillance images. Early crowd counting studies used a feature engineering approach, in which human-designed feature extraction algorithms were used to obtain features that represented the number of people to be counted. However, the counting accuracy of such methods is not sufficient to meet the practical requirements when facing heavily occluded counting scenes with large changes in scene scale. In recent years, with the development of neural network, breakthroughs have been made in image classifications, object detections, and other fields. Neural network methods have also advanced the accuracy and robustness of dense crowd counting. In view of the difficulty of counting dense crowds, small crowd targets, and large changes in scene scale, this paper proposes a new neural network structure named: VGG-ResNeXt. The features extracted by VGG-16 are used as general-purpose visual description features. ResNet has more hidden layers, more activation functions and has more powerful feature extraction capabilities to extract more features from crowd images. Improved residual structure ResNeXt expands on the base of ResNet to further enhance feature extraction capabilities while maintaining the same computing power requirements and number of parameters. Therefore, in this paper, the first 10 layers of VGG-16 are used as the coarse-grained feature extractor, and the improved residual neural network ResNeXt is used as the fine-grained feature extractor. With the improved residual neural network feature of "multi-channel, co-activation", the single-column crowd counting neural network obtains the advantages of the multicolumn crowd counting network (i.e., extracting more features from dense crowd images with small targets and multiple scales), while avoiding the disadvantages of the multicolumn crowd counting network, such as the difficulty of training and structural redundancy. The experimental results show that our model achieves the highest accuracy in the UCF-CC-50 dataset with a very large number of people per image, the ShangHaiTech PartB dataset with a sparse crowd, and the UCF-QNRF dataset with the largest number of images currently included. Our model outperforms other models in the same period by 7.5%, 18.8%, and 2.4%, respectively, in MAE in the above three datasets, demonstrating the effectiveness of the model in improving counting accuracy in dense crowds. The results of this research can effectively help city management, relieve the pressure on public security, and protect people's lives and property.

Key words: images, dense crowd, crowd counting, feature extraction, neural networks, single column-based CNN, improved ResNet

下载地址1

下载说明

·如果您发现该资源不能下载，请通知管理员.gissky@gmail.com

·为确保下载的资源能正常使用，请使用[WinRAR v3.8]或以上版本解压本站资源,缺省解压密码www.gissky.net ，如果是压缩文件为分卷多文件，请依次下载每一个文件，并按照顺序命名为1.rar,2.rar,3.rar...，然后鼠标右击1.rar解压.

·为了保证您快速的下载速度，我们推荐您使用[网际快车]等专业工具下载.

·站内提供的资源纯属学习交流之用,如侵犯您的版权请与我们联系.

快速通道

+ hot

资源简介

相关资源

下载说明