DOI QR코드

DOI QR Code

Crowd Activity Classification Using Category Constrained Correlated Topic Model

  • Huang, Xianping (College of Computer Science and Technology, Zhejiang University of Technology) ;
  • Wang, Wanliang (College of Computer Science and Technology, Zhejiang University of Technology) ;
  • Shen, Guojiang (College of Computer Science and Technology, Zhejiang University of Technology) ;
  • Feng, Xiaoqing (College of Information, Zhejiang University of Finance &Economics) ;
  • Kong, Xiangjie (School of Software, Dalian University of Technology)
  • Received : 2015.04.10
  • Accepted : 2016.09.22
  • Published : 2016.11.30

Abstract

Automatic analysis and understanding of human activities is a challenging task in computer vision, especially for the surveillance scenarios which typically contains crowds, complex motions and occlusions. To address these issues, a Bag-of-words representation of videos is developed by leveraging information including crowd positions, motion directions and velocities. We infer the crowd activity in a motion field using Category Constrained Correlated Topic Model (CC-CTM) with latent topics. We represent each video by a mixture of learned motion patterns, and predict the associated activity by training a SVM classifier. The experiment dataset we constructed are from Crowd_PETS09 bench dataset and UCF_Crowds dataset, including 2000 documents. Experimental results demonstrate that accuracy reaches 90%, and the proposed approach outperforms the state-of-the-arts by a large margin.

Keywords

References

  1. Ali S, Shah M., "Floor fields for tracking in high density crowd scenes[C]," Computer Vision-ECCV 2008, Springer Berlin Heidelberg, 1-14, 2008.
  2. Rodriguez M, Ali S, Kanade T., "Tracking in unstructured crowded scenes[C]," Computer Vision, in Proc. of 2009 IEEE 12th International Conference on. IEEE, 1389-1396, 2009.
  3. Bera A, Manocha D., "Realtime multilevel crowd tracking using reciprocal velocity obstacles[C]," Pattern Recognition (ICPR), in Proc. of 2014 22nd International Conference on. IEEE, 4164-4169, 2014.
  4. Fradi H, Dugelay J L., "Spatial and temporal variations of feature tracks for crowd behavior analysis[J]," Journal on Multimodal User Interfaces, 1-11, 2015.
  5. Shao J, Kang K, Loy C C, et al., "Deeply learned attributes for crowded scene understanding[C]," in Proc. of CVPR, 4657-4666, 2015.
  6. Wu S, Moore B E, Shah M., "Chaotic invariants of lagrangian particle trajectories for anomaly detection in crowded scenes[C]," Computer Vision and Pattern Recognition (CVPR), in Proc. of 2010 IEEE Conference on. IEEE, 2054-2060, 2010.
  7. Benabbas Y, Ihaddadene N, Djeraba C., "Motion pattern extraction and event detection for automatic visual surveillance[J]," Journal on Image and Video Processing, 7, 2011.
  8. Rodriguez M, Sivic J, Laptev I, et al., "Data-driven crowd analysis in videos[C]," Computer Vision (ICCV), in Proc. of 2011 IEEE International Conference on. IEEE, 1235-1242, 2011.
  9. YI S, WANG X, LU C et al., "L0 regularized stationary time estimation for crowd group analysis[C]," CVPR '14, 2219-2226, 2014.
  10. Blei D M, Lafferty J D., "A correlated topic model of science[J]," The Annals of Applied Statistics,1(1): 17-35, 2007. https://doi.org/10.1214/07-AOAS114
  11. Meyer D., "Support Vector Machines The Interface to libsvm in package e1071[J]," R News, 1(5):1-3, 2001.
  12. Popoola O P, Wang K., "Video-Based Abnormal Human Behavior Recognition-A Review[J]," IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS-PART C: APPLICATIONS AND REVIEWS, NOVEMBER 2012, 42(6):865-878, 2012. https://doi.org/10.1109/TSMCC.2011.2178594
  13. Li T, Chang H, Wang M et al., "Crowded scene analysis: A survey[J]," Circuits and Systems for Video Technology, IEEE Transactions on, 25(3): 367-386, 2015. https://doi.org/10.1109/TCSVT.2014.2358029
  14. Raghavendra R, Cristani M, Bue A D et al., "Anomaly Detection in Crowded Scenes: A Novel Framework Based on Swarm Optimization and Social Force Modeling[M]," Modeling, Simulation and Visual Analysis of Crowds, Springer New York, 2013.
  15. Fradi H, Dugelay J L., "Sparse Feature Tracking for Crowd Change Detection and Event Recognition[C]," Pattern Recognition (ICPR), in Proc. of 2014 22nd International Conference on. IEEE, 4116-4121, 2014.
  16. Alexei Gritai, Yaser Sheikh & Mubarak Shah, "On the use of Anthropometry in the Invariant Analysis of Human Actions [A]," in Proc. of IEEE Proceedings of Intelligent Conference on Pattern Recognition, 923-926, 2004.
  17. Wang X, Ma K T, Ng G W et al., "Trajectory analysis and semantic region modeling using nonparametric hierarchical bayesian models[J]," International journal of computer vision, 95(3): 287-312, 2011. https://doi.org/10.1007/s11263-011-0459-6
  18. Varadarajan J, Odobez J M., "Topic models for scene analysis and abnormality detection[C]," Computer Vision Workshops (ICCV Workshops), in Proc. of 2009 IEEE 12th International Conference on, IEEE, 1338-1345, 2009.
  19. T. Xiang and S. Gong, "Video behavior profiling for anomaly detection[J]," IEEE Trans. Pattern Anal. Mach. Intell, 30(5):893-908, May 2008. https://doi.org/10.1109/TPAMI.2007.70731
  20. O. Boiman andM. Irani, "Detecting irregularities in images and in video[C]," in Proc. of 10th IEEE Int. Conf. Comput. Vision, vol. 1:462-469, 2005.
  21. Hasan S, Ukkusuri S V, "Urban activity pattern classification using topic models from online geo-location data[J]," Transportation Research Part C: Emerging Technologies, 44: 363-381, 2014. https://doi.org/10.1016/j.trc.2014.04.003
  22. Blei D M, Ng A Y, Jordan M I, "Latent dirichlet allocation[J]," the Journal of machine Learning research, 3: 993-1022, 2003.
  23. Hofmann T., "Probabilistic latent semantic indexing[C]," in Proc. of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 50-57, 1999.
  24. X. Wang, X. Ma, and W. E. L., "Grimson. Unsupervised activity perception in crowded and complicated scenes using hierarchical bayesian models[J]," IEEE Trans. Pattern Anal.Mach. Intell, 31(3):539-555, Mar 2009. https://doi.org/10.1109/TPAMI.2008.87
  25. J. Li, S. Gong and T. Xiang, "Global behavior inference using probabilistic latent semantic analysis[C]," in Proc. of 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, 1338-1345, 2008.
  26. Brox T., "Optical Flow[M]," Computer Vision, Springer US, 565-569, 2014.
  27. CHANG C-C,LIN C-J., "Libsvm: A library for support vector machines[J]," ACM Transactions on Intelligent Systems and Technology (TIST), 2(3):27, 2011.
  28. PETS 2009 Benchmark Data. http://www.cvg.reading.ac.uk/PETS2009/a.html.