Rate-accuracy trade-off In video classification with deep convolutional neural networks

Jubran, Mohammad K.; Alhabib, Abbas; Chadha, Aaron; Andreopoulos, Yiannis

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11889/6121

DC Field	Value	Language
dc.contributor.author	Jubran, Mohammad K.	en_US
dc.contributor.author	Alhabib, Abbas	en_US
dc.contributor.author	Chadha, Aaron	en_US
dc.contributor.author	Andreopoulos, Yiannis	en_US
dc.date.accessioned	2020-01-15T07:08:23Z	-
dc.date.available	2020-01-15T07:08:23Z	-
dc.date.issued	2018-10-01	-
dc.identifier.issn	10518215	-
dc.identifier.uri	http://hdl.handle.net/20.500.11889/6121	-
dc.description.abstract	—Advanced video classification systems decode video frames to derive the necessary texture and motion representations for ingestion and analysis by spatio-temporal deep convolutional neural networks (CNNs). However, when considering visual Internet-of-Things applications, surveillance systems and semantic crawlers of large video repositories, the video capture and the CNN-based semantic analysis parts do not tend to be colocated. This necessitates the transport of compressed video over networks and incurs significant overhead in bandwidth and energy consumption, thereby significantly undermining the deployment potential of such systems. In this paper, we investigate the trade-off between the encoding bitrate and the achievable accuracy of CNN-based video classification models that directly ingest AVC/H.264 and HEVC encoded videos. Instead of retaining entire compressed video bitstreams and applying complex optical flow calculations prior to CNN processing, we only retain motion vector and select texture information at significantly-reduced bitrates and apply no additional processing prior to CNN ingestion. Based on three CNN architectures and two action recognition datasets, we achieve 11%–94% saving in bitrate with marginal effect on classification accuracy. A model-based selection between multiple CNNs increases these savings further, to the point where, if up to 7% loss of accuracy can be tolerated, video classification can take place with as little as 3 kbps for the transport of the required compressed video information to the system implementing the CNN models	en_US
dc.publisher	IEEE	en_US
dc.relation.ispartof	IEEE Transactions on Circuits and Systems for Video Technology	en_US
dc.subject	Imaging systems - Classification	en_US
dc.subject	Video classification	en_US
dc.subject	Convolutional neural networks	en_US
dc.subject	Streaming technology (Telecommunications)	en_US
dc.subject	Streaming video	en_US
dc.title	Rate-accuracy trade-off In video classification with deep convolutional neural networks	en_US
dc.type	Article	en_US
newfileds.department	Engineering and Technology	en_US
newfileds.item-access-type	bzu	en_US
newfileds.thesis-prog	none	en_US
newfileds.general-subject	none	en_US
dc.identifier.doi	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
dc.identifier.doi	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
dc.identifier.doi	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
dc.identifier.doi	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
dc.identifier.doi	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
dc.identifier.doi	10.1109/TCSVT.2018.2887408	-
dc.identifier.doi	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
dc.identifier.doi	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
dc.identifier.doi	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
dc.identifier.doi	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
dc.identifier.doi	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
dc.identifier.scopus	2-s2.0-85058871158	-
dc.identifier.url	https://api.elsevier.com/content/abstract/scopus_id/85058871158	-
item.grantfulltext	open	-
item.fulltext	With Fulltext	-
Appears in Collections:	6. BZU Dataset Collection 6. BZU Dataset Collection

Files in This Item:

File	Description	Size	Format
2018 CSVT.pdf		1.94 MB	Adobe PDF	View/Open

Show simple item record

Page view(s)

192

checked on Apr 14, 2024

Download(s)

90

checked on Apr 14, 2024

Google Scholar^TM

Check

Files in This Item:

Page view(s)

Download(s)

Google ScholarTM

Altmetric

Google Scholar^TM