Applications of Real-time Object Detection on NVIDIA ...€¦ · Input Image Dimension VOC2007 mAP...

JK Jung2018/05

Applications of Real-time Object Detection on NVIDIA Jetson TX2

自主創新Rapid Innovation

綠能環保Sustainable Energy

雲端應用Cloud Solutions

移動生活Mobile Lifestyle

新興市場Emerging Markets

JK Jung (鍾俊魁) @ IIoT Center AI Team, Inventec

• Blog: https://jkjung-avt.github.io/

• GitHub: https://github.com/jkjung-avt/

NVIDIA JETSON TX2 FORSMART CITY APPLICATIONS

Inventec Confidential 3

NVR/Server

ControlCenter

Smart Camera (IVS)

SOS Emergency

AI Gateway

LED Light

Solar Power

Battery

Display Panel

Smart Streetlights

Sensors

Real Deployment at Taoyuan Industrial Park

Illegal Parking Detection

Smart Streetlight

IP-CAM * 2

WiFiAntenna

IoT Gateway

IVS (TX2)

More Deployment Cases

Parking Lot Vehicle CountingTraffic Counting

Traffic Counting Dashboard (Control Center)

WeeklyReport

HourlyReport

DEVELOPING OBJECT DETECTION ALGORITHMS ON NVIDIA JETSON TX2

Faster R-CNN (FRCN)

Courtesy of https://blog.csdn.net/majinlei121/article/details/53870433

Single Shot Multibox Detector (SSD)

Courtesy of https://arxiv.org/pdf/1512.02325.pdf

Applying Object Detection Models on Jetson TX2

• To run Faster R-CNN on Jetson TX2: https://jkjung-avt.github.io/faster-rcnn/

• To run SSD on Jetson TX2: https://jkjung-avt.github.io/ssd/

• Observations:– Faster R-CNN is more accurate and could pick up smaller objects

– But Faster R-CNN is too slow (1~2 fps) for real-time edge analytics

– Training with more data does improve accuracy (mAP) of the models

• To improve inference speed of the object detection models:– Using faster CNN feature extractors

– Applying TensorRT: https://developer.nvidia.com/tensorrt

– Designing the model with less anchor boxes

– Trade-off (input image size) between mAP and inference time

Input Image Dimension

VOC2007 mAP

Inference Speed on Jetson TX2

Comments

VGG16 (original) 1000x600 0.69+ 900 ms

GoogLeNet 1000x600 0.69 480 ms

GoogLeNet +TensorRT

1280x720 0.69 200 ms

Faster R-CNN

SSDInput Image Dimension

VOC0712 mAP

Inference Speed on Jetson TX2

Comments

VGG16 (original) 300x300 0.72 160 ms

VGG16 + TensorRT

300x300 0.72 75 ms

GoogLeNet 300x300 0.70 60 ms

GoogLeNet +TensorRT

300x300 0.70 28 ms > 30 fps

FUTURE DIRECTIONS

Future Directions

• People counting and tracking

• Boat/vessel counting at the harbor

• Water level monitoring (flooding alert)

• More advanced event detection about people:– Fight

– Crime, robbery, etc.

– Fall and anesthesia detection for elderly

• More advanced event detection for vehicles and roads:– Traffic collision

– Unloading cargos from trucks or vans

– Scattered material, or wandering animals

– Road construction

Anomaly Detection

THANK YOU!

Questions and Answers

Applications of Real-time Object Detection on NVIDIA ...€¦ · Input Image Dimension VOC2007 mAP...

Documents

Transcript of Applications of Real-time Object Detection on NVIDIA ...€¦ · Input Image Dimension VOC2007 mAP...

A Simple and Light-weight Attention Module for ... · classi cation, VOC2007/MS-COCO detection, super res-olution and scene parsing with various architectures in-cluding mobile-oriented

VGG16 Transfer Learning Architecture for Salak Fruit ...

SISTEMA DE ARMARIOS - Industrial and Electrical ... Soporte modular, 1000x600, 24 módulos 1 72.64 7 8713574124639 ACM10080R5 Soporte modular, 1000x800, 35 módulos 1 79.77 7 8713574124486

The PASCAL Visual Object Classes Challenge 2007 (VOC2007) Development Kit · 2015-07-18 · 1 Challenge The goal of this challenge is to recognize objects from a number of visual

Terminal Brain Damage - usenix.org · B-Slim 82.19 197K !"# 93 % 46.7% B-Dropout 81.18 776K !"# 94 % 40.5% B-D-Norm 80.17 778K !"# 97 % 45.9% AlexNet 83.96 2.5M !"# 96 % 47.3% VGG16

傲睿智存 高效的视频处理与AI - Xilinx...支持的AI网络 YoloV3, Densebox, Resnet, MobilenetV1-SSD，VGG16, InceptionV3/V4 工具链：DNNDK(深度神经网络开 发套件)

Python を用いたオープンソース ソフトウェア (OSS) 活用におけ … · (Ex. Python では2 系or 3 系，D/L-CNN ではAlexNet or VGG16 or Inception, など) そもそもコーディングができず、GUI

Classification of Brain Tumor by Combination of Pre ... · Classification of Brain Tumor by Combination of Pre-Trained VGG16 CNN 14 Introduction Magnetic Resonance Imaging (MRI) is

Biurko proste proste 4 Zamówienie: 85792/2016/AP/MEB Szafka 1 Nr Wymiar Nazwa 1 702x596 Bok prawy 2 702x596 Bok lewy 4 1000x600 Blat 6 282x960 Ściana tylna Biurko proste 5 Zamówienie:

D-22 CNN MILを用いた弱教師あり領域分割img.cs.uec.ac.jp/pub/conf15/151127shimok_3_ppt.pdf · VGG16のBPによるサリエンシーマップ 目的 評価 D-22 CNNを用いた弱教師あり領域分割

Plankton Classiﬁcation Using VGG16 Networknoiselab.ucsd.edu/ECE228/FinalProjects/Group16.pdf · tional neural networks ”learn”. 1. Introduction ... on machine learning techniques

農業における深層学習の活用 - SWEST...Convolutional Neural Networkとは 顔画像を入力した時の例（VGG16の中間層出力） W 1,1 W 1,2 W 2,1 W 2,2 ・・・

Adaptive Multi-Scale Information Flow for Object Detection · fc6 and fc7 with convolutional layers, and add new layers (conv6_1 and conv6_2) after the VGG16. These modiﬁed and

Gradient Forward-Propagation for Large-Scale Temporal Video … · 2021. 6. 24. · BP + 3D VGG16 1:9 1 BP + 3D VGG16 (Remat) 1:5 1 BP + Causal 3D VGG16 2:4 1 BP + I3D [2] 2:9 1 BP

Analyzing Neural Language Models Introduction · Deep Learning Tidal Wave 13 VGG16 Inception ResNet (34 layers above; up to 152 in paper) Transfer Learning “We use features extracted

Can AI help in screening Viral and COVID-19 pneumonia? · pneumonia using pre-trained ImageNet models [33] and their ensembles. A customized VGG16 model was used by Xianghong et al.

Comparison of FairMOT-VGG16 and MCMOT Implementation for ...

Plankton Classiﬁcation Using VGG16 Network - …noiselab.ucsd.edu/ECE285/FinalProjects/Group16.pdfPlankton Classiﬁcation Using VGG16 Network Lucas Tindall UCSD ltindall@ucsd.edu

cs230.stanford.educs230.stanford.edu/projects_spring_2019/reports/18681618.pdf · Tool detection:Used Fast-RCNN for spatial detection of surgical tools and VGG16 for classification

Malicious Software Classiﬁcation using VGG16 …...Malicious Software Classiﬁcation using VGG16 Deep Neural Network’s Bottleneck Features Edmar Rezende y, Guilherme Ruppert ,

傲睿智存高效的视频处理与AI - Xilinx...支持的AI网络 YoloV3, Densebox, Resnet, MobilenetV1-SSD，VGG16, InceptionV3/V4 工具链：DNNDK(深度神经网络开发套件)

Python を用いたオープンソースソフトウェア (OSS) 活用におけ … · (Ex. Python では2 系or 3 系，D/L-CNN ではAlexNet or VGG16 or Inception, など) そもそもコーディングができず、GUI

D-22 CNN MILを用いた弱教師あり領域分割img.cs.uec.ac.jp/pub/conf15/151127shimok_3_ppt.pdf · VGG16のBPによるサリエンシーマップ目的評価 D-22 CNNを用いた弱教師あり領域分割

農業における深層学習の活用 - SWEST...Convolutional Neural Networkとは顔画像を入力した時の例（VGG16の中間層出力） W 1,1 W 1,2 W 2,1 W 2,2 ・・・