[FFmpeg-devel] [PATCH V8 3/3] lavfi: add filter dnn_detect for object detection
Guo, Yejun
yejun.guo at intel.com
Fri Apr 16 03:49:10 EEST 2021
> -----Original Message-----
> From: Guo, Yejun <yejun.guo at intel.com>
> Sent: April 13, 2021 13:14
> To: ffmpeg-devel at ffmpeg.org
> Cc: Guo, Yejun <yejun.guo at intel.com>
> Subject: [PATCH V8 3/3] lavfi: add filter dnn_detect for object detection
>
> Below are the example steps to do object detection:
>
> 1. download and install l_openvino_toolkit_p_2021.1.110.tgz from
> https://software.intel.com/content/www/us/en/develop/tools/openvino-toolkit/download.html
> or, we can get source code (tag 2021.1), build and install.
> 2. export LD_LIBRARY_PATH with openvino settings, for example:
> .../deployment_tools/inference_engine/lib/intel64/:.../deployment_tools/inference_engine/external/tbb/lib/
> 3. rebuild ffmpeg from source code with configure options:
> --enable-libopenvino
> --extra-cflags='-I.../deployment_tools/inference_engine/include/'
> --extra-ldflags='-L.../deployment_tools/inference_engine/lib/intel64'
> 4. download model files and test image
> wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/face-detection-adas-0001.bin
> wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/face-detection-adas-0001.xml
> wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/face-detection-adas-0001.label
> wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/images/cici.jpg
> 5. run ffmpeg with:
> ./ffmpeg -i cici.jpg -vf dnn_detect=dnn_backend=openvino:model=face-detection-adas-0001.xml:input=data:output=detection_out:confidence=0.6:labels=face-detection-adas-0001.label,showinfo -f null -
>
> We'll see the detection result as below:
> [Parsed_showinfo_1 @ 0x560c21ecbe40] side data - detection bounding boxes:
> [Parsed_showinfo_1 @ 0x560c21ecbe40] source: face-detection-adas-0001.xml
> [Parsed_showinfo_1 @ 0x560c21ecbe40] index: 0, region: (1005, 813) -> (1086, 905), label: face, confidence: 10000/10000.
> [Parsed_showinfo_1 @ 0x560c21ecbe40] index: 1, region: (888, 839) -> (967, 926), label: face, confidence: 6917/10000.
>
> Two faces are detected, with confidence 100% and 69.17%.
>
> Signed-off-by: Guo, Yejun <yejun.guo at intel.com>
> ---
> configure                   |   1 +
> doc/filters.texi            |  40 ++++
> libavfilter/Makefile        |   1 +
> libavfilter/allfilters.c    |   1 +
> libavfilter/vf_dnn_detect.c | 421 ++++++++++++++++++++++++++++++++++++
I will push the patch set tomorrow if there are no objections.
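
For anyone who wants to post-process the showinfo output quoted above, here is a minimal sketch of turning the detection lines into structured values. It is not part of the patch. The names `Detection` and `parse_detections` are hypothetical, and the regular expression assumes each detection is printed on a single log line in exactly the format shown in the example (the archive wraps those lines). The confidence is printed as a rational number, so 6917/10000 becomes 0.6917, i.e. 69.17%.

```python
import re
from dataclasses import dataclass

# Matches one detection line as printed in the example log above.
# Assumption: the whole detection is on one line (unwrapped).
DETECTION_RE = re.compile(
    r"index:\s*(\d+),\s*region:\s*\((\d+),\s*(\d+)\)\s*->\s*\((\d+),\s*(\d+)\),"
    r"\s*label:\s*([^,]+),\s*confidence:\s*(\d+)/(\d+)"
)


@dataclass
class Detection:
    index: int
    box: tuple          # (x0, y0, x1, y1), taken from "region: (x0, y0) -> (x1, y1)"
    label: str
    confidence: float   # numerator / denominator, in the range 0.0 .. 1.0


def parse_detections(log_text):
    """Return a list of Detection objects found in showinfo log text."""
    detections = []
    for m in DETECTION_RE.finditer(log_text):
        idx, x0, y0, x1, y1 = (int(m.group(i)) for i in range(1, 6))
        num, den = int(m.group(7)), int(m.group(8))
        detections.append(
            Detection(idx, (x0, y0, x1, y1), m.group(6).strip(), num / den)
        )
    return detections


# The two detection lines from the example, unwrapped.
SAMPLE_LOG = (
    "[Parsed_showinfo_1 @ 0x560c21ecbe40] index: 0, region: (1005, 813) -> "
    "(1086, 905), label: face, confidence: 10000/10000.\n"
    "[Parsed_showinfo_1 @ 0x560c21ecbe40] index: 1, region: (888, 839) -> "
    "(967, 926), label: face, confidence: 6917/10000.\n"
)

if __name__ == "__main__":
    for det in parse_detections(SAMPLE_LOG):
        print(f"{det.index}: {det.label} {det.confidence:.2%} box={det.box}")
```

In practice the log text would come from ffmpeg's stderr rather than from a hard-coded string; the sample is embedded only so the sketch runs on its own.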