[FFmpeg-user] How to use ocr filter

nicolab robelt2525 at gmail.com
Thu Sep 17 16:05:57 CEST 2015


When using the ocr filter, how do I write the recognized text to a file?
https://ffmpeg.org/ffmpeg-filters.html#ocr

img.png
<http://ffmpeg-users.933282.n4.nabble.com/file/n4672454/img.png> 

ffmpeg -f lavfi -i "movie=img.png,ocr=datapath=tessdata:language=eng,drawgraph=lavfi.ocr.text" out.png -y -loglevel 99
ffmpeg version 2.8.git Copyright (c) 2000-2015 the FFmpeg developers
  built with gcc 5.2.0 (GCC)
  configuration: --prefix=/mingw/i686-w64-mingw32 --enable-version3 --enable-gpl --enable-memalign-hack --enable-w32threads --enable-libtesseract --disable-outdev=sdl --disable-ffplay --disable-ffprobe --disable-ffserver --disable-doc --disable-htmlpages --disable-manpages --disable-podpages --disable-txtpages --disable-debug --pkg-config-flags=--static
  libavutil      55.  2.100 / 55.  2.100
  libavcodec     57.  2.100 / 57.  2.100
  libavformat    57.  2.100 / 57.  2.100
  libavdevice    57.  0.100 / 57.  0.100
  libavfilter     6.  4.100 /  6.  4.100
  libswscale      4.  0.100 /  4.  0.100
  libswresample   2.  0.100 /  2.  0.100
  libpostproc    54.  0.100 / 54.  0.100
Splitting the commandline.
Reading option '-f' ... matched as option 'f' (force format) with argument 'lavfi'.
Reading option '-i' ... matched as input file with argument 'movie=img.png,ocr=datapath=tessdata:language=eng,drawgraph=lavfi.ocr.text'.
Reading option 'out.png' ... matched as output file.
Reading option '-y' ... matched as option 'y' (overwrite output files) with argument '1'.
Reading option '-loglevel' ... matched as option 'loglevel' (set logging level) with argument '99'.
Finished splitting the commandline.
Parsing a group of options: global .
Applying option y (overwrite output files) with argument 1.
Applying option loglevel (set logging level) with argument 99.
Successfully parsed a group of options.
Parsing a group of options: input file movie=img.png,ocr=datapath=tessdata:language=eng,drawgraph=lavfi.ocr.text.
Applying option f (force format) with argument lavfi.
Successfully parsed a group of options.
Opening an input file: movie=img.png,ocr=datapath=tessdata:language=eng,drawgraph=lavfi.ocr.text.
detected 4 logical cores
[Parsed_movie_0 @ 02438040] Setting 'filename' to value 'img.png'
Probing image2 score:50 size:929
Probing mp3 score:1 size:929
Probing png_pipe score:99 size:929
[png_pipe @ 02438480] Format png_pipe probed with size=2048 and score=99
[png_pipe @ 02438480] Before avformat_find_stream_info() pos: 0 bytes read:929 seeks:0
[png_pipe @ 02438480] 0: start_time: -9223372036854.775 duration: -9223372036854.775
[png_pipe @ 02438480] stream: start_time: -9223372036854.775 duration: -9223372036854.775 bitrate=0 kb/s
[png_pipe @ 02438480] After avformat_find_stream_info() pos: 929 bytes read:929 seeks:0 frames:1
[Parsed_movie_0 @ 02438040] seek_point:0 format_name:(null) file_name:img.png stream_index:-1
[Parsed_ocr_1 @ 04813f80] Setting 'datapath' to value 'tessdata'
[Parsed_ocr_1 @ 04813f80] Setting 'language' to value 'eng'
[Parsed_ocr_1 @ 04813f80] Tesseract version: 3.02
[Parsed_drawgraph_2 @ 024375e0] Setting 'm1' to value 'lavfi.ocr.text'
[auto-inserted scaler 0 @ 048187c0] w:iw h:ih flags:'bilinear' interl:0
[Parsed_ocr_1 @ 04813f80] auto-inserting filter 'auto-inserted scaler 0' between the filter 'Parsed_movie_0' and the filter 'Parsed_ocr_1'
[AVFilterGraph @ 02437580] query_formats: 4 queried, 2 merged, 1 already done, 0 delayed
[auto-inserted scaler 0 @ 048187c0] picking yuv444p out of 15 ref:rgb24 alpha:0
[auto-inserted scaler 0 @ 048187c0] w:160 h:48 fmt:rgb24 sar:1/1 -> w:160 h:48 fmt:yuv444p sar:1/1 flags:0x2
[lavfi @ 024331e0] All info found
[lavfi @ 024331e0] 0: start_time: 0.000 duration: -9223372036854.775
[lavfi @ 024331e0] stream: start_time: 0.000 duration: -9223372036854.775 bitrate=0 kb/s
Input #0, lavfi, from 'movie=img.png,ocr=datapath=tessdata:language=eng,drawgraph=lavfi.ocr.text':
  Duration: N/A, start: 0.000000, bitrate: N/A
    Stream #0:0, 1, 1/25: Video: rawvideo, 1 reference frame (RGBA / 0x41424752), rgba, 900x256 [SAR 1:1 DAR 225:64], 1/25, 25 tbr, 25 tbn, 25 tbc
Successfully opened the file.
Parsing a group of options: output file out.png.
Successfully parsed a group of options.
Opening an output file: out.png.
Successfully opened the file.
[graph 0 input from stream 0:0 @ 04838fa0] Setting 'video_size' to value '900x256'
[graph 0 input from stream 0:0 @ 04838fa0] Setting 'pix_fmt' to value '28'
[graph 0 input from stream 0:0 @ 04838fa0] Setting 'time_base' to value '1/25'
[graph 0 input from stream 0:0 @ 04838fa0] Setting 'pixel_aspect' to value '1/1'
[graph 0 input from stream 0:0 @ 04838fa0] Setting 'sws_param' to value 'flags=2'
[graph 0 input from stream 0:0 @ 04838fa0] Setting 'frame_rate' to value '25/1'
[graph 0 input from stream 0:0 @ 04838fa0] w:900 h:256 pixfmt:rgba tb:1/25 fr:25/1 sar:1/1 sws_param:flags=2
[format @ 04838a60] compat: called with args=[rgb24|rgba|rgb48be|rgba64be|pal8|gray|ya8|gray16be|ya16be|monob]
[format @ 04838a60] Setting 'pix_fmts' to value 'rgb24|rgba|rgb48be|rgba64be|pal8|gray|ya8|gray16be|ya16be|monob'
[AVFilterGraph @ 04817400] query_formats: 4 queried, 3 merged, 0 already done, 0 delayed
Output #0, image2, to 'out.png':
  Metadata:
    encoder         : Lavf57.2.100
    Stream #0:0, 0, 1/25: Video: png, 1 reference frame, rgba, 900x256 [SAR 1:1 DAR 225:64], 1/25, q=2-31, 200 kb/s, 25 fps, 25 tbn, 25 tbc
    Metadata:
      encoder         : Lavc57.2.100 png
Stream mapping:
  Stream #0:0 -> #0:0 (rawvideo (native) -> png (native))
Press [q] to stop, [?] for help
Cliping frame in rate conversion by 0.000008
[output stream 0:0 @ 048391e0] EOF on sink link output stream 0:0:default.
No more output streams to write to, finishing.
[AVIOContext @ 048416e0] Statistics: 0 seeks, 1 writeouts
frame=    1 fps=0.0 q=-0.0 Lsize=N/A time=00:00:00.04 bitrate=N/A
video:2kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: unknown
Input file #0 (movie=img.png,ocr=datapath=tessdata:language=eng,drawgraph=lavfi.ocr.text):
  Input stream #0:0 (video): 1 packets read (921638 bytes); 1 frames decoded;
  Total: 1 packets (921638 bytes) demuxed
Output file #0 (out.png):
  Output stream #0:0 (video): 1 frames encoded; 1 packets muxed (1543 bytes);
  Total: 1 packets (1543 bytes) muxed
1 frames successfully decoded, 0 decoding errors
[AVIOContext @ 02438a80] Statistics: 929 bytes read, 0 seeks
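
For reference, a possible approach, sketched under assumptions rather than tested against this build: the ocr filter documentation says the recognized text is exported as the frame metadata key lavfi.ocr.text, and ffprobe can print frame metadata through -show_entries frame_tags=... The log above shows drawgraph taking 'lavfi.ocr.text' as its 'm1' key; drawgraph plots metadata values as a graph, so it is probably not the right tool for text. Note that the build above was configured with --disable-ffprobe, so this assumes a separate build that includes ffprobe and libtesseract. The function names ocr_graph and ocr_text and the output file name ocr.txt are made up for illustration.

```shell
# Build the lavfi filtergraph string: movie source followed by the ocr filter.
# Arguments: image file, tessdata directory, language code.
ocr_graph() {
  printf 'movie=%s,ocr=datapath=%s:language=%s' "$1" "$2" "$3"
}

# Print only the value of the lavfi.ocr.text frame tag for each frame.
# Assumes an ffprobe binary built with --enable-libtesseract is on PATH.
ocr_text() {
  ffprobe -v error -f lavfi -i "$(ocr_graph "$1" "$2" "$3")" \
    -show_entries frame_tags=lavfi.ocr.text \
    -of default=noprint_wrappers=1:nokey=1
}

# Usage (hypothetical output file name):
#   ocr_text img.png tessdata eng > ocr.txt
```

If this works as assumed, ocr.txt would contain whatever Tesseract recognized in img.png; the datapath and language values are the same ones used in the command above.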



-----
https://twitter.com/nico_lab
http://nico-lab.net/