From: Matt L. <mat...@gm...> - 2008-09-02 16:57:22
|
Hi All,

I'm cleaning up vidl2 a bit in preparation for the eventual move to core. There are a few unresolved issues with the API that I'd like to get some feedback on.

First, I've added a num_frames() function to the vidl2_istream. This has been requested by users, and seems like it should be there. This function should return the number of frames in a video, or -1 for live streams and other streams of indeterminate length. I've already implemented this for vidl2_image_list_istream and vidl2_ffmpeg_istream. All other istreams currently return -1. I could use some help with implementing this in vidl2_dshow_file_istream from Miguel and in vidl2_v4l_istream from Brendan. Of course, anyone else familiar with these is welcome to chip in.

Second, I think we need to clear up the API regarding advance(), current_frame(), and read_frame(). The way it is now, advance() moves the stream pointer one frame ahead (usually without fully decoding the next frame), current_frame() decodes the current frame (if necessary) and returns it, and read_frame() calls advance() followed by current_frame(). It was also originally assumed that each istream could be opened in a state with the current frame pointing to an invalid frame before the first frame of the video. The reason for this was that a live stream could be opened without capturing any images until the first call of advance() (or read_frame()). The problem with opening in an invalid state is that when piping an istream into an ostream, the image size and other properties often need to be known before opening the ostream. This usually requires accessing the first frame of the istream for probing anyway. The probing process can lead to dropping the first frame, because read_frame() calls advance() again before calling current_frame(). So the questions I have are:

1) Should each istream always open in a valid state, so that a frame can be accessed with current_frame() and probed for properties? In the case of live streams we can always discard the initial frame by calling advance() right before the capture loop. However, this does require that capture devices be ready to capture when the stream is opened.

2) Should the redundant read_frame() function be removed? It may make iterating through streams easier for beginners, but it doesn't add any new functionality.

3) If read_frame() stays and each stream opens on the first frame, should read_frame() return the current frame and then advance, instead of the reverse? Think i++ instead of ++i. This way you can open an istream and then immediately call read_frame() without dropping the first frame.

There are still other lingering issues, like synchronous vs. asynchronous capture, but I think I'll leave that for another time.

Thanks,
Matt |
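[Editorial note: question 3 boils down to i++ versus ++i semantics for read_frame(). The toy sketch below makes the difference concrete. mock_istream and both read_frame variants are invented names for illustration only, and the stream is assumed (per question 1) to open with the first frame current.]

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy model of the istream frame cursor (not the real vidl2_istream API).
// The stream is assumed to open with frame 0 as the current frame.
struct mock_istream
{
  std::vector<int> frames;  // stand-in for decoded frame data
  std::size_t cur;          // index of the current frame

  explicit mock_istream(std::vector<int> f) : frames(f), cur(0) {}

  bool advance() { ++cur; return cur < frames.size(); }
  int current_frame() const { return frames[cur]; }

  // "++i" style (current behaviour): advance first, then return.
  // Called immediately after open(), this drops frame 0.
  int read_frame_pre() { advance(); return current_frame(); }

  // "i++" style (the proposal in question 3): return current, then advance.
  // Called immediately after open(), this returns frame 0.
  int read_frame_post() { int f = current_frame(); advance(); return f; }
};
```

With the "++i" style, probing frame 0 via current_frame() and then entering a read_frame() loop silently skips a frame; the "i++" style avoids that.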
From: Brendan M. <bre...@gm...> - 2008-09-03 02:03:21
|
Hi Matt, 2008/9/3 Matt Leotta <mat...@gm...>: > Hi All, > > I'm cleaning up vidl2 a bit in preparation for the eventual move to > core. There are a few unresolved issues with the API that I'd like to > get some feedback on. > > First, I've added a num_frames() function to the vidl2_istream. > This has been requested by users, and seems like it should be there. > This function should return the number of frames in a video or -1 for > live streams or other streams with indeterminate length. I've already > implemented this for vidl2_image_list_istream and > vidl2_ffmpeg_istream. All other istreams currently return -1. I > could use some help with implementing this in vidl2_dshow_file_istream > from Miguel and in vidl2_v4l_istream from Brendan. Of course, anyone > else familiar with these is welcome to chip in. That should be easy for vidl2_v4l_istream. Since these are always live streams AFAIK, returning -1 is the right thing to do. Can anyone think of any situation where this isn't true? There's another issue with the v4l stuff - v4l is now obsolete and has been for some time. It has been replaced by v4l2. Unfortunately, I haven't kept up with the developments. I don't think it would take much to convert over to v4l2, but it's something I just don't have the time for at the moment. AFAIK v4l still works though. > > Second, I think we need to clear up the API regarding advance(), > current_frame(), and read_frame(). The way it is now, advance() moves > the stream pointer one frame ahead (usually without fully decoding the > next frame), current_frame() decodes the current frame (if necessary) > and returns the frame, and read_frame() calls advance() followed by > current_frame(). It was also originally assumed that each istream > could be opened in a state with the current frame pointing to an > invalid frame before the first frame of the video. 
The reason for > this was that a live stream could be opened without capturing any > images until the first call of advance() (or read_frame()). The > problem with opening in an invalid state is that when piping an > istream into an ostream the image size and other properties often need > to be known before opening the ostream. This usually requires > accessing the first frame of the istream for probing anyway. The > probing process can lead to dropping of the first frame because > read_frame() calls advance() again before calling current_frame(). So > the questions I have are: > > 1) Should each istream always open in a valid state so that a frame > can be accessed with current_frame() and probed for properties? In > the case of live streams we can always discard the initial frame by > calling advance() right before the capture loop. However, this does > require that capture devices be ready to capture when the stream is > opened. > > 2) Should the redundant read_frame() function be removed? It may make > iterating through streams easier for beginners, but it doesn't add any > new functionality. > > 3) If read_frame() stays and each stream opens on the first frame, > should read_frame() return the current frame and then advance instead > of the reverse. Think i++ instead of ++i. This way you can open an > istream and then immediately call read_frame() without dropping the > first frame. > > There are still other lingering issues like synchronous vs > asynchronous capture, but I think I'll leave that for another time. Unfortunately, asynchronous capture has significant implications for all the questions above. For reading stuff off disk, it probably doesn't matter much, most of the following is flavoured by wanting to capture stuff from a camera in real time. For question 1), generally having the stream open in a valid state is a good idea as it is likely to lead to fewer errors in user code based on sequence dependencies. 
Rather than an API that says "first do this, then do that, then that, etc.", it's generally better to have an API that packages up all the sequence dependencies into a convenient form. Unfortunately, with things like capturing from a camera, this can cause significant IO blocking problems. With asynchronous IO, you'd like to be able to say "start getting the camera ready here asynchronously, so I can start using it there immediately without waiting". If you're ignoring asynch IO for the moment, then I'd say yes.

Q 2). No, read_frame shouldn't be removed, again because of sequential dependencies. Almost all programs will want to do get current/capture next, so we might as well make it easy for them.

Q 3). Again, it depends heavily on asynchronous capture. With asynch, it should definitely be "return current; get next", since getting the next one can be done asynchronously and won't block unless another read_frame is called before it's had a chance to capture. But if it's not asynchronous, then it makes more sense to do "get next; return current", since you probably want the latest frame.

It's starting to feel like we need a bigger API to handle asynch and synch, or at least two read_frames (one asynch, one synch)? I was also thinking that having an iterator interface would be syntactically neat and familiar, but again asynchronous IO really makes a mess of that (e.g. (*i++) would be an asynchronous read_frame, whereas (*++i) would be a synchronous read_frame).

> Thanks,
> Matt

-- Cheers, Brendan |
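[Editorial note: the iterator interface Brendan mentions could be sketched roughly as below. Everything here (frame_iter, the use of int for frames, done()) is illustrative only, not a real vidl2 proposal; the point is just that post-increment naturally expresses "return current; get next" and pre-increment the reverse.]

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Illustrative frame iterator over a pre-captured sequence.
class frame_iter
{
  const std::vector<int>* frames_;  // stand-in for the underlying stream
  std::size_t pos_;                 // current frame index
public:
  explicit frame_iter(const std::vector<int>& f) : frames_(&f), pos_(0) {}

  int operator*() const { return (*frames_)[pos_]; }

  // ++i: advance, then the caller dereferences the new frame
  // ("get next; return current").
  frame_iter& operator++() { ++pos_; return *this; }

  // i++: hand back the current position, then advance
  // ("return current; get next").
  frame_iter operator++(int) { frame_iter old(*this); ++pos_; return old; }

  bool done() const { return pos_ >= frames_->size(); }
};
```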
From: Matt L. <mat...@gm...> - 2008-09-03 14:48:56
|
On Tue, Sep 2, 2008 at 10:03 PM, Brendan McCane <bre...@gm...> wrote: > That should be easy for vidl2_v4l_istream. Since these are always live > streams AFAIK, returning -1 is the right thing to do. Can anyone think > of any situation where this isn't true? Well in that case it should be all set. I was under the impression that v4l did more than just live streams, but I've never really used it. > There's another issue with the v4l stuff - v4l is now obsolete and has > been for some time. It has been replaced by v4l2. Unfortunately, I > haven't kept up with the developments. I don't think it would take > much to convert over to v4l2, but it's something I just don't have the > time for at the moment. AFAIK v4l still works though. I've got an e-mail from someone claiming to have already written, but not checked in, a v4l2_istream. I'll forward that message along after this. If I get that code from him should it replace v4l_istream or is there a reason to keep both versions around? >> Second, I think we need to clear up the API regarding advance(), >> current_frame(), and read_frame(). The way it is now, advance() moves >> the stream pointer one frame ahead (usually without fully decoding the >> next frame), current_frame() decodes the current frame (if necessary) >> and returns the frame, and read_frame() calls advance() followed by >> current_frame(). It was also originally assumed that each istream >> could be opened in a state with the current frame pointing to an >> invalid frame before the first frame of the video. The reason for >> this was that a live stream could be opened without capturing any >> images until the first call of advance() (or read_frame()). The >> problem with opening in an invalid state is that when piping an >> istream into an ostream the image size and other properties often need >> to be known before opening the ostream. This usually requires >> accessing the first frame of the istream for probing anyway. 
The >> probing process can lead to dropping of the first frame because >> read_frame() calls advance() again before calling current_frame(). So >> the questions I have are: >> >> 1) Should each istream always open in a valid state so that a frame >> can be accessed with current_frame() and probed for properties? In >> the case of live streams we can always discard the initial frame by >> calling advance() right before the capture loop. However, this does >> require that capture devices be ready to capture when the stream is >> opened. >> >> 2) Should the redundant read_frame() function be removed? It may make >> iterating through streams easier for beginners, but it doesn't add any >> new functionality. >> >> 3) If read_frame() stays and each stream opens on the first frame, >> should read_frame() return the current frame and then advance instead >> of the reverse. Think i++ instead of ++i. This way you can open an >> istream and then immediately call read_frame() without dropping the >> first frame. >> >> There are still other lingering issues like synchronous vs >> asynchronous capture, but I think I'll leave that for another time. > > Unfortunately, asynchronous capture has significant implications for > all the questions above. For reading stuff off disk, it probably > doesn't matter much, most of the following is flavoured by wanting to > capture stuff from a camera in real time. Not necessarily. Depending on your needs, you can still do asynchronous capture without worrying about the order of calling advance() and current_frame(). The dc1394_istream captures live video asynchronously using a separate thread that continually captures frames into a ring buffer. When I "open" the stream I allocate a buffer of say 10 frames (usually just 2 or 3 is even enough). It spawns off a thread that continually captures images into the buffer by overwriting the oldest image. 
Asynchronously, the advance() function iterates through the same ring buffer but has a check to make sure the data is ready to prevent the vidl2 loop from outpacing the capture loop. This gives me asynchronous capture. It is fast, nonblocking, and rarely drops a frame. The disadvantage is that each capture is not triggered by a call to advance() so you can't precisely control the timing of the image capture. You tell the capture thread to collect at a certain frame rate and it does that by itself. Unfortunately, all of this functionality is provided within libdc1394. I'm sure we could duplicate it in vidl2 if there was general interest. Although it does require multithreading which we don't have a standard way of doing in vxl yet. > > For question 1), generally having the stream open in a valid state is > a good idea as it is likely to lead to fewer errors in user code based > on sequence dependencies. Rather than an API that says: first do this, > then do that, then that etc, it's generally better to have an API that > packages up all the sequence dependencies into a convenient form. > Unfortunately with things like capturing from a camera, this can cause > significant IO blocking problems. With asynchronous IO, you'd like to > be able to say "start getting the camera ready here asynchronously, so > I can start using it there immediately without waiting). If you're > ignoring asynch IO for the moment, then I'd say yes. I was hoping that in the case of live streams we could capture a single test image when opening. The camera can then go back into a waiting mode until the capture loop starts. The test image would be used to probe properties of the frames to come and will be discarded if the capture loop starts with advance() before the first current_frame(). > Q 2). No, read_frame shouldn't be removed, again because of sequential > dependencies. Almost all programs will want to do get current/capture > next, so might as well make it easy for them. 
Yeah, that was my original intent, but I recall there being some discussion a while back that it was redundant and not needed.

> Q3). Again it depends heavily on asynchronous capture. With asynch, it
> should definitely be "return current; get next" since getting the next
> one can be done asynchronously and won't be blocking unless another
> read_frame is called before it's had a chance to capture. But if it's
> not asynchronous, then it makes more sense to do "get next; return
> current", since you probably want to get the latest frame.

The only advantage of "get next; return current" is that subsequent calls to current_frame() will return the same frame returned by the last read_frame() call. But this also leads to dropping the first frame, even in the non-asynchronous case.

> It's starting to feel like we need a bigger API to handle asynch and
> synch, or at least two read_frames (one asynch, one synch)? I was also
> thinking that having an iterator interface would be syntactically neat
> and familiar, but again asynchronous IO really makes a mess of that
> (e.g. (*i++) would be an asynchronous read_frame, whereas (*++i) would
> be a synchronous read_frame).

Actually, I feel that this might be more of an argument for a smaller API (no read_frame() function). I am more in favor of having no read_frame() function than of having multiple read_frame() functions. The idea of read_frame() was to provide a simple function to use in a loop without really needing to know what was going on under the hood. It's not simple anymore when you have multiple versions.

Instead of

  while ((frame = istream.read_frame()))

it's not too much harder to use

  for (bool valid = true; valid && (frame = istream.current_frame()); valid = istream.advance())

or for asynchronous capture

  while ((frame = istream.current_frame()) && istream.advance())

These make the order of operations explicit. If they are still too complicated, maybe your idea of frame iterators could be used _instead_ of the read_frame() function. That would make a more compact and familiar way of iterating frames. A good C++ programmer should know the difference between ++i and i++, so they can be used appropriately to get the desired result.

--Matt |
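[Editorial note: the ring-buffer scheme Matt describes for the dc1394_istream can be sketched single-threaded as below. push_frame() plays the role of the separate capture thread, and advance() shows the readiness check that keeps the vidl2 loop from outpacing capture. All names are invented for illustration, not the actual vidl2 or libdc1394 interfaces.]

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Single-threaded sketch of a dc1394-style capture ring buffer.
class frame_ring
{
  std::vector<int> buf_;   // stand-in for captured frame data
  std::size_t head_;       // next slot the "capture thread" writes
  std::size_t tail_;       // next slot the vidl2 loop will consume
  std::size_t cur_;        // slot holding the current frame
  std::size_t count_;      // frames captured but not yet consumed
public:
  explicit frame_ring(std::size_t n)
    : buf_(n, 0), head_(0), tail_(0), cur_(0), count_(0) {}

  // Producer side: when the buffer is full, overwrite the oldest unread
  // frame so capture never blocks waiting for the consumer.
  void push_frame(int f)
  {
    buf_[head_] = f;
    head_ = (head_ + 1) % buf_.size();
    if (count_ == buf_.size())
      tail_ = (tail_ + 1) % buf_.size();  // oldest unread frame dropped
    else
      ++count_;
  }

  // Consumer side: advance() fails instead of blocking when no new
  // frame has been captured yet -- the readiness check.
  bool advance()
  {
    if (count_ == 0)
      return false;
    cur_ = tail_;
    tail_ = (tail_ + 1) % buf_.size();
    --count_;
    return true;
  }

  int current_frame() const { return buf_[cur_]; }
};
```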
From: Brendan M. <bre...@gm...> - 2008-09-03 21:50:34
|
Hi Matt, 2008/9/4 Matt Leotta <mat...@gm...>: > On Tue, Sep 2, 2008 at 10:03 PM, Brendan McCane > <bre...@gm...> wrote: >> That should be easy for vidl2_v4l_istream. Since these are always live >> streams AFAIK, returning -1 is the right thing to do. Can anyone think >> of any situation where this isn't true? > > Well in that case it should be all set. I was under the impression > that v4l did more than just live streams, but I've never really used > it. > > >> There's another issue with the v4l stuff - v4l is now obsolete and has >> been for some time. It has been replaced by v4l2. Unfortunately, I >> haven't kept up with the developments. I don't think it would take >> much to convert over to v4l2, but it's something I just don't have the >> time for at the moment. AFAIK v4l still works though. > > I've got an e-mail from someone claiming to have already written, but > not checked in, a v4l2_istream. I'll forward that message along after > this. If I get that code from him should it replace v4l_istream or is > there a reason to keep both versions around? v4l is obsoleted in the 2.5.x and 2.6.x kernels, but is still useful for the 2.4.x kernels. So it depends if we want to support 2.4.x kernels. I couldn't find any information on whether 2.4.x is still in widespread use, but it's probably just easier to make a v4l2_istream just in case someone wants to use 2.4. > > >>> Second, I think we need to clear up the API regarding advance(), >>> current_frame(), and read_frame(). The way it is now, advance() moves >>> the stream pointer one frame ahead (usually without fully decoding the >>> next frame), current_frame() decodes the current frame (if necessary) >>> and returns the frame, and read_frame() calls advance() followed by >>> current_frame(). It was also originally assumed that each istream >>> could be opened in a state with the current frame pointing to an >>> invalid frame before the first frame of the video. 
The reason for >>> this was that a live stream could be opened without capturing any >>> images until the first call of advance() (or read_frame()). The >>> problem with opening in an invalid state is that when piping an >>> istream into an ostream the image size and other properties often need >>> to be known before opening the ostream. This usually requires >>> accessing the first frame of the istream for probing anyway. The >>> probing process can lead to dropping of the first frame because >>> read_frame() calls advance() again before calling current_frame(). So >>> the questions I have are: >>> >>> 1) Should each istream always open in a valid state so that a frame >>> can be accessed with current_frame() and probed for properties? In >>> the case of live streams we can always discard the initial frame by >>> calling advance() right before the capture loop. However, this does >>> require that capture devices be ready to capture when the stream is >>> opened. >>> >>> 2) Should the redundant read_frame() function be removed? It may make >>> iterating through streams easier for beginners, but it doesn't add any >>> new functionality. >>> >>> 3) If read_frame() stays and each stream opens on the first frame, >>> should read_frame() return the current frame and then advance instead >>> of the reverse. Think i++ instead of ++i. This way you can open an >>> istream and then immediately call read_frame() without dropping the >>> first frame. >>> >>> There are still other lingering issues like synchronous vs >>> asynchronous capture, but I think I'll leave that for another time. >> >> Unfortunately, asynchronous capture has significant implications for >> all the questions above. For reading stuff off disk, it probably >> doesn't matter much, most of the following is flavoured by wanting to >> capture stuff from a camera in real time. > > Not necessarily. 
Depending on your needs, you can still do > asynchronous capture without worrying about the order of calling > advance() and current_frame(). The dc1394_istream captures live video > asynchronously using a separate thread that continually captures > frames into a ring buffer. When I "open" the stream I allocate a > buffer of say 10 frames (usually just 2 or 3 is even enough). It > spawns off a thread that continually captures images into the buffer > by overwriting the oldest image. Asynchronously, the advance() > function iterates through the same ring buffer but has a check to make > sure the data is ready to prevent the vidl2 loop from outpacing the > capture loop. > > This gives me asynchronous capture. It is fast, nonblocking, and > rarely drops a frame. The disadvantage is that each capture is not > triggered by a call to advance() so you can't precisely control the > timing of the image capture. You tell the capture thread to collect > at a certain frame rate and it does that by itself. > > Unfortunately, all of this functionality is provided within libdc1394. > I'm sure we could duplicate it in vidl2 if there was general > interest. Although it does require multithreading which we don't have > a standard way of doing in vxl yet. Well, that would be an excellent solution. But you're right, it probably should wait until the multithreading issue is sorted out. > >> >> For question 1), generally having the stream open in a valid state is >> a good idea as it is likely to lead to fewer errors in user code based >> on sequence dependencies. Rather than an API that says: first do this, >> then do that, then that etc, it's generally better to have an API that >> packages up all the sequence dependencies into a convenient form. >> Unfortunately with things like capturing from a camera, this can cause >> significant IO blocking problems. 
With asynchronous IO, you'd like to >> be able to say "start getting the camera ready here asynchronously, so >> I can start using it there immediately without waiting). If you're >> ignoring asynch IO for the moment, then I'd say yes. > > I was hoping that in the case of live streams we could capture a > single test image when opening. The camera can then go back into a > waiting mode until the capture loop starts. The test image would be > used to probe properties of the frames to come and will be discarded > if the capture loop starts with advance() before the first > current_frame(). OK then - I'd be happy with that. > >> Q 2). No, read_frame shouldn't be removed, again because of sequential >> dependencies. Almost all programs will want to do get current/capture >> next, so might as well make it easy for them. > > Yeah, that was my original intent, but I recall there being some > discussion a while back that it was redundant and not needed. > > >> Q3). Again it depends heavily on asynchronous capture. With asynch, it >> should definitely be "return current; get next" since getting the next >> one can be done asynchronously and won't be blocking unless another >> read_frame is called before it's had a chance to capture. But if it's >> not asynchronous, then it makes more sense to do "get next; return >> current", since you probably want to get the latest frame. > > The only advantage of "get next; return current" is that subsequent > calls to current_frame() will return the same frame returned by the > last read_frame() call. But this also leads to dropping the first > frame, even in the non-asynchronous case. > > >> It's starting to feel like we need a bigger API to handle asynch and >> synch, or at least two read_frames (one asynch, one synch)? I was also >> thinking that having an iterator interface would be syntactically neat >> and familiar, but again asynchronous IO really makes a mess of that >> (e.g. 
(*i++) would be an asynchronous read_frame, whereas (*++i) would >> be a synchronous read_frame). > > Actually, I feel that this might be more of an argument for a smaller > API (no read_frame() function). I am more in favor of having no read_frame() > function than of having multiple read_frame() functions. The idea > of read_frame() was to provide a simple function to use in a loop > without really needing to know what was going on under the hood. > It's not simple anymore when you have multiple versions. True. > > Instead of > > while ((frame = istream.read_frame())) > > it's not too much harder to use > > for (bool valid = true; valid && (frame = istream.current_frame()); > valid = istream.advance()) > > or for asynchronous capture > > while ((frame = istream.current_frame()) && istream.advance()) > > These make the order of operations explicit. If they are still too > complicated, maybe your idea of frame iterators could be used > _instead_ of the read_frame() function. That would make a more > compact and familiar way of iterating frames. A good C++ programmer > should know the difference between ++i and i++, so they can be used > appropriately to get the desired result.

Well, I would prefer the iterator syntax to the wordier current_frame/advance syntax, and as you say, it's a nicer solution than multiple read_frames, but I'm not keen enough to do anything about it myself right now :-). I don't think it will be too big an issue if read_frame disappears ...

> --Matt

-- Cheers, Brendan |
From: Peter V. <pet...@ya...> - 2008-09-04 08:06:25
|
> for (bool valid = true;
>      valid && (frame = istream.current_frame());
>      valid = istream.advance())
>
> or for asynchronous capture
>
> while ((frame = istream.current_frame()) && istream.advance())
>
> These make the order of operations explicit.
> Maybe your idea of frame iterators could be used
> _instead_ of the read_frame() function.

I'd say I like this setup! The explicitness of "advance then read" (or vice versa) makes it clear, also for the occasional user, what's happening.

-- Peter. |
From: Matt L. <mat...@gm...> - 2008-09-03 15:09:20
|
Antonio,

Yes, I remember your e-mail about v4l2. It sounds like your code is complete enough to check into vidl2. Once it is in VXL, other developers can help finish it. Brendan McCane may have an interest in using your code to replace the current vidl2_v4l_istream. If not, we can check in a separate vidl2_v4l2_istream.

Please document which parts of the code you feel need improvement (either in the comments or a separate file) and e-mail the code to Brendan and myself. I can quickly get it checked in and building on the dashboard, but I don't have time to seriously test it and finish it up. Maybe Brendan can help with that? I'll wait for a decision on whether this should be an upgrade to vidl2_v4l_istream or a separate stream before I do anything.

The only major changes in vidl2 between 1.10 and 1.11 are the recent changes I described in my e-mail yesterday. Your code should still work fine with the addition of a member function:

  int num_frames() { return -1; }

--Matt

On Wed, Sep 3, 2008 at 5:17 AM, Antonio Garrido Carrillo <aga...@de...> wrote: > > Hi Matt, > > I have been writing some code for vidl2 (vxl-1.10.0) to use the v4l2 API (I > don't know if you remember I sent you an e-mail about this). > > Well, the code is not finished, but I have written a > vidl2_v4l2_live_example > which works. Perhaps this code is interesting for you, especially after > the mail from Brendan. > > Cheers, > Antonio > > P.S. I am using vxl-1.10.0, but I could change it for vxl-1.11.0. Then you > could take a look at the source to know if you like the design, if you want to > change something, or even to know that the code is not interesting for vxl. > > Matt Leotta wrote: >> >> Hi All, >> >> I'm cleaning up vidl2 a bit in preparation for the eventual move to >> core. There are a few unresolved issues with the API that I'd like to >> get some feedback on. >> >> First, I've added a num_frames() function to the vidl2_istream. 
>> This has been requested by users, and seems like it should be there. >> This function should return the number of frames in a video or -1 for >> live streams or other streams with indeterminate length. I've already >> implemented this for vidl2_image_list_istream and >> vidl2_ffmpeg_istream. All other istreams currently return -1. I >> could use some help with implementing this in vidl2_dshow_file_istream >> from Miguel and in vidl2_v4l_istream from Brendan. Of course, anyone >> else familiar with these is welcome to chip in. >> >> Second, I think we need to clear up the API regarding advance(), >> current_frame(), and read_frame(). The way it is now, advance() moves >> the stream pointer one frame ahead (usually without fully decoding the >> next frame), current_frame() decodes the current frame (if necessary) >> and returns the frame, and read_frame() calls advance() followed by >> current_frame(). It was also originally assumed that each istream >> could be opened in a state with the current frame pointing to an >> invalid frame before the first frame of the video. The reason for >> this was that a live stream could be opened without capturing any >> images until the first call of advance() (or read_frame()). The >> problem with opening in an invalid state is that when piping an >> istream into an ostream the image size and other properties often need >> to be known before opening the ostream. This usually requires >> accessing the first frame of the istream for probing anyway. The >> probing process can lead to dropping of the first frame because >> read_frame() calls advance() again before calling current_frame(). So >> the questions I have are: >> >> 1) Should each istream always open in a valid state so that a frame >> can be accessed with current_frame() and probed for properties? In >> the case of live streams we can always discard the initial frame by >> calling advance() right before the capture loop. 
However, this does >> require that capture devices be ready to capture when the stream is >> opened. >> >> 2) Should the redundant read_frame() function be removed? It may make >> iterating through streams easier for beginners, but it doesn't add any >> new functionality. >> >> 3) If read_frame() stays and each stream opens on the first frame, >> should read_frame() return the current frame and then advance instead >> of the reverse. Think i++ instead of ++i. This way you can open an >> istream and then immediately call read_frame() without dropping the >> first frame. >> >> There are still other lingering issues like synchronous vs >> asynchronous capture, but I think I'll leave that for another time. >> >> Thanks, >> Matt |
From: Antonio G. C. <aga...@de...> - 2008-09-08 11:08:47
Attachments:
v4l2.tgz
|
Hi Matt, In this mail I am sending the code that I wrote for a new class, vidl2_v4l2_istream. I tried to make the code more complete than the previous v4l code, adding more control over the video device, so it is not yet finished. In fact, code to add controls of the device is still needed, but the stream itself is already working (I have used it), so it could be added to vidl2. The code in the attached v4l2.tgz file can be extracted into the directory brl/bbas/vidl2 of vxl 1.11. Note that 3 files will be overwritten: - CMakeLists.txt (for adding v4l2 code to the library) - examples/CMakeLists.txt (for adding vidl2_v4l2_live_example.cxx) - vidl2_convert.cxx (added conversions for RGB_24(P), MONO8 and YUYV_422) Is this code interesting for vxl? If so, it could be worthwhile to add the code for controls and to include it as part of the vidl2 library. Comments and suggestions are welcome. thanks, Antonio. University of Granada (Spain) Matt Leotta wrote: > Antonio, > > Yes, I remember your e-mail about v4l2. It sounds like your code is > complete enough to check into vidl2. Once it is in VXL, other > developers can help finish it. Brendan McCane may have an interest > in using your code to replace the current vidl2_v4l_istream. If not, we > can check in a separate vidl2_v4l2_istream. > > Please document which parts of the code you feel need improvement > (either in the comments or a separate file) and e-mail the code to > Brendan and myself. I can quickly get it checked in and building on > the dashboard, but I don't have time to seriously test it and finish > it up. Maybe Brendan can help with that? I'll wait for a decision on > whether this should be an upgrade to vidl2_v4l_istream or a separate > stream before I do anything. > > The only major changes to vidl2 between 1.10 and 1.11 are the recent > changes I described in my e-mail yesterday. Your code should still > work fine with the addition of a member function: > > int num_frames() { return -1; } > > > --Matt > > |
From: Matt L. <mat...@gm...> - 2008-09-08 20:07:24
|
Antonio, thanks for the code! I've already checked in your additions to vidl2_convert.cxx since they are more broadly useful. I can check your code in as is, but I don't have time to maintain it or any experience with video 4 linux to test it. I think either another VXL maintainer needs to try it out and then help maintain it, or we'll need you to become a VXL maintainer and do it yourself. Are you interested in becoming a VXL maintainer? Brendan, as the other V4L expert, are you or is anyone in your group interested in picking this up? Either as is or as a starting point to upgrade the current V4L code. I'll hold off on checking in the rest of the code for now, but it looks alright to me. The design is a little different from some of the other live streams. It uses a "device" class which is configured and then passed to an istream instead of a parameters class. The device class holds parameters and does most of the work, while the istream is mostly a wrapper around the device. We don't really have a standard on initializing with parameters yet, but maybe we should. There was talk a while back about reading parameters from files using XML or some standard file format... --Matt |
From: Brendan M. <bre...@gm...> - 2008-09-08 23:09:39
|
2008/9/9 Matt Leotta <mat...@gm...>: > Antonio, thanks for the code! I've already checked in your additions > to vidl2_convert.cxx since they are more broadly useful. I can check > your code in as is, but I don't have time to maintain it or any > experience with video 4 linux to test it. I think either another VXL > maintainer needs to try it out and then help maintain it, or we'll > need you to become a VXL maintainer and do it yourself. Are you > interested in becoming a VXL maintainer? > > Brendan, as the other V4L expert, are you or is anyone in your group > interested in picking this up? Either as is or as a starting point to > upgrade the current V4L code. Hi Matt, Ideally I would, but I'm absolutely snowed at the moment and am unlikely to get to it in the foreseeable future. I do think that it should follow the style of the other vidl streams, that is, use a parameters class - that will make it simpler to write cross-platform code. -- Cheers, Brendan |
From: Antonio G. C. <aga...@de...> - 2008-09-09 08:17:18
|
Brendan McCane wrote: > 2008/9/9 Matt Leotta <mat...@gm...>: > >> Antonio, thanks for the code! I've already checked in your additions >> to vidl2_convert.cxx since they are more broadly useful. I can check >> your code in as is, but I don't have time to maintain it or any >> experience with video 4 linux to test it. I think either another VXL >> maintainer needs to try it out and then help maintain it, or we'll >> need you to become a VXL maintainer and do it yourself. Are you >> interested in becoming a VXL maintainer? >> >> Brendan, as the other V4L expert, are you or is anyone in your group >> interested in picking this up? Either as is or as a starting point to >> upgrade the current V4L code. >> > > Hi Matt/Brendan, I am not a v4l2 expert. I have written this code by reading other programs, not by studying the problem in depth. Probably, a v4l2 expert could improve this code. Anyway, although I do not have much time, I can read comments and try to maintain this code. > Hi Matt, > > Ideally I would, but I'm absolutely snowed at the moment and am > unlikely to get to it in the foreseeable future. > > I do think that it should follow the style of the other vidl > streams, that is, use a parameters class - that will make it simpler > writing cross-platform code. > I was thinking about a new design for controls because, as indicated in the v4l2 specification, different devices have different controls available, and furthermore, the range of possible values varies from device to device. Then, if I write a parameters class, what is the range of values? The default value? Or even, which controls are associated with the current input? Too many questions, so I delayed the design, thinking that the best solution would be a set of classes that provide better control of the device. For example, it could be possible to write a program which includes a dynamic set of controls obtained from the driver. Well, as you see, I have not yet written anything about controls. 
;) > >> I'll hold off on checking in the rest of the code for now, but it >> looks alright to me. The design is a little different from some of >> the other live streams. It uses a "device" class which is configured >> and then passed to an istream instead of a parameters class. The >> device class holds parameters and does most of the work while the >> istream is mostly a wrapper around the device. We don't really have a >> standard on initializing with parameters yet, but maybe we should. >> There was talk a while back about reading parameters from files using >> XML or some standard file format... >> That's right. The device class appears because a device is very different from an istream. These are two different objects, and it is not easy to create a single class to handle a v4l2_istream that includes all the functionality of both istreams and devices. I thought that a better design should separate the two concepts so that it would be easier to maintain both classes. Anyway, it could be interesting to think about a solution in which a parameters class is used, perhaps including the most important controls as a wrapper around the device controls...? Antonio. |
From: Matt L. <mat...@gm...> - 2008-09-09 14:00:23
|
On Tue, Sep 9, 2008 at 4:16 AM, Antonio Garrido Carrillo <aga...@de...> wrote: > Hi Matt/Brendan, > > I am not a v4l2 expert. I have written this code reading other programs but > not studying > the problem in depth. Probably, a v4l2 expert could improve this code. > Anyway, although > I do not have much time, I can read comments and try to maintain this code. Since you have written working code to interface with v4l2, that makes you an "expert" in my opinion. It certainly means you have more knowledge on the subject than most VXL maintainers. I'll discuss design issues in a separate e-mail, but first let's get you developer access for VXL. Can someone with the appropriate privileges grant developer access to Antonio? I can vouch for his coding abilities since I looked at the code he sent. Thanks, Matt |
From: Ian S. <ian...@st...> - 2008-09-09 15:04:43
|
Antonio, I can give you developer access to the repository, but you need to create an account at sourceforge first, and let me know the username. regards, Ian. |
From: Matt L. <mat...@gm...> - 2008-09-09 14:35:29
|
<addressing v4l2 design> On Tue, Sep 9, 2008 at 4:16 AM, Antonio Garrido Carrillo <aga...@de...> wrote: > I was thinking about a new design for controls because, as indicated in v4l2 > specification, > different devices have different controls available, and furthermore, the > range of possible values > vary from device to device. Then, if I write a parameters class, which is the > range of values? > default value? or even, which are the controls associated with current > input? This is all very similar to the vidl2_dc1394_istream class I wrote. That stream uses libdc1394 to control live capture from IEEE 1394 cameras. One key difference is that this library is based on a published standard describing the set of parameters to be used by compliant cameras. I'm not sure how standard or constrained the parameters are for v4l2. However, I think a similar design will work. The vidl2_dc1394_istream class does all the work interfacing with libdc1394. To open a stream you pass it a vidl2_iidc1394_params object. In this case the parameters struct follows the IIDC standard and is not specific to libdc1394 (the same params might be used in the future with a Windows 1394 camera driver). There is another struct named valid_options contained within vidl2_iidc1394_params. The vidl2_dc1394_istream has a static method to probe the bus for available cameras and stores the camera properties (like which options the camera supports) in the valid_options class. The user can then generate a valid set of parameters from this information. It's still a work in progress, but what I have seems to work. The key idea is a parameters class that holds all the values but does no work, and an istream that configures itself from the parameters and does all the work. We should probably standardize this a bit more for all vidl2 streams. For now I think you can check in the code as it is (I've already checked in your additions to vidl2_convert.cxx). 
You can adjust the design over time, when you have time. Hopefully we will work out a standard way of handling parameters at some point. Thanks, Matt |
From: Antonio G. C. <A.G...@de...> - 2011-12-12 17:23:13
|
Hi all, After some time not using vxl, I am now re-compiling some programs before continuing to improve this great software... I have found vpgl moved to core, which is good, but I do not know why the vpgl*radial_distortion* files have not been moved. I have been looking for some comments in the mailing list without success. Is there any problem with including these files? I have seen they are still included in vxl-1.14.0, but not in svn... Thanks Antonio |
From: Amitha P. <ami...@us...> - 2008-09-04 12:32:02
|
On advance() and current_frame(): I personally prefer this, because it is clear when the acquisition starts, and when the program requires access to the pixels. I'm agnostic on read_frame(), but I think that we must have read_frame() == advance() && current_frame(). And, in fact, any other variants must be implemented in terms of advance() and current_frame(). On the asynchronous capture: I've found it useful in the past to simply have a helper class for asynchronous capture. Something like: async_istream< v4l2_istream > reader; The async_istream can manage buffers, threads, etc. This allows easy asynchronous reading of simple video files too, which yields a real performance benefit since processing can continue during I/O. The benefit will be even greater these days with multi-core CPUs. On stream parameters: it is indeed useful to be able to query stream parameters, such as width and height, before the processing begins. This is especially true for programs with a GUI, which may want to open a window, etc. Perhaps we could add a method "acquire_description" that will do whatever it takes to return a structure containing a description of the stream? This method may pause to initialize a camera, acquire an image, etc. Finally, on timestamps: I think now would be a good time to add a timestamp to the vidl2_frame. Time is a critical aspect of video, and we all end up carrying around the relevant information outside of the pixels anyway. We could either add a member to the vidl2_frame, or else add a method to each istream ("current_time()"?). I suggest that the timestamp structure contains an unsigned integer for the frame number, and a double representing the time in UTC. I think the time should be represented as seconds in the Unix way, since there is a well-known algorithm to convert from Y/M/D to a single number and back. 
Since a double has 53 bits of precision, at millisecond (10^-3 ~= 2^-10) resolution, we can represent 1970 +/- 2^53/2^10/3600/24/365 ~= 278922 years before we run into issues. At microsecond resolution, this is +/- 272 years. I think we should use a double instead of a 64 bit integer for a timestamp because in most vision processes, we will be using the time to compute things like velocity anyway, and we'll be converting to a double before doing the arithmetic. Of course, for many video streams, the time will simply be the number of seconds from the first frame, but having the notion that it *should* be a representation of UTC makes it easier to develop algorithms that cope with multiple cameras, etc. I suggest that the timestamp structure always contains a frame number, but does not need to contain a time. If it does not contain a time, but is asked for one, it will return the frame number. In a statement like (x0-x1)/(time0-time1).seconds() < threshold it will allow threshold to mean "units/second" or "units/frame", both of which are meaningful to vision, and clear from the processing context. Amitha. |
From: Matt L. <mat...@gm...> - 2008-09-04 14:36:21
|
On Thu, Sep 4, 2008 at 8:32 AM, Amitha Perera <ami...@us...> wrote: > On advance() and current_frame(): I personally prefer this, because it is > clear when the acquisition starts, and when the program requires access to > the pixels. I'm agnostic on read_frame(), but I think that we must have > read_frame() == advance() && current_frame(). And, in fact, any other > variants must be implemented in terms of advance() and current_frame(). I'm starting to see that however we define read_frame() it will cause confusion and duplicated or dropped frames for some users. I'm now more confident that read_frame() should simply be dropped. Users can use advance() and current_frame() to explicitly get the desired result. I also like the idea of a frame iterator to allow for easy video traversal. The pre- and post-increment operators provide both orderings of advance() and current_frame(). The only problem I see with the iterators is that currently frames are only guaranteed to be valid until the next call of advance(). For the post-increment we would have to require at least double buffering in all istreams to allow a frame to remain valid until two calls of advance(). > On the asynchronous capture: I've found it useful in the past to simply have > a helper class for asynchronous capture. Something like: > > async_istream< v4l2_istream > reader; > > The async_istream can manage buffers, threads, etc. This allows easy > asynchronous reading of simple video files too, which does have a > performance impact since processing can continue during I/O. It will have > even more impact these days with multi-core CPUs. Agreed. It makes it much easier on the istream class if it doesn't need to know about asynchronous operations. The helper class is a great idea and provides yet another good reason to get some sort of threading capability into vxl asap. > On stream parameters: it is indeed useful to be able to query stream > parameters, such as width and height, before the processing begins. 
This is > especially true for programs with a GUI, which may want to open a window, > etc. Perhaps we could add a method "acquire_description" that will do > whatever it takes to return a structure containing a description of the > stream? This method may pause to initialize a camera, acquire an image, > etc. What I'm doing now is extracting these properties from the first frame. That's why I'm requiring preloading of the first frame. Does it make more sense to explicitly add this functionality to the istream? Does that change the need to preload the first frame? It still seems nice that the video doesn't open in an invalid state. > Finally, on timestamps: I think now would be a good time to add a timestamp > to the vidl2_frame. Time is a critical aspect of video, and we all end up > carrying around the relevant information outside of the pixels anyway. We > could either add a member to the vidl2_frame, or else add a method to each > istream ("current_time()"?). I vote for timestamps attached to frames, but it might also be useful to have start_time() and current_time() functions on the stream itself. > I suggest that the timestamp structure contains an unsigned integer for the > frame number, and a double representing the time in UTC. I think the time > should be represented as seconds in the Unix way, since there is a > well-known algorithm to convert from Y/M/D to a single number and back. Is there any good reason why timestamps and frame numbers should be part of the same structure? > Since a double has 53 bits of precision, at millisecond (10^-3 ~= 2^-10) > resolution, we have can represent 1970 +/- 2^53/2^10/3600/24/365 ~= 278922 > years before we run into issues. At microsecond resolution, this is +/- 272 > years. 
> > I think we should use a double instead of a 64 bit integer for a timestamp > because in most vision processes, we will be using the time to compute > things like velocity anyway, and we'll be converting to a double before > doing the arithmetic. > > Of course, for many video streams, the time will simply be the number of > seconds from the first frame, but having the notion that it *should* be a > representation of UTC makes it easier to develop algorithms that cope with > multiple cameras, etc. I still don't see the big advantage of storing a timestamp as a double. Why not a 64-bit integer? It's true that most uses will end up converting to double, but they will likely also convert the units. The timestamp class could still have functions to return seconds, minutes, or hours as double. > I suggest that the timestamp structure always contains a frame number, but > does not need to contain a time. If it does not contain a time, but is asked > for one, it will return the frame number. In a statement like > (x0-x1)/(time0-time1).seconds() < threshold > it will allow threshold to mean "units/second" or "units/frame", both of > which are meaningful to vision, and clear from the processing context. I don't think I like this idea. Yes, a frame should always have a frame number and a timestamp may be optional, but I think it should be made explicit whether or not the timestamp is available. In the example above, the threshold will need to be different unless your video is 1 FPS. I think it also makes sense to have some indicator of whether a timestamp is relative or absolute. Either use a bit in the timestamp, or have a separate bool. The difference between two times should be relative. Also, many videos will not provide an absolute starting time but will provide a relative time from the start of the sequence. It might be nice to know which case you are dealing with. Finally, where does the absolute time information get stored on disk? 
This is all fine when processing live video, but more frequently users will be capturing video to disk and processing video read from disk. Do any video file formats have a field for storing absolute time? I don't think we can trust OS file creation times. What about image lists? I know some image formats store a timestamp; has this been added to vil? Maybe we need to start reading and writing a metadata file. --Matt |
From: Matthew L. <mat...@gm...> - 2008-11-21 13:35:42
|
Antonio, Your problem might be solved by a correction to the CMake module for finding ffmpeg. Can you report the output from running "ffmpeg -version" on your system? It's hard to match version numbers from the package name alone. My output looks like this: FFmpeg version SVN-r11312, Copyright (c) 2000-2007 Fabrice Bellard, et al. configuration: --disable-vhook --enable-shared --disable-mmx libavutil version: 49.6.0 libavcodec version: 51.49.0 libavformat version: 52.3.0 built on Oct 12 2008 13:50:49, gcc: 4.0.1 (Apple Inc. build 5488) FFmpeg SVN-r11312 libavutil 3212800 libavcodec 3354880 libavformat 3408640 My version is from subversion about a year ago (I think). I am aware that the ffmpeg API has been significantly changed and that vidl2 does not support those changes. However, I thought the change was more than just altering the include paths. Can you describe exactly how the headers have been redistributed in your version? The long term goal is to have a stable version of ffmpeg copied into VXL (in v3p) so there is always a working version available. Amitha Perera is working on that. We will also support linking to a "head" version of ffmpeg which hopefully will be kept reasonably up to date with the ffmpeg svn. I should probably start working on this head version, but I don't want to duplicate any of Amitha's efforts. Amitha, have you settled on a version of ffmpeg for v3p? If I was to check out the current version of ffmpeg would it be significantly different from what you are working on? Thanks, Matt On Nov 21, 2008, at 6:26 AM, Antonio Garrido Carrillo wrote: > > Hi Matt, > > I have been changing some code in vidl2, but on my system (Mandriva > 2009) ffmpeg support is > not compiled. > I have seen this is not done because include directories have been > changed. > I do not remember the problem with ffmpeg (last version). I remember > you sent some > message saying that the new API is not compatible. 
> However, I have compiled vidl2 with ffmpeg support by adding a new > directory ffmpeg > with all the header files in package libffmpeg- > devel-0.4.9-3.pre1.14161.1mdv2009.0 > > Is this the original problem? That is, the redistribution of code in > different directories? > > Thanks > Antonio. > |
From: Amitha P. <ami...@us...> - 2008-09-08 15:09:05
|
[splitting the multiple threads of conversation into multiple read threads] Matt Leotta wrote: > On Thu, Sep 4, 2008 at 8:32 AM, Amitha Perera > <ami...@us...> wrote: >> On stream parameters: it is indeed useful to be able to query stream >> parameters, such as width and height, before the processing begins. This is >> especially true for programs with a GUI, which may want to open a window, >> etc. Perhaps we could add a method "acquire_description" that will do >> whatever it takes to return a structure containing a description of the >> stream? This method may pause to initialize a camera, acquire an image, >> etc. > > What I'm doing now is extracting these properties from the first > frame. That's why I'm requiring preloading of the first frame. Does > it make more sense to explicitly add this functionality to the > istream? Does that change the need to preload the first frame? It > still seems nice that the video doesn't open in an invalid state. I think preloading of the frame does the job, and I'm fine with that. I suggested something like "acquire_description()" because it may not be necessary to preload the frame, depending on the video source. Moreover, the latter can be trivially implemented if the first frame has been loaded. In any case, even if it costs a little to initialize the video resource, the initialization phase will be so short compared to the rest that we should probably ignore the effect. However, the preloading of the frame is completely hidden from the user, correct? Or are you suggesting that the state after open() is that the first frame has been loaded/captured? I'd be okay with either case; we just need to be clear. |
From: Matt L. <mat...@gm...> - 2008-09-08 20:45:21
|
On Mon, Sep 8, 2008 at 11:09 AM, Amitha Perera <ami...@us...> wrote: > [splitting the multiple threads of conversation into multiple read threads] > > Matt Leotta wrote: >> >> On Thu, Sep 4, 2008 at 8:32 AM, Amitha Perera >> <ami...@us...> wrote: >>> >>> On stream parameters: it is indeed useful to be able to query stream >>> parameters, such as width and height, before the processing begins. This >>> is >>> especially true for programs with a GUI, which may want to open a window, >>> etc. Perhaps we could add a method "acquire_description" that will do >>> whatever it takes to return a structure containing a description of the >>> stream? This method may pause to initialize a camera, acquire an image, >>> etc. >> >> What I'm doing now is extracting these properties from the first >> frame. That's why I'm requiring preloading of the first frame. Does >> it make more sense to explicitly add this functionality to the >> istream? Does that change the need to preload the first frame? It >> still seems nice that the video doesn't open in an invalid state. > > I think preloading of the frame does the job, and I'm fine with that. I > suggested something like "acquire_description()" because it may not be > necessary to preload the frame, depending on the video source. Moreover, the > latter can be trivially implemented if the first frame has been loaded. In > any case, even if it costs a little to initialize the video resource, the > initialization phase will be so short compared to the rest that we should > probably ignore the effect. > > However, the preloading of the frame is completely hidden from the user, > correct? Or are you suggesting that the state after open() is that the > first frame has been loaded/captured? I'd be okay with either case; we just > need to be clear. I was suggesting that the first frame has been loaded/captured after open() has been called. The advantage is that you can call current_frame() immediately after open() and get something valid. 
The disadvantage is that this doesn't always make sense for live video. If
you don't call advance() before current_frame() when starting your capture
loop, you will start with the frame captured at the time open() was called.

I like the idea of acquire_description(). There are some properties of video
not currently associated with a frame (and probably shouldn't be), like
frame rate. It's probably best that the istream can provide all its
properties without necessarily decoding a frame (although it may have to do
this internally). Is there any reason that acquire_description() couldn't
happen as part of open()? If at least the properties of an open istream are
always available, then I am not concerned about an image always being
available.
|
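[Editor's note: a minimal toy sketch of the two opening conventions discussed
above. The class and method names mirror the vidl2 istream interface
(open/advance/current_frame/read_frame/num_frames) but this is an
illustration of the *proposed* semantics, not the actual vidl2
implementation; `toy_istream` and its fake int-per-frame "video" are
invented for the example.]

```cpp
#include <cassert>
#include <vector>

// Toy stand-in for a vidl2-style istream, illustrating the proposed
// convention that open() preloads the first frame, so current_frame()
// is valid (and properties can be probed) before any call to advance().
class toy_istream
{
  std::vector<int> frames_; // fake "video": one int per frame
  int cur_;                 // index of the current (preloaded) frame

 public:
  toy_istream() : cur_(-1) {}

  bool open(const std::vector<int>& frames)
  {
    frames_ = frames;
    cur_ = frames_.empty() ? -1 : 0; // preload the first frame
    return cur_ >= 0;
  }

  bool is_valid() const { return cur_ >= 0 && cur_ < int(frames_.size()); }

  // number of frames, or -1 for live/indeterminate streams
  int num_frames() const { return int(frames_.size()); }

  // move the stream pointer one frame ahead (no decode in real vidl2)
  bool advance() { ++cur_; return is_valid(); }

  // decode (if necessary) and return the current frame
  int current_frame() const { assert(is_valid()); return frames_[cur_]; }

  // read_frame() == advance() followed by current_frame(); note that
  // calling this right after open() skips the preloaded first frame --
  // exactly the frame-dropping pitfall discussed in this thread.
  int read_frame() { advance(); return current_frame(); }
};
```

With preloading, the loop that does not drop frame 0 is
`do { use(s.current_frame()); } while (s.advance());` rather than
`while (s.advance()) use(s.current_frame());`.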
From: Amitha P. <ami...@us...> - 2008-11-21 13:51:19
|
On Nov 21, 2008, at 6:26 AM, Antonio Garrido Carrillo wrote:
> I remember you said some message because the new API is not
> compatible. However, I have compiled vidl2 with ffmpeg support by
> adding a new directory ffmpeg with all headers files in package
> libffmpeg-devel-0.4.9-3.pre1.14161.1mdv2009.0
>
> Is this the original problem? that is, the redistribution of code in
> different directories?

Antonio,

ffmpeg 0.4.9-3 is the last "release" that the ffmpeg folks did, and that was
a long time ago. In that sense, it is an "old" version of ffmpeg, and it may
be compatible enough with the use in vidl2. The API problem exists in the
current SVN version of ffmpeg. With your version of ffmpeg, I think
following Matt's suggestions will get you a video-enabled vxl.

Matthew Leotta wrote:
> Amitha, have you settled on a version of ffmpeg for v3p? If I was to
> check out the current version of ffmpeg would it be significantly
> different from what you are working on?

That has an implicit assumption that I'm working on it. :-) The ffmpeg
process, unfortunately, only gets sporadic bursts of attention from me.
However, there is a good chance that the effort will soon accelerate,
because we may soon have a contractor to whom this will be tasked.

Amitha.
|
From: Matthew L. <mat...@gm...> - 2008-11-21 14:06:03
|
On Nov 21, 2008, at 8:51 AM, Amitha Perera wrote:
> Matthew Leotta wrote:
>> Amitha, have you settled on a version of ffmpeg for v3p? If I was to
>> check out the current version of ffmpeg would it be significantly
>> different from what you are working on?
>
> That has an implicit assumption that I'm working on it. :-) The
> ffmpeg process, unfortunately, only gets sporadic bursts of
> attention from me. However, there is a good chance that the effort
> will soon accelerate, because we may soon have a contractor to whom
> this will be tasked.

I'll take that to mean that you haven't decided on any particular version
for v3p. When you start up on this again, do you plan to grab the current
release of ffmpeg to import and make it work with vidl2, or do you plan to
take a version that currently works with vidl2 and move it into v3p?

--Matt
|
From: Amitha P. <ami...@us...> - 2008-11-21 14:25:31
|
Matthew Leotta wrote:
> I'll take that to mean that you haven't decided on any particular
> version for v3p. When you start up on this again, do you plan to grab
> the current release of ffmpeg to import and make it work with vidl2, or
> do you plan to take a version that currently works with vidl2 and move
> it into v3p?

I plan to work on the latest version of ffmpeg.

Amitha.
|
From: Matthew L. <mat...@gm...> - 2008-11-21 15:06:26
|
On Nov 21, 2008, at 9:25 AM, Amitha Perera wrote:
> Matthew Leotta wrote:
>> I'll take that to mean that you haven't decided on any particular
>> version for v3p. When you start up on this again, do you plan to
>> grab the current release of ffmpeg to import and make it work with
>> vidl2 or do you plan to take a version that currently works with
>> vidl2 and move it into v3p?
>
> I plan to work on the latest version of ffmpeg.

Alright, I will hold off on trying to learn the new ffmpeg API for now. I
still need to focus on graduating anyway.

--Matt
|
From: Antonio G. C. <A.G...@de...> - 2008-11-24 08:33:18
|
Amitha Perera wrote:
>
> I plan to work on the latest version of ffmpeg.
>
> Amitha.

Hi Amitha,

I was thinking that my version was not very old and that the current vxl
version could be easily adapted. My installed version (included in the
recent Mandriva 2009) is:

  FFmpeg version SVN-r14161, Copyright (c) 2000-2008 Fabrice Bellard, et al.
  configuration: --prefix=/usr --enable-shared --libdir=/usr/lib
    --shlibdir=/usr/lib --incdir=/usr/include --enable-liba52
    --enable-postproc --enable-gpl --enable-pthreads --enable-libnut
    --enable-libtheora --enable-libvorbis --enable-x11grab --enable-swscale
  libavutil version: 49.7.0
  libavcodec version: 51.60.0
  libavformat version: 52.17.0
  libavdevice version: 52.0.0
  built on Jul 11 2008 01:52:29, gcc: 4.3.1 20080626 (prerelease)
  FFmpeg      SVN-r14161
  libavutil   3213056
  libavcodec  3357696
  libavformat 3412224
  libavdevice 3407872

But, as indicated, this is an old one compared to the SVN version. I do not
know much about ffmpeg, but I am interested in it because I found the
library libavdevice, with files like dv1394, v4l, and v4l2. As you know, I
have recently added v4l2 support to vidl2, but could it be handled with
ffmpeg as well? The latest SVN version could be much more interesting.

Matthew Leotta wrote:
>
> Alright, I will hold off on trying to learn the new ffmpeg API for
> now. I still need to focus on graduating anyway.
>
> --Matt

Ok Matt. My e-mail was sent only to find out the current status of this
development, because I suppose this work is worth doing before vidl2 moves
to core and/or before further contributions on v4l2 (although it is
practically finished).

Antonio.
|
From: Amitha P. <ami...@us...> - 2008-11-24 14:10:17
|
Antonio Garrido Carrillo wrote:
> As you know, I have recently added v4l2 support to vidl2, but could it
> be solved with ffmpeg as well?

Direct support for v4l2 and so on in vxl is still useful, regardless of
whether ffmpeg supports it, because it allows these devices to be used even
when ffmpeg is not available or is undesirable. (ffmpeg is licensed under
the LGPL or GPL.)

Amitha.
|