First, thanks a lot to the developers of this awesome program.
I'd like to identify the following patterns in every frame of a video of a tennis match: the players (2), the ball in play (1), the net, the court and the background. These patterns can overlap (players can be on the court, the ball can be in front of the background).
To track the ball first, I've tried a FourierTemplateMatcher (see code below), but the results are not accurate: as seen below, the ball is mistaken for other elements.
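To make the matching step concrete, here is a minimal, library-independent sketch of template matching by normalized cross-correlation in plain Python. It is not the OpenIMAJ code and not necessarily the exact score the matcher above computes; the function name `ncc_match` and the tiny `frame` and `ball` arrays are made up for illustration. Every placement of the template is scored against the image patch under it, and the highest score wins.

```python
import math


def ncc_match(image, template):
    """Return (best_score, (row, col)) for the best placement of template.

    image and template are 2D lists of grayscale values. The score is the
    normalized cross-correlation, which lies in [-1, 1].
    """
    th, tw = len(template), len(template[0])
    ih, iw = len(image), len(image[0])

    # Zero-mean template and its norm, computed once.
    t_vals = [v for row in template for v in row]
    t_mean = sum(t_vals) / len(t_vals)
    t_dev = [v - t_mean for v in t_vals]
    t_norm = math.sqrt(sum(d * d for d in t_dev))

    best_score, best_pos = -2.0, None
    for r in range(ih - th + 1):
        for c in range(iw - tw + 1):
            # Zero-mean window under the template at offset (r, c).
            w_vals = [image[r + i][c + j] for i in range(th) for j in range(tw)]
            w_mean = sum(w_vals) / len(w_vals)
            w_dev = [v - w_mean for v in w_vals]
            w_norm = math.sqrt(sum(d * d for d in w_dev))
            if t_norm == 0 or w_norm == 0:
                continue  # flat patch: correlation is undefined, skip it
            score = sum(a * b for a, b in zip(t_dev, w_dev)) / (t_norm * w_norm)
            if score > best_score:
                best_score, best_pos = score, (r, c)
    return best_score, best_pos


# Hypothetical toy data: a dark 6x6 frame with one bright pixel at (3, 4).
frame = [[0] * 6 for _ in range(6)]
frame[3][4] = 9
ball = [[0, 0, 0],
        [0, 9, 0],
        [0, 0, 0]]

score, position = ncc_match(frame, ball)
```

Because the score is normalized, any patch with the same shape as the template scores highly regardless of its brightness, which is one plausible reason small bright blobs elsewhere in a frame can be confused with the ball.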
I've also tried SIFT, with similar results: the ball gets mistaken for other elements.
Given the difficulty of recognizing these different elements, which are moving and rotating in 3D, I believe that my best chance of success is to use machine learning in a similar fashion to Caltech 101: giving the algorithm enough images of players, balls and courts to let it recognize each of them separately. The problem is, I don't know how to correctly annotate the images or how to parametrize the algorithm. Sorry for not having more specific questions, but how should I start? Where can I find documentation? Any help would be much appreciated.
Would another possibility be to use the FelzenszwalbHuttenlocherSegmenter? This, at least, sees the ball as a separate object (see here).
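For readers unfamiliar with the approach, the core of Felzenszwalb-Huttenlocher graph-based segmentation can be sketched in plain Python as below. This is a simplified illustration, not OpenIMAJ's implementation: it works on a grayscale grid with 4-neighbour edges and leaves out the Gaussian pre-smoothing and the small-component cleanup of the full method. The function name `fh_segment`, the toy `court` image and the value of `k` are assumptions chosen for the example.

```python
def fh_segment(image, k):
    """Label each pixel of a 2D grayscale list with a segment id.

    Two regions are merged when the edge joining them is no heavier than
    either region's internal difference plus k / region_size.
    """
    h, w = len(image), len(image[0])
    parent = list(range(h * w))   # union-find forest, one node per pixel
    size = [1] * (h * w)          # pixel count per region root
    internal = [0.0] * (h * w)    # largest merged edge weight per region root

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # Edges between horizontal and vertical neighbours, weighted by the
    # absolute intensity difference.
    edges = []
    for r in range(h):
        for c in range(w):
            if c + 1 < w:
                edges.append((abs(image[r][c] - image[r][c + 1]),
                              r * w + c, r * w + c + 1))
            if r + 1 < h:
                edges.append((abs(image[r][c] - image[r + 1][c]),
                              r * w + c, (r + 1) * w + c))
    edges.sort()  # process the most similar neighbours first

    for weight, a, b in edges:
        ra, rb = find(a), find(b)
        if ra == rb:
            continue
        threshold = min(internal[ra] + k / size[ra],
                        internal[rb] + k / size[rb])
        if weight <= threshold:
            parent[rb] = ra
            size[ra] += size[rb]
            internal[ra] = weight  # edges are sorted, so this is the max so far

    return [[find(r * w + c) for c in range(w)] for r in range(h)]


# Hypothetical toy frame: uniform court (10) with a 2x2 bright "ball" (200).
court = [[10] * 5 for _ in range(5)]
for r in (1, 2):
    for c in (1, 2):
        court[r][c] = 200

labels = fh_segment(court, k=50)
segment_ids = {label for row in labels for label in row}
```

On this toy frame the bright block ends up in its own segment, which matches the observation that the segmenter isolates the ball. Segmentation alone does not say which segment is the ball, so a further step to classify segments (for example by size, colour or motion) would still be needed.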
Thanks again, Cheers
Gauthier
query image: here
short video stream: https://youtu.be/zWz-f5jW_gY
*** Fourier Template Matcher code:
*** SIFT code:
*** FHS code: