OpenIMAJ / Discussion / General Discussion: Machine learning and video tracking

First, thanks a lot to the developers of this awesome program.

I'd like to identify the following patterns in all the frames of the video of a tennis game: the Players (2), the Ball being played(1), the Net, the Court and the Background. These patterns can overlap (players can be on the court, ball can be in the background).

In order to track the ball first, I've tried a FournierTemplateMatcher (see code below), but the results are not accurate as seen below, the ball is mistaken for other elements.

I've also tried a SIFT with similar results. The ball gets mistaken for other elements.

Given the difficulty of recognizing these different moving elements that are moving and rotating in 3d, I believe that my best chance of success is to use the machine learning in a similar fashion as caltech 101: giving the algorithm enough images of players, balls and courts to let it recognize them all separately. Problem is, I don't know how to correctly annotate the images, or how to parametrize the algorithm. Sorry for not having more specific questions, but

how should I start? where can I find documentation? Any help would be much appreciated.
Would another possibility be to use the FelzenszwalbHuttenlocherSegmenter? This,at least, sees the ball as a separate object. see here

Thanks again, Cheers

Gauthier

query image: here
short video stream: https://youtu.be/zWz-f5jW_gY

***Fournier Template Matcher code:

public class templatevid {
    public static void main( String[] args ) throws IOException {
        MBFImage q = ImageUtilities.readMBF(new File("D:/down/tennis/TENNISBALLV4.jpg"));
        Video<MBFImage> video;
        video = new XuggleVideo(new File("D:/down/tennis/video/feddjoko.mkv"));

        for (MBFImage mbfImage : video) {       
        MBFImage target = mbfImage;
        MBFImage query = q.clone();
        FourierTemplateMatcher tm = new FourierTemplateMatcher(query.flatten(), FourierTemplateMatcher.Mode.CORRELATION_COEFFICIENT ) ;
        tm.analyseImage(target.flatten());
        FValuePixel best1 = tm.getBestResponses(3)[0] ;
        FValuePixel best2 = tm.getBestResponses(3)[1] ;
        FValuePixel best3 = tm.getBestResponses(3)[2] ;
        target.drawShape(new Ellipse(best1.getX(), best1.getY(), 10f, 10f, 0), RGBColour.RED);
        target.drawShape(new Ellipse(best2.getX(), best2.getY(), 10f, 10f, 0), RGBColour.RED);
        target.drawShape(new Ellipse(best3.getX(), best3.getY(), 10f, 10f, 0), RGBColour.RED);
        DisplayUtilities.display(target,"result");
        System.out.println(best1.getX()+" "+best1.getY()+"\t"+best2.getX()+" "+best2.getY()+"\t"+best3.getX()+" "+best3.getY());
        DisplayUtilities.display(query,"query");
    }
    }

*** SIFT CODE

    public class App {
public static void main( String[] args ) throws IOException {
    Video<MBFImage> video;
    video = new XuggleVideo(new File("D:/down/tennis/video/feddjoko.mkv"));
    DoGSIFTEngine engine = new DoGSIFTEngine();
    for (MBFImage mbfImage : video) {

        MBFImage query = ImageUtilities.readMBF(new File("D:/down/tennis/TENNISBALLV4.jpg"));
        MBFImage target = mbfImage;

                    LocalFeatureList<Keypoint> queryKeypointsc3_2 = engine.findFeatures(query.flatten());
                    LocalFeatureList<Keypoint> targetKeypointsc3_2 = engine.findFeatures(target.flatten());
                    LocalFeatureMatcher<Keypoint> matcherc3_2 = new BasicTwoWayMatcher<Keypoint>();
                    matcherc3_2.setModelFeatures(queryKeypointsc3_2);
                    matcherc3_2.findMatches(targetKeypointsc3_2);
                    RobustHomographyEstimator modelFitterc3_2 = new RobustHomographyEstimator(1,50,new RANSAC.NumberInliersStoppingCondition(1), HomographyRefinement.NONE);
                    matcherc3_2 = new ConsistentLocalFeatureMatcher2d<Keypoint>(new BasicTwoWayMatcher<Keypoint>(), modelFitterc3_2);
                    matcherc3_2.setModelFeatures(queryKeypointsc3_2);
                    matcherc3_2.findMatches(targetKeypointsc3_2);


                    MBFImage consistentMatches3_2 = MatchingUtilities.drawMatches(query, target, matcherc3_2.getMatches(),RGBColour.WHITE);
                    DisplayUtilities.displayName(consistentMatches3_2,"RANSAC");

        }

FHS CODE

public class App {
    public static void main( String[] args ) throws IOException {
        MBFImage input = ImageUtilities.readMBF(new File("D:/down/tennis/capture.jpg"));
        MBFImage clone = input.clone();
        FelzenszwalbHuttenlocherSegmenter test = new FelzenszwalbHuttenlocherSegmenter(2f,5f,100);
        List<ConnectedComponent> comp = test.segment(clone);
        SegmentationUtilities.renderSegments(clone, comp);
        MBFImage clone2 = ColourSpace.convert(clone, ColourSpace.RGB);
        DisplayUtilities.display(clone2);
        ImageUtilities.write(clone, new File("D:/down/tennis/export-FHS.jpg"));
    }
}

First, thanks a lot to the developers of this awesome program.

In order to track the ball first, I've tried a FournierTemplateMatcher (see code below), but the results are not accurate as seen below, the ball is mistaken for other elements.
![](http://s18.postimg.org/af1iqbdgp/BOARD.jpg)

I've also tried a SIFT with similar results. The ball gets mistaken for other elements.
 
![](http://s8.postimg.org/r20q7mk9h/boardsift.jpg)

1. how should I start? where can I find documentation? Any help would be much appreciated.

2. Would another possibility be to use the FelzenszwalbHuttenlocherSegmenter? This,at least, sees the ball as a separate object. see [here](http://http://imgur.com/Ae0aEUB)

Thanks again, Cheers

Gauthier

query image: [here](http://imgur.com/RsOcu0M)
short video stream: https://youtu.be/zWz-f5jW_gY

***Fournier Template Matcher code:
~~~~
public class templatevid {
    public static void main( String[] args ) throws IOException {
    	MBFImage q = ImageUtilities.readMBF(new File("D:/down/tennis/TENNISBALLV4.jpg"));
    	Video<MBFImage> video;
    	video = new XuggleVideo(new File("D:/down/tennis/video/feddjoko.mkv"));
    	
    	for (MBFImage mbfImage : video) {  		
    	MBFImage target = mbfImage;
    	MBFImage query = q.clone();
    	FourierTemplateMatcher tm = new FourierTemplateMatcher(query.flatten(), FourierTemplateMatcher.Mode.CORRELATION_COEFFICIENT ) ;
    	tm.analyseImage(target.flatten());
    	FValuePixel best1 = tm.getBestResponses(3)[0] ;
    	FValuePixel best2 = tm.getBestResponses(3)[1] ;
    	FValuePixel best3 = tm.getBestResponses(3)[2] ;
    	target.drawShape(new Ellipse(best1.getX(), best1.getY(), 10f, 10f, 0), RGBColour.RED);
    	target.drawShape(new Ellipse(best2.getX(), best2.getY(), 10f, 10f, 0), RGBColour.RED);
    	target.drawShape(new Ellipse(best3.getX(), best3.getY(), 10f, 10f, 0), RGBColour.RED);
    	DisplayUtilities.display(target,"result");
    	System.out.println(best1.getX()+" "+best1.getY()+"\t"+best2.getX()+" "+best2.getY()+"\t"+best3.getX()+" "+best3.getY());
    	DisplayUtilities.display(query,"query");
    }
    }
~~~~

*** SIFT CODE 
    	
    	
        public class App {
    public static void main( String[] args ) throws IOException {
        Video<MBFImage> video;
    	video = new XuggleVideo(new File("D:/down/tennis/video/feddjoko.mkv"));
    	DoGSIFTEngine engine = new DoGSIFTEngine();
    	for (MBFImage mbfImage : video) {
		
    		MBFImage query = ImageUtilities.readMBF(new File("D:/down/tennis/TENNISBALLV4.jpg"));
        	MBFImage target = mbfImage;

LocalFeatureList<Keypoint> queryKeypointsc3_2 = engine.findFeatures(query.flatten());
        	        	LocalFeatureList<Keypoint> targetKeypointsc3_2 = engine.findFeatures(target.flatten());
        	    		LocalFeatureMatcher<Keypoint> matcherc3_2 = new BasicTwoWayMatcher<Keypoint>();
        	    		matcherc3_2.setModelFeatures(queryKeypointsc3_2);
        	        	matcherc3_2.findMatches(targetKeypointsc3_2);
        	        	RobustHomographyEstimator modelFitterc3_2 = new RobustHomographyEstimator(1,50,new RANSAC.NumberInliersStoppingCondition(1), HomographyRefinement.NONE);
        	    		matcherc3_2 = new ConsistentLocalFeatureMatcher2d<Keypoint>(new BasicTwoWayMatcher<Keypoint>(), modelFitterc3_2);
        	    		matcherc3_2.setModelFeatures(queryKeypointsc3_2);
        	    		matcherc3_2.findMatches(targetKeypointsc3_2);

MBFImage consistentMatches3_2 = MatchingUtilities.drawMatches(query, target, matcherc3_2.getMatches(),RGBColour.WHITE);
        	        	DisplayUtilities.displayName(consistentMatches3_2,"RANSAC");
        	
    		}
           
            
            
            
FHS CODE
~~~~
public class App {
    public static void main( String[] args ) throws IOException {
    	MBFImage input = ImageUtilities.readMBF(new File("D:/down/tennis/capture.jpg"));
    	MBFImage clone = input.clone();
    	FelzenszwalbHuttenlocherSegmenter test = new FelzenszwalbHuttenlocherSegmenter(2f,5f,100);
    	List<ConnectedComponent> comp = test.segment(clone);
    	SegmentationUtilities.renderSegments(clone, comp);
    	MBFImage clone2 = ColourSpace.convert(clone, ColourSpace.RGB);
    	DisplayUtilities.display(clone2);
    	ImageUtilities.write(clone, new File("D:/down/tennis/export-FHS.jpg"));
    }
}
~~~~

Add attachments
Cancel

You seem to have CSS turned off. Please don't fill out this field.

Machine learning and video tracking

Forums

Help

Machine learning and video tracking

Machine learning and video tracking

Forums

Help

Machine learning and video tracking document.SUBSCRIPTION_OPTIONS = { "thing": "topic", "subscribed": false, "url": "subscribe", "icon": { "css": "fa fa-envelope-o" } };

Machine learning and video tracking