Oliver Kaus
Creator of this blog.
Nov 9, 2021

🎾🤖 Tennalytix (2/5): Track Pixel Positions Through Deep Learning

Introduction

The goal of this blog post is to describe how I tracked the player and tennis ball pixel locations using deep learning models and how I set this up to run for multiple tennis videos.

Player & Ball Tracking

Object detection is a well-studied deep learning problem. To avoid reinventing the wheel, I used Detectron2, a pre-trained object detection model developed by Facebook that was state of the art at the time of writing. Similarly, I found a deep learning model that performed well on ball tracking data and adjusted it to my needs. My laptop had no GPU, so I moved the predictions to Google Colab to benefit from the faster GPU inference. The process to extract player and ball tracking data looked as follows:

I uploaded tennis mp4 videos into Google Drive
I cloned my GitHub repo into Google Colab
The GitHub code pulled all required videos from Google Drive into Google Colab, split them into single videos and set them up for model predictions
Detectron2 and the ball prediction model detected object and ball pixel locations frame by frame
The results were processed; for Detectron2, this meant filtering out non-human objects
The cleaned pixel predictions from both models were stored in JSON files, which included additional image metadata, and uploaded to Google Drive
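The filtering and storage steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names, the JSON field names, the 0.5 score threshold and the example detections are all assumptions. It works on plain Python lists of boxes, class ids and scores, which is the kind of data you get after converting a detector's per-frame output, and it assumes that class id 0 means "person", as in the COCO label set used by Detectron2's pre-trained COCO models.

```python
import json
import os
import tempfile

# Assumption: index 0 is "person" in the COCO label set used by
# Detectron2's pre-trained COCO models.
PERSON_CLASS_ID = 0


def filter_people(boxes, classes, scores, min_score=0.5):
    """Keep only detections labelled as a person with a high enough score.

    min_score is a hypothetical threshold chosen for this sketch.
    """
    people = []
    for box, cls, score in zip(boxes, classes, scores):
        if int(cls) == PERSON_CLASS_ID and float(score) >= min_score:
            people.append({"box": [float(v) for v in box], "score": float(score)})
    return people


def save_frame_predictions(path, frame_index, width, height, people):
    """Write the cleaned predictions plus basic image metadata to a JSON file."""
    record = {
        "frame": frame_index,      # hypothetical field names
        "image_width": width,
        "image_height": height,
        "players": people,
    }
    with open(path, "w") as f:
        json.dump(record, f, indent=2)
    return record


# Made-up detections for one frame: two people and one non-person object.
# Boxes are (x1, y1, x2, y2) in pixels; class 32 stands in for any non-person label.
boxes = [
    [100.0, 200.0, 180.0, 420.0],
    [600.0, 50.0, 640.0, 150.0],
    [300.0, 300.0, 310.0, 310.0],
]
classes = [0, 0, 32]
scores = [0.98, 0.91, 0.75]

people = filter_people(boxes, classes, scores)
out_path = os.path.join(tempfile.gettempdir(), "frame_000001.json")
record = save_frame_predictions(out_path, 1, 1280, 720, people)
```

Writing one small JSON record per frame keeps the files easy to inspect and to copy to Google Drive afterwards; a single file per video would work just as well.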

Below you can see a visualisation of the Detectron2 model predictions, which I also used as the cover image for this blog post:

[Image: Detectron2 model predictions]

When the player predictions are filtered down to the two competing players and their geometric centers are visualised together with the ball predictions, it looks like this:

[Image: player centers and ball predictions]
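As a small illustration of the geometric center mentioned above: assuming each player prediction is a bounding box given as (x1, y1, x2, y2) pixel corners, the center is simply the midpoint of the box. The function name and the example box are hypothetical, and the step that selects the two competing players is not shown here.

```python
def box_center(box):
    """Return the (x, y) midpoint of a box given as (x1, y1, x2, y2) pixel corners."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2.0, (y1 + y2) / 2.0)


# Made-up player box in pixel coordinates.
example_box = [100.0, 200.0, 180.0, 420.0]
center = box_center(example_box)
print(center)
```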

Conclusion

At the end of this process, all JSON tracking files were stored in Google Drive.
