We’re in the process of developing a web application that displays an MJPEG stream. Our goal is to identify specific objects in the video in real-time and perform pose estimation. Ideally, we’d like it to function in a way similar to fiduciary markers, but instead, leverage a machine learning algorithm to enable markerless tracking.
Could anyone share insights on whether this is feasible using TensorFlow.js?
Yes, markerless tracking and pose estimation with a single camera in a web browser using TensorFlow.js is feasible. Use models like PoseNet or MoveNet for pose estimation and Coco SSD for object detection. Implement by:
Including TensorFlow.js in your project.
Loading the appropriate model.
Accessing the camera with getUserMedia.
Processing video frames through the model.
Rendering results on a canvas over the video stream.
These models are efficient and can run in real-time directly in the browser.