In image classification, each image is clearly one sample, but what counts as a sample in the TensorFlow Object Detection API? Is a sample an image, or an individual object within an image?
If an image contains 10 objects, should I count that as 10 training samples or as a single one?
I need to know this to set the batch size and the number of training steps effectively.
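
To make the question concrete, here is a minimal sketch of the arithmetic I am trying to do. The dataset size and batch size are made-up numbers, and `steps_per_epoch` is just a helper I wrote for illustration, not part of the API. The two interpretations give very different step counts:

```python
import math


def steps_per_epoch(num_samples: int, batch_size: int) -> int:
    """Number of steps needed to see every sample once (last batch may be partial)."""
    return math.ceil(num_samples / batch_size)


# Hypothetical numbers, chosen only to illustrate the question.
num_images = 1000        # images in the training set
objects_per_image = 10   # as in the example above
batch_size = 8           # value I would put in the training config

# Interpretation A: one sample = one image.
steps_if_images = steps_per_epoch(num_images, batch_size)

# Interpretation B: one sample = one annotated object.
steps_if_objects = steps_per_epoch(num_images * objects_per_image, batch_size)

print("steps per epoch if samples are images: ", steps_if_images)
print("steps per epoch if samples are objects:", steps_if_objects)
```

Depending on which interpretation is correct, the number of steps I should configure for the same number of epochs differs by a factor of 10.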