How to mange image data for Deep Learning?

ai2ys · February 17, 2022, 12:43pm

I would like to learn what are common best practice ways to manage image data for Deep Learning.

Currently my scenario is the following:

Retrieving different collections of images
Labeling the image using a labeling tool like CVAT
Question: how to manage the data after the labeling step?

What is then the best way to manage the data from different sources that has been labeled? Do you store it to a database or export it just to the file system? How do you manage the data after it has been labeled?

What are your workflows for retrieving, labeling and managing image data?

Aniket_Dubey · September 20, 2024, 12:26pm

Hi @ai2ys ,

After labeling images with tools like CVAT, store data on file systems for smaller datasets or databases for larger ones. Organize images in structured directories with metadata files. Use data versioning tools like DVC to track changes. Convert labels to standard formats (COCO, PASCAL VOC, YOLO) for compatibility.

Create preprocessing pipelines for data loading and augmentation. Implement regular backups and consider cloud storage for collaboration.

Workflow involves:
1. Image retrieval :collect images from various sources Store raw images in a structured file system or cloud storage .
2.Labeling: CVAT
3.Post-labeling management: (storage, versioning, validation)
4.Preprocessing and training : TensorFlow
5.Continuous improvement :updating dataset
Hope it helps ,

Thank You .

Topic		Replies	Views
How to load big image segmentation dataset General Discussion datasets , help_request	3	1470	December 2, 2022
Recommended way to save/load data to/from disk to tf.data.Dataset General Discussion tfdata	7	4367	July 19, 2023
Image preparation for prediction of a continuous variable General Discussion help_request	2	765	October 21, 2021
Creating datasets from scratch tutorial needed General Discussion datasets	1	197	August 9, 2023
Annotation Methods for Images General Discussion models , object-detection	3	601	December 26, 2023

How to mange image data for Deep Learning?

Related topics