Design and development of a data augmentation tool to support the training of neural networksArtificial Intelligence
Prof. Del Riccio
The purpose of this tool is to create a reliable dataset, in order to improve the training of a machine learning model, improving the accuracy of the predictions. To do this we chose to operate on two fundamental characteristics: quality and quantity. The quality allows building a dataset, containing the most representative samples in terms of information, applying the concept of entropy on the amount of information contained in a certain data. The second characteristic, the quantity, allows building a dataset containing a greater number of samples, in order to provide more examples for learning. In this way, we provide samples that simulate possible scenarios found in the application contexts. For this, it has been chosen to apply the data augmentation.
Design and development of a data augmentation tool to support neural network training, with Keras, OpenCv, SIFT, Canny
The purpose of the thesis is to develop a tool whose main objective is to build a qualitatively and quantitatively reliable dataset. In that sense, it was decided to operate by applying the data augmentation operations, in order to increase the samples present in the dataset, and by applying filtering operations, using entropy, in order to make the dataset highly informative. By doing so, the training of the neural network is improved and consequently, its results are improved.
The tool allows building a qualitative and quantitative dataset of images, through the functions implemented for data augmentation, in order to improve neural network training and its performance.
Automation of data labeling and image segmentation through CNN or clustering