Statkevych R. Method of image segmentation using deep neural networks

Українська версія

Thesis for the degree of Doctor of Philosophy (PhD)

State registration number

0824U001840

Thesis Registration Form

0824U001840.pdf

Applicant for

Roman V. Statkevych

Specialization

121 - Інженерія програмного забезпечення

Specialized Academic Board

ДФ 26.002.165; ID 5594

National Technscal University of Ukraine "Kiev Polytechnic Institute".

Essay

This work is dedicated to the development and improvement of the neural networks in a context of semantic segmentation, based on U-Net architecture, which allows for improved overall evaluation and performance metrics, compared to the baseline U-Net model. The topic of the dissertation was agreed upon by the Department of Computer Engineering of the National Technical University of Ukraine “Ihor Sikorskiy Kyiv Polytechnic Institute”, in accordance with Cabinet of Ministers Order №1556-p on the concept of development of Artificial Intelligence in Ukraine. The goal of the dissertation was an improvement of existing semantic image segmentation methods, that allow to improve results and effectiveness of neural networks. To achieve the stated goal, the next tasks and problems were solved: - Existing semantic image segmentation methods and neural network architectures. - The family of U-Net neural network architectures was studied in detail. - As part of the proposed method, expansion rate variation and depthwise separable skip connections were proposed as modifications to the baseline U-Net architecture. - Many experiments were conducted on different datasets, using different approaches and proposed novel improvements. To confirm the quality nature of the change, K-fold cross-validation was performed; - The benchmark was conducted for the proposed improvements and modifications to get inference time and memory usage metrics. The proposed expansion rate variation method allows the developer to regulate the number of parameters for the U-Net network, enabling deeper networks with fewer parameters. Using this approach, it is possible to optimize the size and the performance of the model and get the same evaluation results as a baseline model, with 2.5 fewer parameters. Another proposal is using Depthwise Separable Skip Connections, based on Depthwise Separable Convolutions. This modification allowed to improve the results of the model with a relatively small size increase for a model. It is also possible to improve other U-Net-like models, as demonstrated in the Attention-UNet model. For different datasets, at least one of the suggested modifications managed to improve a result of the baseline model, with this improvement measured between 1-5%. It was also shown on the K-fold cross-validation, that these modifications could steadily outperform the baseline model. In some cases, this improvement was achieved by a model with just a 1% increase in a number of parameters, which proves this is a quality improvement. Utilizing the proposed modifications, a method for image segmentation using modified U-Net architectures was developed with Python programming language and TensorFlow library, to prove the feasibility of these modifications. Experiments were conducted in different knowledge domains, such as Computer Assisted Diagnostics and road environment analysis. Also, these methods were used for analyzing both 2-dimensional images and 3-dimensional volumes, which proves the practicality of using these methods. For the experiments, well-known state-of-the-art datasets were used, such as the University of Wisconsin Gastrointestinal (UWGIT), Cityscapes, Synapse, and Brain Tumor Segmentation (BraTS). It was also demonstrated that the proposed methods may match, and sometimes outperform some of the State-Of-The-Art models, aside from baseline U-Net. Besides that, some performance and productivity metrics were measured for the proposed network architectures. It was noted, that deeper networks with an expansion rate approach may have better inference times than a baseline model and match its quality. The suggested methods have a big practical value and a wide range of applications in the field of image analysis, which was proven during the conducted experiments.

Thesis supervisor

Gordienko Yuri

Official opponents

Andrii M. Hlybovets
Nataliia B. Shakhovska

Reviewers

Oleksiy О. Pysarchuk
Mykhailo A. Novotarskyi

Research papers

Statkevych, R., Gordienko, Y., Stirenko, S. (2022). Improving U-Net Kidney Glomerulus Segmentation with Fine-Tuning, Dataset Randomization and Augmentations. In: Advances in Computer Science for Engineering and Education. Lecture Notes on Data Engineering and Communications Technologies, vol 134. ISSN 2367-4520 (electronic) | Springer, Cham. https://doi.org/10.1007/978-3-031-04812-8_42

Statkevych, R., Gordienko, Y., Stirenko, S. (2023). Expansion Rate Parametrization and K-Fold Based Inference with U-Net Neural Networks for Multiclass Medical Image Segmentation. In: Artificial Intelligence and Soft Computing. Lecture Notes in Computer Science(), vol 14125. ISSN 0302-9743, Springer, Cham. https://doi.org/10.1007/978-3-031-42505-9_22

Statkevych R, Stirenko S, Gordienko Y. Human kidney tissue image segmentation by U-Net models. | IEEE Xplore digital library. https://doi.org/10.1109/EUROCON52738.2021.9535599

Statkevych R, Gordienko Y, Stirenko S. Improving Pedestrian Detection Methods by Architecture and Hyperparameter Modification of Deep Neural Networks. In Advances in Artificial Systems for Logistics Engineering 2021 (pp. 44-53). ISSN 2367-4512, Springer International Publishing. https://doi.org/10.1007/978-3-030-80475-6_5

Files

autoreferat-Статкевич Аннотація.docx

editrequest-Лист до УкрІНТЕІ_PhD_Статкевич Р-В.pdf

Дисертація_СтаткевичРВ.pdf

Similar theses

0824U002664

Krykun Valentyn А.

Method and software tools for interpreting machine learning models of nonlinear dynamic objects

0824U002621

Vadym ZABCHUK

2. Methods and software tools for modeling processes in biogas plants based on interval data analysis

0824U002601

Oleksandra D. Syzonenko

Development of technology and software tools for detecting and recognizing objects in real time

0824U002554

Bohdan V. Molodets

Development of technologies and software for operational air quality monitoring

0824U002508

Vladyslav Antoniuk

Development of a cross-platform system for automating processes and interacting with computerized devices