METHODS OF DETERMINING THE THRESHOLD VALUE OF IMAGE AS A WAY TO IMPROVE AUGMENTED REALITY
DOI:
https://doi.org/10.20535/kpisn.2020.4.226964Keywords:
amidation, image threshold, specialized images, document imagesAbstract
Background. Augmented reality technology is at the initial stage of its development and requires further development of new and improvement of existing methods and algorithms. The most common and reliable approach to the implementation of augmented reality is the marker approach. The definition of the marker is divided into several stages, one of the most important of which is to determine the threshold of the image. Properly chosen algorithm for determining this threshold can significantly increase the efficiency of the algorithm for finding the marker.
Objective. The aim of the study is a comparative analysis of threshold algorithms based on the type of information used and evaluation of their effectiveness based on a set of objective indicators of segmentation quality. Comparison is made for images and documents. For an objective comparison of efficiency, we used a combination of 4 quality criteria for form segmentation.
Methods. The following performance criteria were used to determine the performance of the threshold determination methods: classification error, boundary determination error, relative foreground error, and non-uniformity region. Two approaches were considered to obtain an average performance score according to the previous criteria. The first approach was the arithmetic averaging of normalized scores obtained by all criteria. Using a specific threshold algorithm for the image, the average value of all criteria was determined, which was an indicator of its segmentation quality. In turn, the sum of these quality indicators for all images was determined as the performance indicator of the algorithm. The second approach used rank averaging, so that for each test image there was a ranking of algorithms of the threshold value from 1 to 16 according to each criterion separately. Then the ranks (not the actual scores) were averaged both by images and by all criteria.
Results. A general unified comparative analysis of the performance of 16 algorithms for determining the threshold value of the image was conducted, which makes it possible to improve the functioning of technologies that depend on determining the threshold of images, including the mechanism of augmented reality marker recognition.
Conclusions. It has been observed that the clustering-based method, namely the Kittler method, and the entropy-based method, namely the Kapoor method, are the best algorithms for determining thresholds for specialized images. Similarly, the Kittler clustering method and the Sauwol and White local variable methods are the best working algorithms for binarization of text documents.
References
H. Huang et al., “Brain image segmentation based on FCM clustering algorithm and rough set”, IEEE Access, vol. 7, pp. 12386–12396, 2019. doi: 10.1109/ACCESS.2019.2893063
M.A. El Aziz et al., “Multi-objective whale optimization algorithm for multilevel thresholding segmentation”, Advances in Soft Computing and Machine Learning in Image Processing, vol. 730, pp. 23–39, 2018. doi: 10.1007/978-3-319-63754-9_2
R. Ratnakumar and S.J. Nanda, “A low complexity hardware architecture of K-means algorithm for real-time satellite image segmentation”, Multimed. Tools Appl., vol. 78, no. 9, pp. 11949–11981, 2019. doi: 10.1007/s11042-018-6726-6
S. Srivastava et al., “Optical character recognition on bank cheques using 2D convolution neural network”, Advances in Intelligent Systems and Computing, pp. 589–596, 2019. doi: 10.1007/978-981-13-1822-1_55
A. Kumar et al., “Enhanced identification of malarial infected objects using Otsu algorithm from thin smear digital images”, Int. J. Lat. Research Sci. Technol., vol. 1, no. 2, pp. 159–163, 2012.
U. Jamil et al., “Melanoma segmentation using bio-medical image analysis for smarter mobile healthcare”, J. Ambient Intell. Human. Comput., vol. 10, no. 10, pp. 4099–4120, 2019. doi: 10.1007/s12652-019-01218-0
C. Liu et al., “Lung segmentation based on random forest and multi-scale edge detection”, IET Image Process., vol. 13, no. 10, pp. 1745–1754, 2019. doi: 10.1049/iet-ipr.2019.0130
M.A. Elaziz, et al., “Multi-level thresholding-based grey scale image segmentation using multi-objective multi-verse optimizer”, Exp. Syst. Applicat., vol. 125, pp. 112–129, 2019. doi: 10.1016/j.eswa.2019.01.047
X.-S. Yang, “Cuckoo search and firefly algorithm: Overview and analysis”, Cuckoo Search and Firefly Algorithm, vol. 516, pp. 1–26, 2014. doi:10.1007/978-3-319-02141-6_1
M. Dorigo and T. Stützle, “Ant colony optimization: Overview and recent advances”, in Handbook of Metaheuristics. International Series in Operations Research & Management Science, vol. 146. Boston: Springer, 2010, pp. 311–351. doi: 10.1007/978-1-4419-1665-5_8
Downloads
Published
Issue
Section
License
Copyright (c) 2020 Oleksandr S. Bezpalko
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under CC BY 4.0 that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work