Diacritic segmentation technique for Arabic handwritten using region-based

Ahmed Abdalla Sheikh, Mohd Sanusi Azmi, Maslita Abd Aziz, Mohammed Nasser Al-Mhiqani, Salem Saleh Bafjaish

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)

Abstract

Arabic is a broadly utilized alphabetic composition framework on the planet, and it has 28 essential letters. The letters in order was first used to compose messages in Arabic, most prominently the Qur'an the holy book of Islam. However, Arabic language has diacritics in the word or letters which are not something extra or discretionary to the language, rather they are a vital piece of it. By changing some diacritics may change both the syntax and semantics of a word by turning a word into another. However, the current researches address the foreground image and consider the diacritics as noises or secondary images. Thus, it is not suitable for Arabic handwritten. The diacritics will be removed from the image and this will lead to losing some good features. Furthermore, to extract the diacritics, the region-based segmentation technique is used. The image will be measured based on the region properties by first finding the connected component in binary image, and then we will determine the best area range measurement in that region for each image. The proposed technique region based has been tested in nine different images with different handwritten style, and successfully extracted secondary foreground images (diacritics) for each image.

Original languageEnglish
Pages (from-to)478-484
Number of pages7
JournalIndonesian Journal of Electrical Engineering and Computer Science
Volume18
Issue number1
DOIs
Publication statusPublished - 1 Apr 2020
Externally publishedYes

Cite this