TY - JOUR
T1 - Diacritic segmentation technique for Arabic handwritten using region-based
AU - Sheikh, Ahmed Abdalla
AU - Azmi, Mohd Sanusi
AU - Aziz, Maslita Abd
AU - Al-Mhiqani, Mohammed Nasser
AU - Bafjaish, Salem Saleh
N1 - Funding Information:
The authors thank the Ministry of Education for funding this study through the following grants: FRGS/1/2017/ICT02/FTMK-CACT/F00345. Gratitude is also due to Universiti Teknikal Malaysia Melaka and Faculty of Information Technology and Communication for providing excellent research facilities.
Publisher Copyright:
Copyright © 2020 Institute of Advanced Engineering and Science. All rights reserved.
PY - 2020/4/1
Y1 - 2020/4/1
N2 - Arabic is a broadly utilized alphabetic composition framework on the planet, and it has 28 essential letters. The letters in order was first used to compose messages in Arabic, most prominently the Qur'an the holy book of Islam. However, Arabic language has diacritics in the word or letters which are not something extra or discretionary to the language, rather they are a vital piece of it. By changing some diacritics may change both the syntax and semantics of a word by turning a word into another. However, the current researches address the foreground image and consider the diacritics as noises or secondary images. Thus, it is not suitable for Arabic handwritten. The diacritics will be removed from the image and this will lead to losing some good features. Furthermore, to extract the diacritics, the region-based segmentation technique is used. The image will be measured based on the region properties by first finding the connected component in binary image, and then we will determine the best area range measurement in that region for each image. The proposed technique region based has been tested in nine different images with different handwritten style, and successfully extracted secondary foreground images (diacritics) for each image.
AB - Arabic is a broadly utilized alphabetic composition framework on the planet, and it has 28 essential letters. The letters in order was first used to compose messages in Arabic, most prominently the Qur'an the holy book of Islam. However, Arabic language has diacritics in the word or letters which are not something extra or discretionary to the language, rather they are a vital piece of it. By changing some diacritics may change both the syntax and semantics of a word by turning a word into another. However, the current researches address the foreground image and consider the diacritics as noises or secondary images. Thus, it is not suitable for Arabic handwritten. The diacritics will be removed from the image and this will lead to losing some good features. Furthermore, to extract the diacritics, the region-based segmentation technique is used. The image will be measured based on the region properties by first finding the connected component in binary image, and then we will determine the best area range measurement in that region for each image. The proposed technique region based has been tested in nine different images with different handwritten style, and successfully extracted secondary foreground images (diacritics) for each image.
KW - Arabic diacritics
KW - Arabic handwritten
KW - Diacritics segmentation
KW - Region-Based
KW - Segmentation
UR - http://www.scopus.com/inward/record.url?scp=85075558751&partnerID=8YFLogxK
U2 - 10.11591/ijeecs.v18.i1.pp478-484
DO - 10.11591/ijeecs.v18.i1.pp478-484
M3 - Article
AN - SCOPUS:85075558751
VL - 18
SP - 478
EP - 484
JO - Indonesian Journal of Electrical Engineering and Computer Science
JF - Indonesian Journal of Electrical Engineering and Computer Science
SN - 2502-4752
IS - 1
ER -