Vol.11, No.1, February 2022.                                                                                                                                                                           ISSN: 2217-8309

                                                                                                                                                                                                                        eISSN: 2217-8333

 

TEM Journal

 

TECHNOLOGY, EDUCATION, MANAGEMENT, INFORMATICS

Association for Information Communication Technology Education and Science


WSNet – Convolutional Neural Networkbased Word Spotting for Arabic and English Handwritten Documents

 

Hanadi Hassen Mohammed, Nandhini Subramanian, Somaya Al-Maadeed, Ahmed Bouridane

 

© 2022 Hanadi Hassen Mohammed, published by UIKTEN. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. (CC BY-NC-ND 4.0)

 

Citation Information: TEM Journal. Volume 11, Issue 1, Pages 264-271, ISSN 2217-8309, DOI: 10.18421/TEM111-33, February 2022.

 

Received: 02 October 2021.

Revised:   26 January 2022.
Accepted: 02 February 2022.
Published: 28 February 2022.

 

Abstract:

 

This paper proposes a new convolutional neural network architecture to tackle the problem of word spotting in handwritten documents. A Deep learning approach using a novel Convolutional Neural Network is developed for the recognition of the words in historical handwritten documents. This includes a pre-processing step to re-size all the images to a fixed size. These images are then fed to the CNN for training. The proposed network shows promising results for both Arabic and English and both modern and historical documents. Four datasets – IFN/ENIT, Visual Media Lab – Historical Documents (VML-HD), George Washington and IAM datasets – have been used for evaluation. It is observed that the mean average precision for the George Washington dataset is 99.6%, outperforming other state-of-the-art methods. Historical documents in Arabic are known for being complex to work with; this model shows good results for the Arabic datasets, as well. This indicates that the architecture is also able to generalize well to other languages.

 

Keywords – Word spotting, Deep learning, Word recognition, Arabic word spotting.

 

 

-----------------------------------------------------------------------------------------------------------

Full text PDF >  

-----------------------------------------------------------------------------------------------------------

 


Copyright © 2022 UIKTEN
Copyright licence: All articles are licenced via Creative Commons CC BY-NC-ND 4.0 licence