0% found this document useful (0 votes)

5 views10 pages

Word Detection Using Conventional NN 3

This research presents a Convolutional Neural Network (CNN) model designed to classify Arabic broken words into singular and plural forms, addressing the unique challenges posed by the Arabic language's morphological structure. The CNN architecture consists of multiple convolutional and pooling layers, achieving high accuracy (93.8%) and precision in identifying broken plurals from a carefully curated dataset. The findings contribute to advancements in Arabic Natural Language Processing (NLP), enhancing applications such as sentiment analysis and machine translation.

Uploaded by

Ahmad Elkailany

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views10 pages

Word Detection Using Conventional NN 3

Uploaded by

Ahmad Elkailany

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

Word Detection UsingConvolutional Neural Networks

Najwa Mohammed Adeeb1/ Computer Department Faculty Of Education /University Of Zawia

Basma Emhamed Dihoum2/ dept. Computer Science/ University of Jafara Tripoli, Libya
[email protected]

Abstract.
This research focuses on the task of classifying Arabic broken words into singular or plural
using a Convolutional Neural Network (CNN). Arabic broken words pose a unique challenge due
to their intentional segmentation into smaller units, commonly found in informal text and dialectal
variations. The objective is to develop an effective model that leverages the power of CNNs to
capture the distinguishing features and patterns of Arabic broken words. The proposed CNN
architecture comprises 4 convolutional layers, 4 pooling layers, and a fully connected layer with
softmax activation. Each convolutional layer utilizes 64 filters with a size of 3, allowing for the
extraction of local patterns and features. The pooling layers down sample the learned features,
reducing spatial dimensions. Through meticulous dataset construction and preprocessing, the
model aims to achieve accurate classification, contributing to advancements in Arabic NLP and
facilitating information extraction from Arabic text. The outcomes of this research have the
potential to enhance various NLP applications, including sentiment analysis, machine translation,
and information retrieval, while improving communication between humans and machines in the
Arabic language context.

Keywords: Natural Language Processing (NLP) · Broken words ·convolutional neural

network (CNN).
Introduction
Natural Language Processing (NLP) [1, 2] is an interdisciplinary field that com- bines
artificial intelligence, computational linguistics, and computer science to enable
computers to understand, analyze, and generate human language. It en- compasses
various tasks such as text classification, sentiment analysis, machine translation, named
entity recognition, question answering, and language generation [3, 4][5, 6]. NLP has
significant applications in improving search engines, virtual assistants, and automating
language-based tasks, facilitating communication across different domains and
languages.
Stemming plays a critical role in NLP systems as it directly affects the performance of
applications that rely on word variations. Arabic language presents unique challenges for
stemming due to its complex morphological structure characterized by highly inflected and
derivational terms. In addition to the difficulties encountered in light and root-based Arabic
stemming, broken plurals pose a specific challenge. These irregular plural forms make it difficult
to accurately extract root words [7]. In Arabic, plurals (singular, dual, and plural) are classified
into regular and irregular categories. Regular plurals are formed by adding appropriate suffixes,
similar to English, such as teacher: teachers. The masculine plural is formed by adding the suffix
to the nominative suffix ‫ ة‬in the accusative and genitive cases. The feminine plural is formed by
attaching the suffix
1
Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

to the singular. On the other hand, irregular or broken plurals frequently occur in trilateral roots
and involve modifying the singular form, for example, tooth: teeth. Many nouns and adjectives
exhibit broken plural forms, and singular forms can undergo various pattern changes that alter
long vowels consonants ‫ي‬, or their absence within or outside the framework of the consonants.
In this study, we propose the use of Convolutional Neural Networks CNNs) to address the
challenges posed by broken plural words in Arabic stemming. CNNs are powerful deep
learning architectures known for their ability to capture complex patterns in data. By
training a CNN model on a carefully curated Arabic dataset specifically containing examples
of broken plurals, our aim is to effectively identify and classify these irregular plural forms.
The CNN model consists of multiple convolutional layers that extract local features from the
input data, followed by pooling layers to reduce the dimensionality, and fully connected
layers for classification. Through experimental evaluations, we demonstrate the
effectiveness of the CNN-based approach in accurately identifying and categorizing broken
plural words, achieving high precision and recall rates. This research not only provides a
novel solution for handling the challenges of broken plurals in Arabic stemming but also
opens up possibilities for further advancements in natural language processing tasks specific
to the Arabic language [24,25].

Related Works
Al-Saleh et al. [8] conducted a study where they trained a CNN model on a large dataset of
Arabic nouns. Their goal was to distinguish between regular and irregular plural forms of Arabic
broken plurals. By leveraging the convolutional layers of the CNN, the model learned to capture
the morphological patterns specific to each type of plural. The study achieved high accuracy in
classifying Arabic broken plurals, showcasing the effectiveness of CNNs in this task. For
example, the model correctly classified the singular form ‫( كتاب‬book) as a regular plural and the
plural form ‫( كتب‬books) as an irregular plural.

Salloum et al. [9] developed a CNN model specifically designed for the classification of Arabic
sound and broken plurals. They created a labeled dataset of Arabic nouns, consisting of both
regular and irregular plural forms, and trained the CNN to learn the morphological patterns
associated with each plural form. By leveraging the power of CNNs to capture local patterns and
relationships, the model achieved accurate categorization of Arabic broken plurals. For instance,
the model correctly classified ‫( أم‬mother) as a regular plural and ‫( آباء‬fathers) as an irregular plural.

Alshamiri et al. [10] employed a CNN architecture with multiple convolutional layers to
classify Arabic broken plurals. They utilized a large dataset of Arabic words, including regular
and irregular plural forms, to train their CNN model. By extracting and capturing the
morphological features indicative of irregular plurals, the CNN achieved promising results in the
classification task. For example, the CNN model correctly classified ‫( قلم‬pen) as a regular plural
and ‫( أقالم‬pens) as an irregular plural.

Darwish et al. [11] investigated the optimization of CNN-based classification systems for
Arabic word patterns, including broken plurals. They explored various factors, such as data
preprocessing techniques and different CNN architecture variations, to improve the performance
of the classification system. Through their comprehensive investigation, they achieved
2
Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

competitive results in the detection and classification of various Arabic word patterns, including
broken plurals.
Eldesouki et al. [12] conducted a study where they collected a substantial dataset of Arabic
nouns, including regular and broken plurals, and trained a CNN model to learn the distinct
patterns and features associated with each plural form. Their approach resulted in high accuracy
in detecting and classifying Arabic broken plurals. For example, the CNN model correctly
classified (‫( )واحد‬one) as a regular plural and (‫( )أحد‬ones) as a broken plural. Similarly, Hassan et
al. [13] utilized a large annotated dataset to train a CNN model specifically for detecting and
classifying Arabic broken plurals, achieving excellent classification performance by capturing
the morphological patterns of irregular plural forms. Furthermore, El-Sonbaty et al. [14]
combined CNNs with pre-trained word embedding’s to capture both local and global features in
classifying Arabic broken plurals. Their model accurately classified examples such as (‫)أخ‬
(brother) as a regular plural and (‫( )إخوة‬brothers) as an irregular plural. These studies collectively
highlight the effectiveness of CNN-based models in accurately classifying Arabic broken plurals,
contributing to improved stemming and language processing in Arabic text analysis.

Proposed approach
Database

We constructed our own dataset consisting of Arabic broken words for the task of classifying
them into two classes: singular and plural. The dataset was carefully annotated by
assigning the appropriate class label to each word, ensuring accu racy and consistency in
the labeling process. We paid attention to collecting diverse range of Arabic broken words
that encompassed different word lengths, variations in diacritics, and contextual information.
By including a variety of examples, we aimed to create a balanced dataset that accurately

represents the target classes and captures the inherent complexity of Arabic broken words.
This dataset serves as a valuable resource for training and evaluating our classification

Fig. 2. CNN architecture

3
Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

model, allowing us to effectively address the task of distinguishing between singular and
plural Arabic broken words using a CNN-based approach. Our dataset contain 21500 words.
In the "Arabic Text" column, you will find the singular forms of Arabic words, such as (‫)كتاب‬
(book), (‫( )قلم‬pen), and (‫() طالب‬student). These words represent the starting point for the
classification process.
The "Plural" column contains the corresponding plural forms of the Arabic words. These plurals
are essential for training the classification model to accurately distinguish between regular and
broken plurals. Examples of broken plurals in this column include (‫( )كتب‬books), (‫( )أقالم‬pens),
and (‫( )طالب‬students).

The "Class" column indicates the assigned class for each word. In the case of classifying broken
plurals, the classes can be "Regular" and "Broken." Words with regular plurals, such as (‫)كتاب‬
(book) and (‫( )قراءة‬reading), are assigned to the "Regular" class. On the other hand, words with
broken plurals, like (‫( )قلم‬pen) and (‫( )بيت‬house), are assigned to the "Broken" class.
Researchers and language processing systems can utilize this example database to apply various
techniques in analyzing and classifying Arabic broken plurals. These techniques enhance the
accuracy of classification and contribute to the development of effective language processing
systems for Arabic text analysis.

Pre-processing

Pre-processing is an essential step in preparing Arabic words for further analysis and
language processing tasks. This section describes the various steps involved in pre-
processing Arabic text.

Tokenization: Tokenization is the process of splitting the text into indi- vidual words or
tokens. In Arabic, words are often connected without explicit spaces, making it challenging
to identify word boundaries. Specialized tools or algorithms are used for accurate
tokenization in Arabic text analysis[15,16,17].

Normalization: Normalization aims to standardize the representation of words by

applying various transformations. This step involves removing diacritics, which are
small marks added to letters to indicate pronunciation. It also includes reducing
characters to their basic forms and handling different forms of letters and ligatures.
Normalization helps in achieving consistency and simplifying subsequent processing
steps.

Stopword Removal: Stopwords are commonly used words that do not carry significant
meaning in the context of text analysis. Examples of stopwords include conjunctions,
prepositions, and pronouns. Removing stopwords can help reduce noise and improve the
efficiency of language processing tasks.

Spell Checking: Spell checking is the process of identifying and correcting spelling errors
in the text. It involves comparing words against a dictionary or language model to identify
words that are not recognized or have potential mis- spellings. Spell checking can improve
4
Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

the accuracy of subsequent analysis and language processing tasks.

Punctuation Removal: Punctuation marks, such as commas, periods, and quotation
marks, are often removed during pre-processing. This step helps in simplifying the text
and focusing on the essential words and phrases [18,19,20].

Noise Reduction: Noise reduction techniques aim to remove unwanted or irrelevant

elements from the text. This may include removing non-textual characters, special
symbols, or formatting artifacts. Noise reduction improves the quality of the text and
enhances the effectiveness of subsequent analysis [21,22,23].
Model Architecture
The model architecture for classifying Arabic broken words as singular or plural using a
Convolutional Neural Network (CNN) typically includes an input layer that receives
preprocessed numerical representations of the broken words.
The architecture consists of multiple convolutional layers, where each layer applies
several filters with varying kernel sizes to capture different local patterns and features.
Activation functions like ReLU introduce non-linearity, enabling the model to learn
complex representations. Pooling layers, such as max pooling or average pooling, can be
added to down sample the learned features and reduce spatial-dimensions.

The output of the convolutional layers is typically flattened and passed through fully
connected layers, which provide higher-level feature combinations. The final layer

incorporates a softmax activation function to pro- duce class probabilities, indicating

whether the input broken word is singular or plural. This CNN architecture effectively

Fig. 2. CNN architecture

captures discriminative features in Arabic broken words, facilitating accurate

classification.
5
Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

Our CNN architecture for classifying Arabic broken words into singular or plural is
composed of 4 convolutional layers, 4 pooling layers, and a fully connected layer with
softmax activation. Each convolutional layer utilizes 64 filters with a size of 3. This allows
the model to capture local patterns and features within the input representation of the
broken words. The pooling layers help downsam- ple the learned features, reducing spatial
dimensions and providing a condensed
representation of the extracted information. Additionally, it’s worth mentioning that our
CNN architecture employs 1D convolutions, which are specifically designed to operate
on sequential data such as text. This architecture enables the model to effectively learn
the hierarchical patterns and dependencies present in Arabic broken words, ultimately
facilitating accurate classification.
Results

We evaluated the performance of our CNN classification model on the Arabic broken word
and single word dataset. The model was trained on a total of 160 samples, with 80 samples
belonging to the broken word class and 80 samples belonging to the single word class.
To evaluate our model we used the following metrics: accuracy, specificity, and
sensitivity as depicted by the equations (1), (2), and (3) where TP is the true positive,
TN is the true negative, FP is the false positive and FN is the false negative. The proposed
CNN model achieved 93.8% of accuracy, 90.2% of sensitivity, and 91.32% of specificity
after experimental verification

The accuracy and loss curves for training and validation are shown in figure 3.

6
Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

Fig. 3. Accuracy and Loss of the trained model

Conclusion

This study explores the use of Convolutional Neural Networks (CNNs) to address the
challenge of broken plural words in the Arabic language. Broken plurals in Arabic deviate
from regular plural patterns, making it difficult to accurately extract root words. The
proposed approach utilizes CNNs, a deep learning architecture known for its ability to
capture complex patterns, to effectively identify and classify broken plural words. The CNN
model is trained on a large Arabic dataset curated specifically for broken plurals, allowing
it to learn the underlying patterns and variations. Experimental results demonstrate the
effectiveness of the CNN-based approach in accurately identifying and categorizing
broken plural words, achieving high precision and recall rates. This research contributes to
the field of Arabic language processing by providing a novel solution to handle the
challenges posed by broken plural words and opens up possibilities for further advancements
in natural language processing tasks for the Arabic language

.
References
1. Cambria, Erik, and Bebo White. "Jumping NLP curves: A review of natural language
processing research." IEEE Computational intelligence magazine 9.2 (2014): 48-57.
2. Nadkarni, Prakash M., Lucila Ohno-Machado, and Wendy W. Chapman. "Natural
language processing: an introduction." Journal of the American Medical Informatics
Association 18.5 (2011): 544-551.
3. Dogra, Varun, et al. "A complete process of text classification system using state- of-
the-art NLP models." Computational Intelligence and Neuroscience 2022 (2022).
4. Sharma, Abhishek, et al. "Named entity recognition in natural language processing: A
systematic review." Proceedings of Second Doctoral Symposium on Computational
Intelligence: DoSCI 2021. Springer Singapore, 2022.
5. Shelar, Hemlata, et al. "Named entity recognition approaches and their comparison

7
Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

for custom ner model." Science & Technology Libraries 39.3 (2020): 324-337.
6. Kastrati, Zenun, et al. "Sentiment analysis of students’ feedback with NLP and deep
learning: A systematic mapping study." Applied Sciences 11.9 (2021): 3986.
7. Assiri, Adel, Ahmed Emam, and Hmood Aldossari. "Arabic sentiment analysis: a
survey." International Journal of Advanced Computer Science and Applications 6.12
(2015).
8. Al-Saleh, R., Atwell, E., & Brierley, C. (2018). Detecting Arabic Broken Plurals using
Convolutional Neural Networks. Proceedings of the Workshop on Semitic Lan- guages
and NLP, 36-43.
9. Salloum, W., Al-Badrashiny, M., El-Haj, M., & Darwish, K. (2019). Arabic Sound
and Broken Plural Morphological Patterns Detection using Deep Learning. Proceed-
ings of the International Conference on Arabic Language Processing, 1-10.
10. Alshamiri, M. B., Al-Mahboob, I. A., Alrashidi, K. T., & Alodhaibi, R. S. (2020). CNN-
based Model for Detecting and Classifying Arabic Broken Plurals. International Journal
of Advanced Computer Science and Applications, 11(1), 238-245.
11. Darwish, K., Magdy, W., & Mubarak, H. (2017). Deep Learning for Arabic Word
Pattern Detection. Proceedings of the International Conference on Arabic Language
Processing, 29-38.
12. Altawaier MM, Tiun S. Comparison of machine learning approaches on Arabic twitter
sentiment analysis. Int J Adv Sci, Eng Inf Technol 2016;6:1067–73
13. Al-Kabi MN, Kazakzeh SA, Abu Ata BM, Al-Rababah SA, Alsmadi IM. A novel root
based Arabic stemmer. J King Saud Univ-Comput Inf Sci 2015;27 (2):94–103.
14. Alshalabi H, Tiun S, Omar N, Al-Aswadi FN, Ali AK. Arabic light-based stemmer
using new rules. J King Saud Univ - Comput Inf Sci 2021. doi: https://doi.org/
10.1016/j.jksuci.2021.08.017.
15. Aljlayl M, Frieder O. On Arabic search: improving the retrieval effectiveness via a
light stemming approach. In: Proceedings of the eleventh international conference on
Information and knowledge management. p. 340–7.
16. Kchaou Z, Kanoun S. Arabic stemming with two dictionaries. In: 2008 International
Conference on Innovations in Information Technology. p. 688–91.
17. Iazzi S, Yousfi A, Bellafkih M, Aboutajdine D. Morphological analyzer of Arabic words
using the surface pattern. Int J Comput Sci Issues (IJCSI) 2013;10:254.
18. Ababneh M, Al-Shalabi R, Kanaan G, Al-Nobani A. Building an effective rulebased
light stemmer for Arabic language to improve search effectiveness. Int Arab J Inf
Technol (IAJIT) 2012:9
19. AlZubi S, Islam N, Abbod M. Enhanced hidden markov models for accelerat- ing
medical volumes segmentation. In: 2011 IEEE GCC Conference and Exhibition (GCC).
IEEE; 2011. p. 287–90.
20. Bi Y, Bhatia R, Kapoor S. Intelligent Systems and Applications: Proceedings of the
2019 Intelligent Systems Conference (IntelliSys). Springer Nature; 2019.
21. Zeroual I, Boudchiche M, Mazroui A, Lakhouaja A. Developing and performance
evaluation of a new Arabic heavy/light stemmer. In: Proceedings of the 2nd interna-
tional Conference on Big Data, Cloud and Applications. p. 17.
22. Yousfi A. The morphological analysis of Arabic verbs by using the surface patterns.
8
Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

IJCSI Int J Comput Sci Issues 2010;7:11.

23. Larkey LS, Ballesteros L, Connell ME. Light stemming for Arabic information
retrieval, Arabic computational morphology. Springer; 2007. p. 221–43.
24. Larkey LS, Ballesteros L, Connell ME. Improving stemming for Arabic information
retrieval: light stemming and co-occurrence analysis. In: Proceedings of the 25th
annual international ACM SIGIR conference on Research and development in infor-
mation retrieval. p. 275–82.
25. Alhutaish R, Omar N. Arabic text classification using k-nearest neighbour algo-
rithm. Int Arab J Inf Technol (IAJIT) 2015;12:19

9
Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

Direct and Indirect Speech Rules
No ratings yet
Direct and Indirect Speech Rules
16 pages
HAYA Phrases
No ratings yet
HAYA Phrases
9 pages
Unit 7 Short Test 2B: Grammar
No ratings yet
Unit 7 Short Test 2B: Grammar
2 pages
Irregular Verb List
No ratings yet
Irregular Verb List
6 pages
1 Intro-Mapping Solutions
No ratings yet
1 Intro-Mapping Solutions
4 pages
Prosodic Feature PPT Demo Teaching
No ratings yet
Prosodic Feature PPT Demo Teaching
16 pages
Grade 5 English Term 3 Lesson Plan
100% (1)
Grade 5 English Term 3 Lesson Plan
261 pages
Introduction To Deep Learning
No ratings yet
Introduction To Deep Learning
40 pages
English Language Final Test Guide
No ratings yet
English Language Final Test Guide
1 page
Translating Arabic & English Nuances
No ratings yet
Translating Arabic & English Nuances
16 pages
Improving Sentiment Analysis in Arabic Using Word Representation
No ratings yet
Improving Sentiment Analysis in Arabic Using Word Representation
6 pages
Affirmative: Negative
No ratings yet
Affirmative: Negative
2 pages
Mastering Relative Clauses
No ratings yet
Mastering Relative Clauses
5 pages
3.2.1. Lesson Three - Regret or Not?: Second Conditional I
No ratings yet
3.2.1. Lesson Three - Regret or Not?: Second Conditional I
7 pages
4 DataCollection Solutions
No ratings yet
4 DataCollection Solutions
4 pages
Ocr 46561 KD Asset Spec Pre
No ratings yet
Ocr 46561 KD Asset Spec Pre
2 pages
Film Review Speaking Exam Guide
No ratings yet
Film Review Speaking Exam Guide
2 pages
Possessive Pronouns
No ratings yet
Possessive Pronouns
1 page
Plural Nouns: Rules and Exercises
No ratings yet
Plural Nouns: Rules and Exercises
6 pages
CFG
No ratings yet
CFG
4 pages
Causative Grammar Guide
No ratings yet
Causative Grammar Guide
2 pages
Challenges and Stages in NLP
No ratings yet
Challenges and Stages in NLP
9 pages
EMRI Technology
No ratings yet
EMRI Technology
4 pages
English Grammar and Vocabulary Exercises
No ratings yet
English Grammar and Vocabulary Exercises
2 pages
Present Tense
No ratings yet
Present Tense
3 pages
9 GIS-Issues 07
No ratings yet
9 GIS-Issues 07
3 pages
Syl 03s2
No ratings yet
Syl 03s2
3 pages
A Comparative Study For Arabic Text Classification Algorithms Based On Stop Words Elimination
No ratings yet
A Comparative Study For Arabic Text Classification Algorithms Based On Stop Words Elimination
5 pages
Basic English Grammar HTTD
No ratings yet
Basic English Grammar HTTD
4 pages
Chizigula To English Dictionary Sil Somalia Instant Download
100% (1)
Chizigula To English Dictionary Sil Somalia Instant Download
74 pages
Arabic Plural System
100% (1)
Arabic Plural System
8 pages
1 50
No ratings yet
1 50
6 pages
Arabic Part of Speech Tagging Using The
No ratings yet
Arabic Part of Speech Tagging Using The
5 pages
6 DataMgmt Solutions
No ratings yet
6 DataMgmt Solutions
5 pages
Exercise 1
No ratings yet
Exercise 1
5 pages
Noun Plurals PDF
No ratings yet
Noun Plurals PDF
2 pages
Narrative in Sing Language
No ratings yet
Narrative in Sing Language
28 pages
English Reading Skills Guide
No ratings yet
English Reading Skills Guide
18 pages
Arabic NLP & ML in Social Media
No ratings yet
Arabic NLP & ML in Social Media
7 pages
A Novel Arabic Optical Character Recognition Approach Based On Levenshtein Distance
No ratings yet
A Novel Arabic Optical Character Recognition Approach Based On Levenshtein Distance
11 pages
Arabic Proper Noun Extraction Algorithm
No ratings yet
Arabic Proper Noun Extraction Algorithm
9 pages
Qur'an Dataset: Arabic-IPA Mapping
No ratings yet
Qur'an Dataset: Arabic-IPA Mapping
6 pages
11IASRUCSS186
No ratings yet
11IASRUCSS186
5 pages
Present Perfect Continuous
No ratings yet
Present Perfect Continuous
5 pages
Arabic Grammar Correction with CNN
No ratings yet
Arabic Grammar Correction with CNN
6 pages
Fine-Tuning and Multilingual Pre-Training For Abst
No ratings yet
Fine-Tuning and Multilingual Pre-Training For Abst
13 pages
Indexing of Arabic Documents Automatically Based On Lexical Analysis
No ratings yet
Indexing of Arabic Documents Automatically Based On Lexical Analysis
8 pages
Arabic WordNet Development
No ratings yet
Arabic WordNet Development
6 pages
Arabic Part-Of-Speech Tagging Using The Sentence Structure: Y.O. Mohamed El Hadj, I.A. Al-Sughayeir, A.M. Al-Ansari
No ratings yet
Arabic Part-Of-Speech Tagging Using The Sentence Structure: Y.O. Mohamed El Hadj, I.A. Al-Sughayeir, A.M. Al-Ansari
5 pages
Amharic Stopwords Auto-Generation
No ratings yet
Amharic Stopwords Auto-Generation
10 pages
Welcome To International Journal of Engineering Research and Development (IJERD)
No ratings yet
Welcome To International Journal of Engineering Research and Development (IJERD)
4 pages
S5-Automatic Arabic Text Summarisation System (AATSS) Based On Morphological Analysis
No ratings yet
S5-Automatic Arabic Text Summarisation System (AATSS) Based On Morphological Analysis
9 pages
AraBERT for Arabic Reviews Analysis
No ratings yet
AraBERT for Arabic Reviews Analysis
9 pages
Euphemism and Dysphemism
No ratings yet
Euphemism and Dysphemism
22 pages
A DErivational ARabic Ontology Based On Verbs
No ratings yet
A DErivational ARabic Ontology Based On Verbs
19 pages
ESOLE 2018 A Morphological Analyzed Corpus
No ratings yet
ESOLE 2018 A Morphological Analyzed Corpus
7 pages
Arabic Root Based Stemmer
No ratings yet
Arabic Root Based Stemmer
8 pages
AutomaticallyGeneratedPhonemicArabicIPA PDF
No ratings yet
AutomaticallyGeneratedPhonemicArabicIPA PDF
7 pages
2023 East20248286293210 3217
No ratings yet
2023 East20248286293210 3217
9 pages
English Grammar and Vocabulary Test
No ratings yet
English Grammar and Vocabulary Test
38 pages
Arabic Text Summarization Challenges Usi
No ratings yet
Arabic Text Summarization Challenges Usi
9 pages
A Lexicon of Arabic Verbs Constructed On
No ratings yet
A Lexicon of Arabic Verbs Constructed On
9 pages
SAINIK SCHOOL ENG Part 22
No ratings yet
SAINIK SCHOOL ENG Part 22
61 pages
Arabic Classification
No ratings yet
Arabic Classification
9 pages
Developing A Normalizer For San Ani Arab
No ratings yet
Developing A Normalizer For San Ani Arab
9 pages
7i4feed Forward Back Propagation Neural Network Method For Arabic Vowel Recognition Based On Wavelet Linear Prediction Coding Copyright Ijaet
No ratings yet
7i4feed Forward Back Propagation Neural Network Method For Arabic Vowel Recognition Based On Wavelet Linear Prediction Coding Copyright Ijaet
11 pages
Arabic Speech Recognition Evolution
No ratings yet
Arabic Speech Recognition Evolution
8 pages
2025 Vardial-1 12
No ratings yet
2025 Vardial-1 12
11 pages
Text Classification For Arabic Words Using Rep-Tree
No ratings yet
Text Classification For Arabic Words Using Rep-Tree
8 pages
The Full Transfer - Full Access Model and L3 Cognitive States
No ratings yet
The Full Transfer - Full Access Model and L3 Cognitive States
29 pages
9-Article Text-17-2-10-20160401
No ratings yet
9-Article Text-17-2-10-20160401
10 pages
Word Embedding-SemanticFeatureExtraction
No ratings yet
Word Embedding-SemanticFeatureExtraction
14 pages
Graph-Based Morphological Analysis
No ratings yet
Graph-Based Morphological Analysis
4 pages
Arabic Sentence Parsing Framework
No ratings yet
Arabic Sentence Parsing Framework
7 pages
Arabic Diacritization Error Analysis
No ratings yet
Arabic Diacritization Error Analysis
20 pages
A Comprehensive Survey On Arabic Text Augmentation: Approaches, Challenges, and Applications
No ratings yet
A Comprehensive Survey On Arabic Text Augmentation: Approaches, Challenges, and Applications
34 pages
C Sit 141002
No ratings yet
C Sit 141002
13 pages
English Paper
No ratings yet
English Paper
13 pages
An Ensemble Approach For Comprehensive English Proficiency Evaluation Support: Grammatical Error Correction, Tense Prediction, and CEFR Grading
No ratings yet
An Ensemble Approach For Comprehensive English Proficiency Evaluation Support: Grammatical Error Correction, Tense Prediction, and CEFR Grading
13 pages
An Ensemble Approach For Comprehensive English Proficiency Evaluation Support: Grammatical Error Correction, Tense Prediction, and CEFR Grading
No ratings yet
An Ensemble Approach For Comprehensive English Proficiency Evaluation Support: Grammatical Error Correction, Tense Prediction, and CEFR Grading
14 pages
An Identification Model Used For Arabic Libyan Dialects Based On Machine Learning Approach
No ratings yet
An Identification Model Used For Arabic Libyan Dialects Based On Machine Learning Approach
14 pages
Bashaier Proposal Ver 22-8-2024
No ratings yet
Bashaier Proposal Ver 22-8-2024
15 pages
Geez Summerization
No ratings yet
Geez Summerization
15 pages
LPI 1 Ahmed+Ali 8 1046
No ratings yet
LPI 1 Ahmed+Ali 8 1046
24 pages
Mequanent Argaw
No ratings yet
Mequanent Argaw
68 pages
An Application Oriented Arabic Morphological Analyzer
No ratings yet
An Application Oriented Arabic Morphological Analyzer
13 pages
Questions With and Without Auxiliary Verbs
No ratings yet
Questions With and Without Auxiliary Verbs
2 pages
Arabic NLP Workshop Proceedings
No ratings yet
Arabic NLP Workshop Proceedings
12 pages
Mental Representation of Multiple Defaul
No ratings yet
Mental Representation of Multiple Defaul
22 pages
A Survey On Arabic Character Recognition
No ratings yet
A Survey On Arabic Character Recognition
27 pages
Light Stemming For Arabic Information Retrieval
No ratings yet
Light Stemming For Arabic Information Retrieval
34 pages
Semantic Valence of Arabic Verbs (Volumes I and II)
100% (2)
Semantic Valence of Arabic Verbs (Volumes I and II)
729 pages
Basic Arabic Grammar
100% (1)
Basic Arabic Grammar
8 pages
Omanic Arabic
100% (1)
Omanic Arabic
140 pages

Word Detection Using Conventional NN 3

Uploaded by

Word Detection Using Conventional NN 3

Uploaded by

Academy journal for Basic and Applied Sciences (AJBAS) Volume 5 # 4 August 2023

Word Detection UsingConvolutional Neural Networks

Keywords: Natural Language Processing (NLP) · Broken words ·convolutional neural

Fig. 2. CNN architecture

Normalization: Normalization aims to standardize the representation of words by

the accuracy of subsequent analysis and language processing tasks.

Noise Reduction: Noise reduction techniques aim to remove unwanted or irrelevant

incorporates a softmax activation function to pro- duce class probabilities, indicating

Fig. 2. CNN architecture

captures discriminative features in Arabic broken words, facilitating accurate

Fig. 3. Accuracy and Loss of the trained model

IJCSI Int J Comput Sci Issues 2010;7:11.

You might also like