Textile Defect Detection Algorithm Based on the Improved YOLOv8
ABSTRACT Automatic detection of textile defects is a crucial factor in improving textile quality, and fast, accurate detection of these defects is key to achieving automation in the textile industry. However, textile defect detection faces challenges such as small defect targets, low contrast between defects and the background, and large variations in the aspect ratio of defects. To address these issues, this study proposes a new method for textile defect detection based on an improved version of YOLOv8 (You Only Look Once version 8), called DA-YOLOv8s. The Deep & Cross Network (DCNv2) is introduced into the Backbone network to replace the C2f module, enhancing the network's feature extraction; a self-attention mechanism, Polarized Self-Attention (PSA), is adopted to increase the feature fusion capability and reduce feature loss in both the channel and spatial dimensions; finally, a Small Object Detection Head (SOHead) is added to improve feature extraction for small targets. Experimental results show that the improved YOLOv8 algorithm achieves [email protected] and mAP of 44.6% and 48.6% respectively, an improvement of 4.2% and 3.8% over the original algorithm, and it also outperforms the optimal YOLOv9s model and the latest YOLOv11s model on these two metrics. The speed of textile defect detection reaches 257.38 frames per second (FPS) at a computational cost of 36.6 GFLOPs, ensuring both the accuracy and the speed of textile defect detection and giving the method practical engineering application value.
INDEX TERMS Interest Point Detection, Textile Industry, Quality Management, YOLOv8, Textile Defect Detection, Polarized Self-Attention, Deep & Cross Network
characterize the original image, and computed the Euclidean distance of GLCMs between the images to be detected and the defect-free template image to achieve defect detection. Guan et al. [5] first highlighted the defect areas using image enhancement techniques, then used the first-order derivative for edge detection while employing the Roberts operator to detect the edges of the defect areas to improve detection accuracy.

Spectral analysis methods treat images as two-dimensional signals with amplitude variations and perform frequency-domain analysis through certain transformation algorithms, commonly the Fourier transform, the Wavelet transform, and the Gabor filter transform. Hu et al. [6] proposed an unsupervised method based on the combination of the discrete Fourier transform (DFT) and the discrete wavelet transform (DWT), which performs wavelet shrinkage denoising on the residual image after Fourier recovery and applies the inverse transformation to the approximation coefficients and the processed wavelet coefficients separately to achieve defect segmentation with simple thresholding. Li et al. [7] proposed a Defect Direction Projection Algorithm (DDPA) based on the characteristics of fabric defects, which filters the input image using Gabor filters, performs a Radon transform projection after hard-threshold segmentation, and selects the optimal Gabor filter channel, that is, the channel with the maximum defect value, to detect defects. Xiang et al. [8] proposed a defect detection algorithm based on Fourier convolution, which generates image pairs for training using random masking in the training phase and incorporates a Fourier convolution layer into the autoencoder to achieve automatic detection of dyed fabric defects.

The defect detection algorithms based on traditional computer vision have high computational requirements, and their detection speed and accuracy need to be improved.

Deep learning is a newer framework in computer vision research, and it has been widely applied to defect detection with the rapid development of big data and artificial intelligence technologies [9]–[10]. Deep learning can automatically extract features and optimize and iterate parameters, and can therefore detect defects in textile images. Mei et al. [11] designed a Multi-Scale Convolutional Denoising Autoencoder Network (MSCDAE) that achieved unsupervised detection of textile defects. The algorithm trains a Convolutional AutoEncoder (CAE) on positive samples, enabling it to extract fabric features and reconstruct fabric images; defects are then identified from the difference in features between defective images and normal fabric images. Jing et al. [12] proposed an automatic detection method for fabric defects based on convolutional neural networks. This method decomposes textile images into multiple local patches and labels them, transmits them to a pre-trained deep CNN for learning, and uses the trained model to detect each patch, thereby obtaining the category and position of each defect. Ma et al. [13] used a VGG16 model with improved parameters to train a classifier for detecting and recognizing defects in denim fabrics and constructed a defect detection algorithm based on a cascading architecture by merging the two models. However, these deep learning methods still fall short in detection speed and accuracy.

In recent years, with the further development of computer vision technology, research on object detection algorithms has largely focused on candidate-region-based and regression-based deep convolutional neural networks. The Faster R-CNN algorithm is a representative of the candidate-region-based object detection algorithms, demonstrating excellent performance in the field of object detection [14]–[17]. Wei et al. [18] proposed a Faster R-CNN model based on an improved VGG structure, which adapts to the characteristics of fabric defect images by reducing the number of anchor points in the Faster R-CNN. The VGG16 was modified to include 13 convolutional layers (with ReLU activations) and four pooling layers to extract feature maps; additionally, the Region Proposal Network (RPN) and the ROI pooling layer were improved to enhance the model. An et al. [19] improved the Faster R-CNN network for textile defect detection by using deep residual networks instead of the traditional VGG-16 for feature extraction, and by incorporating methods such as adding feature pyramid modules and increasing the number of anchor boxes. Chen et al. [20] designed a Genetic Algorithm Gabor Faster R-CNN (Faster GG R-CNN) model, which embeds Gabor kernels into Faster R-CNN and employs a two-stage training method based on a Genetic Algorithm (GA) and backpropagation for textile defect detection. Faster R-CNN is a two-stage object detection algorithm, with the first stage producing region proposals and the second stage recognizing objects within the proposed boxes, which limits the speed of object detection.

Regression-based object detection algorithms directly regress the bounding-box coordinates and object categories at multiple positions in the input image, balancing accuracy and speed; YOLO is a typical representative of such algorithms. The diversity of YOLO's applications in industrial defect detection also verifies the effectiveness of the algorithm [21]–[24]. Yue et al. [25] proposed an improved YOLOv4 textile defect detection algorithm which, after expanding the dataset using combined data augmentation methods, improved the head prediction layer and integrated the Convolutional Block Attention Module (CBAM) to achieve accurate classification and localization of tiny target defects. Jin et al. [26] improved the YOLOv5 network, introducing spatial and channel attention models into the backbone network and designing a multi-task learning strategy with two detection heads, one for detecting common defects and one for identifying specific defects, to improve the accuracy of defect recognition. These methods have weaker detection capabilities for irregularly sized defects, and their accuracy can be further improved.

Our research focuses on improving the accuracy of textile defect detection using deep neural network technology.
The main contributions of this work are as follows. (1) We introduce DCNv2 into the Backbone network to replace the C2f module, enhancing feature extraction. (2) We adopt the Polarized Self-Attention (PSA) mechanism to strengthen feature fusion in the channel and spatial dimensions. (3) We add a detection head for small objects, SOHead, to prevent the loss of small-object features, thereby improving detection performance.

Experiments show that, with YOLOv8s as the base network model, textile defect detection and recognition is achieved; the improved DA-YOLOv8s increases [email protected] and [email protected],0.3,0.1 by 4.2% and 3.8%, reaching 44.6% and 48.6% respectively, although there is still room for improvement in model accuracy.

Section I introduces the background of textile defect detection and the development of detection methods, summarizes the characteristics of these methods, and proposes improvements and innovations to the YOLOv8s benchmark model. Section II describes the principles of convolutional neural networks and the feasibility of incorporating attention mechanisms into neural networks, and introduces the basic network structure of YOLOv8. Section III presents the construction of the DA-YOLOv8s model, including the introduction of DCNv2 to enhance the feature extraction capability of the backbone network, the introduction of PSA to improve the feature fusion capability of the neck network, and the addition of the SOHead detection head to enhance the detection of small targets. Section IV introduces the dataset and evaluation metrics, conducts comparative experiments and an ablation study, and analyzes the experimental results. Section V summarizes the achievements of this paper and looks ahead to future research. The structure of the paper is shown in Fig 1.

II. RELATED WORK
To enhance the feature extraction capability for computer images and improve detection accuracy, extensive research has been conducted. In this section, we introduce the relevant research work and, drawing on these studies, propose an improved method, as summarized in Table 1.

TABLE 1. Related Studies and Improvement Methods
Research Focus | Improved Method | Section
CNN | DCNv2 | Section III, Part B
Attention Mechanism | PSA | Section III, Part C
YOLOv8 | DCNv2+PSA+SOHead | Section III, Part D
Pooling Layer: Replaces the output of the network at a given location with a statistical measure of its neighboring area. Common pooling functions include the average pooling and max pooling strategies.

Fully Connected Layer: While the convolutional layers are capable of extracting features from the input data, the role of the fully connected layer is to perform non-linear combinations of the features extracted by the convolutional and pooling layers to produce the output.

Output Layer: For image classification problems, the output layer uses a logistic function or a normalized exponential function (the softmax function) to output classification labels. In object detection problems, the output layer can be designed to output the center coordinates, size, and classification of the object.

Traditional CNNs extract features in the form of linear models, which have limited extraction capabilities. In contrast, the Cross Network can achieve multi-layer feature interactions, with each layer producing higher-order interactions on top of existing ones while retaining the interactions from previous layers. The cross network can be trained jointly with a deep neural network [30]. Here, we introduce the DCNv2 model to improve the C2f in YOLOv8, which includes the classic CNN, as discussed in Section III, Part B.

The YOLOv8 network consists of the input end, the Backbone network, the Neck network, and the Head module, as shown in Fig 2. The input end mainly includes Mosaic data augmentation, adaptive anchor box calculation, and adaptive grayscale padding. The Backbone network contains structures such as Conv, C2f, and SPPF, among which the C2f module is the primary module for learning residual features; its cross-layer branch connections enrich the gradient flow of the model and form a neural network module with stronger feature representation capabilities. The Neck network adopts the PAN (Path Aggregation Network) structure, which enhances the network's ability to fuse features of objects at different scales. The Head module is the output end, which decouples the classification and regression processes. The loss calculation mainly includes the positive/negative sample assignment strategy and the loss computation: assignment uses the dynamic Task Aligned Assigner [33], which selects positive samples based on the weighted combination of classification and regression scores; the classification branch adopts BCE Loss, while the regression branch uses the distribution focal loss [34] and the CIoU (complete intersection over union) loss. The network structure is shown in Fig 2.
FIGURE 2. YOLOv8 Network Structure Diagram. Here w represents the width of the convolutional kernel and r represents the scale factor.
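For reference, the CIoU regression loss used in the Head can be sketched as below. This is a standard formulation of CIoU written by us for illustration, not necessarily the exact variant in YOLOv8's implementation:

```python
import math
import torch

def ciou_loss(box1: torch.Tensor, box2: torch.Tensor, eps: float = 1e-7) -> torch.Tensor:
    """CIoU loss between boxes given as (x1, y1, x2, y2):
    loss = 1 - (IoU - rho^2/c^2 - alpha * v)."""
    # intersection area
    xi1 = torch.max(box1[..., 0], box2[..., 0])
    yi1 = torch.max(box1[..., 1], box2[..., 1])
    xi2 = torch.min(box1[..., 2], box2[..., 2])
    yi2 = torch.min(box1[..., 3], box2[..., 3])
    inter = (xi2 - xi1).clamp(0) * (yi2 - yi1).clamp(0)
    # union area and IoU
    w1, h1 = box1[..., 2] - box1[..., 0], box1[..., 3] - box1[..., 1]
    w2, h2 = box2[..., 2] - box2[..., 0], box2[..., 3] - box2[..., 1]
    union = w1 * h1 + w2 * h2 - inter + eps
    iou = inter / union
    # squared center distance over squared diagonal of the enclosing box
    cw = torch.max(box1[..., 2], box2[..., 2]) - torch.min(box1[..., 0], box2[..., 0])
    ch = torch.max(box1[..., 3], box2[..., 3]) - torch.min(box1[..., 1], box2[..., 1])
    c2 = cw ** 2 + ch ** 2 + eps
    rho2 = ((box1[..., 0] + box1[..., 2] - box2[..., 0] - box2[..., 2]) ** 2 +
            (box1[..., 1] + box1[..., 3] - box2[..., 1] - box2[..., 3]) ** 2) / 4
    # aspect-ratio consistency term
    v = (4 / math.pi ** 2) * (torch.atan(w2 / (h2 + eps)) - torch.atan(w1 / (h1 + eps))) ** 2
    alpha = v / (1 - iou + v + eps)
    return 1 - (iou - rho2 / c2 - alpha * v)
```

Compared with plain IoU, the extra distance and aspect-ratio terms keep the gradient informative even when predicted and ground-truth boxes do not overlap.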
DCNv2 combines explicit and implicit feature interactions, thereby achieving automated feature cross-encoding and improving the efficiency of high-order feature extraction. Stacked and parallel structures were designed; here we adopt the parallel structure, passing the input features through the cross network layer and the deep network layer and finally connecting them. The network structures of DCNv2 and the DCNv2 Block are shown in Fig 4. This not only enables efficient learning of the intersection of sparse and dense features in images but also enhances the model's perception of, and ability to learn, defect details.
FIGURE 4. The Network Structure of DCNv2. The left figure details the specific algorithm of DCNv2 Block.
Embedding Layer: Classifies the input features into a combination of sparse and dense features, transforms the sparse features into embedding vectors, and normalizes the dense features; its output is the concatenation of all embedding vectors and the normalized dense features: x_0 = [x_{embed,1}; \cdots; x_{embed,n}; x_{dense}].

Cross network and deep network: The cross network is characterized by the features of the l-th layer being operated on with a learned weight matrix and bias vector and then combined with the first-order original features of the base layer to produce the features of the next layer. The operation rule for a single layer is shown in Fig 5.

FIGURE 5. Single-Layer Operation Rule for Cross Networks.

The deep network, by contrast, takes the features of a given layer, operates on them with a weight matrix and bias vector, and then applies the ReLU activation to obtain the feature input for the next layer, following the operation rule shown in (1):

h_{l+1} = f(W_l h_l + b_l) \quad (1)

Deep and cross combination: Combining the cross network and the deep network yields two structures, namely the stacked structure and the parallel structure. The stacked structure feeds the output of the cross network into the deep network as its input; the parallel structure runs the two networks in parallel and combines their outputs in a single output layer. In practice, which architecture performs better depends on the data. The predictive function is given in (2):

\hat{y}_i = \sigma\left(W_{logit}^{T} x_{final}\right) \quad (2)

where W_{logit} is the weight vector of the logit and \sigma(x) = 1/(1 + \exp(-x)). For the final loss, the logarithmic loss function (Log Loss) with L2 regularization is used, as shown in (3):

loss = -\frac{1}{N}\sum_{i=1}^{N}\left[y_i \log(\hat{y}_i) + (1 - y_i)\log(1 - \hat{y}_i)\right] + \lambda\sum_{l}\lVert W_l \rVert_2^2 \quad (3)

where \hat{y}_i is the prediction, y_i is the true label, N is the total number of inputs, and \lambda is the L2 regularization parameter.
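As a hedged, concrete illustration of (1)-(3) and the parallel deep-and-cross combination, the following PyTorch sketch is our own reading of the structure, not the authors' released implementation; all layer sizes are assumptions:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CrossLayer(nn.Module):
    """One cross layer (the rule of Fig 5): x_{l+1} = x_0 * (W_l x_l + b_l) + x_l.
    Each layer adds one more order of feature interaction while keeping
    the interactions already accumulated in x_l."""
    def __init__(self, dim: int):
        super().__init__()
        self.linear = nn.Linear(dim, dim)  # learned W_l and b_l

    def forward(self, x0: torch.Tensor, xl: torch.Tensor) -> torch.Tensor:
        return x0 * self.linear(xl) + xl   # element-wise product with base features

class ParallelDCN(nn.Module):
    """Parallel deep & cross structure: both branches read x0; their outputs
    are concatenated into x_final and mapped to one logit, eq. (2)."""
    def __init__(self, dim: int, n_cross: int = 3, hidden: int = 128, depth: int = 2):
        super().__init__()
        self.cross = nn.ModuleList([CrossLayer(dim) for _ in range(n_cross)])
        layers, d = [], dim
        for _ in range(depth):                    # deep branch, eq. (1), ReLU-activated
            layers += [nn.Linear(d, hidden), nn.ReLU()]
            d = hidden
        self.deep = nn.Sequential(*layers)
        self.logit = nn.Linear(dim + hidden, 1)   # W_logit in eq. (2)

    def forward(self, x0: torch.Tensor) -> torch.Tensor:
        xc = x0
        for layer in self.cross:                  # cross branch
            xc = layer(x0, xc)
        x_final = torch.cat([xc, self.deep(x0)], dim=-1)
        return torch.sigmoid(self.logit(x_final)).squeeze(-1)

def dcn_loss(y_hat, y, model, lam=1e-4):
    """Log Loss with L2 regularization, eq. (3)."""
    bce = F.binary_cross_entropy(y_hat, y)
    l2 = sum((w ** 2).sum() for w in model.parameters())
    return bce + lam * l2
```

Stacking L cross layers yields feature interactions of order up to L + 1, while the parallel deep branch captures implicit interactions; the two are only combined at the output layer.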
C. INTRODUCING PSA TO ENHANCE THE FEATURE FUSION ABILITY OF THE NECK NETWORK
The Neck network in YOLOv8 fuses low-level feature maps with high-level feature maps through a series of convolutional and upsampling layers, thereby enhancing the accuracy of object detection. However, capturing object details and the feature information within small target regions more effectively requires feature extraction focused on the channel and specific spatial levels. To address this, Polarized Self-Attention (PSA) is introduced to enhance the representation fusion capability of the entire network.

Polarized self-attention is designed for pixel-level regression tasks. It maintains relatively high resolution in both the channel and spatial dimensions (retaining C/2 channels and the full [H, W] spatial extent), which reduces the information loss caused by dimensionality reduction, and it composes nonlinear functions that directly correspond to the typical fine-grained regression output distribution, making the fitted output more refined and closer to the actual output. The structure includes self-attention in both the channel and spatial dimensions and fuses the results of the two to obtain the polarized self-attention output. The polarized attention mechanism has two variants, a parallel structure and a sequential structure, shown in the left figure of Fig 6 and in Fig 7 respectively. In this work, the parallel structure is integrated into the YOLOv8 network. Within the Neck network, the PSA module receives the output of the C2F module and processes it in parallel with channel-wise self-attention and spatial self-attention, applying convolutions, reshaping, and the Sigmoid function, among other operations; the results are then combined and passed on to the detection head and Conv modules. The enhanced structure of the YOLOv8 neck network is illustrated in Fig 6. The PSA computation proceeds as follows.

Channel Dimension Self-Attention A^{ch}(X) \in R^{C \times 1 \times 1}: First, the input features X are transformed by W_q and W_v, each a 1 \times 1 convolution; the channels of W_q are fully compressed, while the channel dimension of W_v remains at a relatively high level (C/2). Because the channel dimension of W_q is compressed, its information must be enhanced (HDR), which is done with Softmax. W_q and W_v are then multiplied as matrices, followed by a 1 \times 1 convolution and LN to raise the channel dimension back to C. Finally, the Sigmoid function keeps all values between 0 and 1. The operation is shown in (4):

A^{ch}(X) = F_{SG}\left[ W_{z|\theta_1}\left( \sigma_1(W_v(X)) \times F_{SM}(\sigma_2(W_q(X))) \right) \right] \quad (4)

where W_q, W_v, and W_z are 1 \times 1 convolutional layers, \sigma_1 and \sigma_2 are two tensor reshaping operators, F_{SM}(\cdot) is the SoftMax operator given in (5), and "\times" is the matrix dot-product operation. The number of internal channels between W_v, W_q, and W_z is C/2, and the output of channel self-attention is Z^{ch} = A^{ch}(X) \odot^{ch} X \in R^{C \times H \times W}, where \odot^{ch} is the channel-wise multiplication operator.

F_{SM}(X) = \sum_{j=1}^{N_p} \frac{e^{x_j}}{\sum_{m=1}^{N_p} e^{x_m}} x_j \quad (5)

Spatial Dimension Self-Attention A^{sp}(X) \in R^{1 \times H \times W}: First, the input features are transformed by W_q and W_v, each a 1 \times 1 convolution. For the W_q branch, global pooling F_{GP} of (7) compresses the spatial dimension to 1 \times 1, while the spatial dimension of the W_v branch is maintained at the relatively large level H \times W. Since the spatial dimension of W_q is compressed, Softmax is used to enhance its information. Matrix multiplication followed by the Sigmoid then produces the spatial attention map, as shown in (6):

A^{sp}(X) = F_{SG}\left[ \sigma_3\left( F_{SM}\left(\sigma_1(F_{GP}(W_q(X)))\right) \times \sigma_2(W_v(X)) \right) \right] \quad (6)

where W_q and W_v are standard 1 \times 1 convolutional layers, \sigma_1, \sigma_2, and \sigma_3 are three tensor reshaping operators, F_{SM}(\cdot) is the SoftMax operator, F_{GP}(\cdot) is the global pooling operator of (7), and "\times" denotes the matrix dot product. The output of spatial self-attention is Z^{sp} = A^{sp}(X) \odot^{sp} X \in R^{C \times H \times W}, where \odot^{sp} is the spatial multiplication operator.

F_{GP}(X) = \frac{1}{H \times W} \sum_{i=1}^{H} \sum_{j=1}^{W} X(:, i, j) \quad (7)

Combination of Channel and Spatial Self-Attention: The parallel combination of channel and spatial self-attention forms the PSA parallel structure of (8), while composing them sequentially gives the sequential structure of (9):

PSA_p(X) = Z^{ch} + Z^{sp} = A^{ch}(X) \odot^{ch} X + A^{sp}(X) \odot^{sp} X \quad (8)

PSA_s(X) = A^{sp}\left(A^{ch}(X) \odot^{ch} X\right) \odot^{sp} \left(A^{ch}(X) \odot^{ch} X\right) \quad (9)
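A compact PyTorch sketch of the parallel PSA module of (4)-(8) follows. It reflects our reading of the public PSA formulation; the exact channel counts and normalization placement are assumptions with respect to this paper's implementation:

```python
import torch
import torch.nn as nn

class PSAParallel(nn.Module):
    """Parallel Polarized Self-Attention: channel and spatial branches are
    computed on the same input and their re-weighted maps are summed, eq. (8)."""
    def __init__(self, c: int):
        super().__init__()
        # channel branch, eq. (4)
        self.ch_wv = nn.Conv2d(c, c // 2, 1)   # W_v: keep C/2 channels
        self.ch_wq = nn.Conv2d(c, 1, 1)        # W_q: fully compress channels
        self.ch_wz = nn.Conv2d(c // 2, c, 1)   # W_z: restore to C
        self.ln = nn.LayerNorm(c)
        # spatial branch, eq. (6)
        self.sp_wv = nn.Conv2d(c, c // 2, 1)
        self.sp_wq = nn.Conv2d(c, c // 2, 1)
        self.pool = nn.AdaptiveAvgPool2d(1)     # F_GP, eq. (7)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        # --- channel self-attention, A_ch(X) in R^{C x 1 x 1} ---
        v = self.ch_wv(x).reshape(b, c // 2, h * w)            # sigma_1
        q = self.ch_wq(x).reshape(b, h * w, 1).softmax(dim=1)  # sigma_2 + F_SM
        z = torch.matmul(v, q).unsqueeze(-1)                   # (b, c/2, 1, 1)
        z = self.ch_wz(z).reshape(b, c)                        # restore channels + LN
        a_ch = torch.sigmoid(self.ln(z)).reshape(b, c, 1, 1)   # F_SG
        z_ch = a_ch * x                                        # channel multiplication
        # --- spatial self-attention, A_sp(X) in R^{1 x H x W} ---
        q = self.pool(self.sp_wq(x)).reshape(b, 1, c // 2).softmax(dim=-1)
        v = self.sp_wv(x).reshape(b, c // 2, h * w)
        a_sp = torch.sigmoid(torch.matmul(q, v).reshape(b, 1, h, w))
        z_sp = a_sp * x                                        # spatial multiplication
        return z_ch + z_sp                                     # parallel fusion, eq. (8)
```

Because the attention maps only re-weight X, the module preserves the input shape and can be dropped between the C2f output and the detection head as described above.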
Hyperparameter Value
Batch Size 16
Epochs 100
Learning Rate 0.0001
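With the Ultralytics training API, a baseline run using these hyperparameters could look like the following sketch; the dataset YAML name is a placeholder, and training DA-YOLOv8s itself would additionally require the modified model definition from Section III:

```python
from ultralytics import YOLO

# Baseline YOLOv8s with the hyperparameters tabulated above;
# 'textile_defects.yaml' is a hypothetical dataset config.
model = YOLO("yolov8s.pt")
model.train(
    data="textile_defects.yaml",
    epochs=100,     # Epochs
    batch=16,       # Batch Size
    lr0=0.0001,     # initial Learning Rate
)
```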
The Cascade R-CNN Algorithm Model: This represents an enhancement of Faster R-CNN, proposing a cascaded R-CNN detection and inference architecture composed of a sequence of detectors trained with increasing IoU thresholds. This cascaded sampling progressively enhances the quality of detection.

The YOLOv3 Algorithm Model: Introduced in 2018, this YOLO detection model utilizes a Feature Pyramid Network (FPN) for feature fusion and incorporates residual connection modules to enable multi-scale training. It outputs feature maps at three scales; in this context, we adopt YOLOv3-tiny as the baseline model.

The YOLOv5 algorithm model: YOLOv5 is an object detection model released by Ultralytics in June 2020 that excels in inference speed. This paper adopts the YOLOv5s version, which has a smaller number of parameters and is suitable for lightweight devices or scenarios requiring fast inference. The YOLOv5 network consists of an input end (Input), a Backbone network, a Neck network, and a detection end (Head). The input end uses data augmentation techniques and adaptive anchor calculation to enrich the dataset and reduce GPU occupation; the Backbone network utilizes the Focus and CSPDarknet53 structures to optimize the classifier, enhancing the diversity and robustness of features; the Neck network adopts the SPP module and the FPN+PAN structure, which strengthens both semantic and positional information, fuses the features extracted by the Backbone network, and further improves the model's performance and accuracy.

The YOLOX algorithm model: YOLOX is an object detection algorithm proposed by Megvii Technology in 2021. It improves on YOLOv3-SPP, adopting decoupled heads in which classification and position prediction are handled by two separate branches; this reduces information redundancy and improves detection accuracy. YOLOX introduces an anchor-free design, directly predicting the center coordinates and the width and height of objects, which enhances detection flexibility and avoids the performance bottlenecks caused by anchor boxes. Moreover, YOLOX employs an advanced label assignment strategy that takes into account the size, position, and shape of objects, making label assignment more rational and accurate.

The YOLOv6 algorithm model: YOLOv6 is an object detection framework developed by Meituan's Visual Intelligence Department and released in 2022, with numerous improvements to the Backbone, Neck, and Head. In this paper, the YOLOv6s version is used as the comparison algorithm. The Backbone of YOLOv6s is inspired by the RepVGG [43] style structure: it is composed of RepBlocks in the training phase, and in the inference phase each RepBlock is converted into a stack of 3x3 convolutional layers (RepConv) with ReLU activation, which reduces inference latency while enhancing representation capability. The Neck replaces the CSPBlock used in YOLOv5 with RepBlock, i.e., Rep-PAN, and correspondingly adjusts width and depth; the Head adopts a hybrid-channel strategy to construct a more efficient decoupled head, further reducing computational cost.

YOLOv9 introduced the concept of Programmable Gradient Information (PGI) to address the diverse changes deep networks require to meet various objectives. Furthermore, YOLOv9 developed a new lightweight network architecture, the Generalized Efficient Layer Aggregation Network (GELAN), which employs gradient path planning to significantly improve detection performance.

YOLOv10 introduced a consistent dual assignment strategy, with dual label assignments and a consistent matching metric, to eliminate redundant predictions in post-processing without Non-Maximum Suppression (NMS). It also proposed a lightweight classification head, spatial-channel decoupled downsampling, and rank-guided block design to reduce explicit computational redundancy and achieve a more efficient architecture.

The YOLOv11 Algorithm Model: This is the latest iteration of the YOLO series, building upon YOLOv8 by refining the network architecture. It replaces the C2f module with the C3K2 module and adds an attention module, C2PSA, after the SPPF layer; it also improves the structure of the detection head. We include this model as a comparison model in our experimental evaluation.

D. EVALUATION METRICS
To validate the effectiveness and execution time of the model, this paper uses GFLOPS (giga floating-point operations per second, i.e., one billion floating-point operations per second) as a measure of the computational cost of the network model, and the mean average precision (mAP) to evaluate the accuracy of the model, calculated as shown in (10):

mAP = \frac{1}{N} \sum P_A \quad (10)

where N is the total number of categories and P_A is the area enclosed by the precision-recall curve of a category, with recall on the horizontal axis and precision on the vertical axis. We use [email protected] and mAP as evaluation metrics, where [email protected] is the mean average precision at an Intersection over Union (IoU) threshold of 0.5, and mAP is the mean average precision averaged over IoU thresholds of 0.5, 0.3, and 0.1. We also employ FPS (frames per second) to evaluate the detection speed of the model; FPS denotes the number of image frames that can be processed and output within one second, calculated as shown in (11), where t_1 is the image preprocessing time, t_2 the inference time, and t_3 the post-processing time, all in milliseconds:

FPS = \frac{1000\ \mathrm{ms}}{t_1 + t_2 + t_3} \quad (11)
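Both formulas translate directly into code. The minimal sketch below is our own illustration, assuming per-class AP values have already been computed elsewhere:

```python
def mean_average_precision(per_class_ap: list[float]) -> float:
    """Eq. (10): mAP is the mean of per-class AP values, where each AP
    is the area under that class's precision-recall curve."""
    return sum(per_class_ap) / len(per_class_ap)

def frames_per_second(t_pre_ms: float, t_infer_ms: float, t_post_ms: float) -> float:
    """Eq. (11): FPS from per-image preprocessing (t1), inference (t2),
    and post-processing (t3) times, all in milliseconds."""
    return 1000.0 / (t_pre_ms + t_infer_ms + t_post_ms)

# Example with assumed timings: t1 = 0.5 ms, t2 = 3.0 ms, t3 = 0.4 ms -> ~256 FPS
print(frames_per_second(0.5, 3.0, 0.4))
```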
E. EXPERIMENTAL RESULTS
1) Comparative Detection Experiments with Different Models
To fully evaluate the detection algorithm of the improved YOLOv8s model in this paper, ten algorithms were selected for experimental comparison: Faster R-CNN, Cascade R-CNN, and the unimproved YOLOv3-tiny, YOLOv5s, YOLOv6s, YOLOX, YOLOv8s, YOLOv9s, YOLOv10s, and YOLOv11s. Among these, Faster R-CNN and Cascade R-CNN represent the typical two-stage object detection algorithms. The mean average precision (mAP) after 100 iterations was used as the evaluation criterion for the different detection algorithms, which scientifically and reasonably assesses both the object detection capability and the computational speed of the various algorithms. The results of the experimental comparison are shown in Table 5; the mean precision after each training iteration is shown in Fig 12. The results indicate that, considering the balance between detection accuracy and speed, the YOLOv8s, YOLOv9s, and YOLOv11s models significantly outperform the other models. The improved DA-YOLOv8s model outperforms all compared models, with a 4.2% increase in [email protected] and a 3.8% improvement in average mAP relative to the baseline YOLOv8s model, although its detection speed decreases by 52.2% and its GFLOPs rise by 12.1. Given YOLO's excellent performance in detection speed, DA-YOLOv8s still achieves 257.38 FPS, which meets the requirements of industrial applications.

FIGURE 12. Comparison Chart of Training Accuracy Across Epochs. (a) Comparison of [email protected] across different epochs. (b) Comparison of average precision mAP across epochs.
FIGURE 15. Detection Results of the DA-YOLOv8s Model on the Test Dataset.

V. CONCLUSION AND FUTURE PROSPECTS
A textile defect detection algorithm based on an improved YOLOv8 is proposed to address the low detection accuracy and poor real-time performance of traditional methods. Experimental results show that replacing the C2f with DCNv2 in the YOLOv8s baseline network enhances the feature extraction capability of the network, incorporating the self-attention mechanism PSA increases the feature fusion capability in the channel and spatial dimensions, and adding a detection head improves the detection performance for small targets.

Textile defect detection is mostly dominated by small objects, so the detection of small targets remains the research focus for the next step.

REFERENCES
[1] L. Tong, W. K. Wong, and C. K. Kwong, "Fabric defect detection for apparel industry: a nonlocal sparse representation approach," IEEE Access, vol. 5, pp. 5947–5964, Feb. 2017, 10.1109/ACCESS.2017.2667890.
[2] A. Rasheed, B. Zafar, A. Rasheed, et al., "Fabric Defect Detection Using Computer Vision Techniques: A Comprehensive Review," Math. Probl. Eng., vol. 2020, no. 41, pp. 8189403.1–24, Dec. 2020, 10.1155/2020/8189403.
[3] W. Y. Zhang, J. Zhang, Y. Hou, and S. Geng, "MWGR: a New Method for Real-time Detection of Cord Fabric Defects," in Proc. 2012 Int. Conf. on Adv. Mechatronic Syst., Tokyo, Japan, Sep. 2012, pp. 458–461.
[4] D. D. Zhu, R. R. Pan, W. D. Gao, et al., "Yarn-Dyed Fabric Defect Detection Based on Autocorrelation Function and GLCM," Autex Research Journal, vol. 15, no. 3, pp. 226–232, 2015, 10.1515/aut-2015-0001.
[5] M. Guan, Z. Zhong, and Y. Rui, "Automatic Defect Segmentation for Plain Woven Fabric Images," in Proc. of 2019 Int. Conf. on Commun., Inf. Syst. and Comput. Eng. (CISCE), Haikou, China, Jul. 2019, pp. 465–468, 10.1109/CISCE.2019.00108.
[6] G. H. Hu, Q. H. Wang, and G. H. Zhang, "Unsupervised Defect Detection in Textiles Based on Fourier Analysis and Wavelet Shrinkage," Appl. Opt., vol. 54, no. 10, pp. 2963–2980, Feb. 2015, 10.1364/AO.54.002963.
[7] Y. H. Li and X. Y. Zhou, "Fabric Defect Detection with Optimal Gabor Wavelet Based on Radon," in Proc. of 2020 IEEE Int. Conf. on Power, Intell. Comput. and Syst. (ICPICS), Shenyang, China, Sep. 2020, pp. 788–793, 10.1109/ICPICS50287.2020.9202242.
[8] J. Xiang, R. R. Pan, and W. D. Gao, "Yarn-dyed Fabric Defect Detection Based on an Improved Autoencoder with Fourier Convolution," Text. Res. J., vol. 93, no. 5/6, pp. 1153–1165, Mar. 2023, 10.1177/00405175221130519.
[9] A. M. Kamoona, A. K. Gostar, A. Bab-Hadiashar, and R. Hoseinnezhad, "Point Pattern Feature-Based Anomaly Detection for Manufacturing Defects, in the Random Finite Set Framework," IEEE Access, vol. 9, pp. 158672–158681, Nov. 2021, 10.1109/ACCESS.2021.3130261.
[10] F. Alghanim, M. Azzeh, A. El-Hassan, and H. Qattous, "Software Defect Density Prediction Using Deep Learning," IEEE Access, vol. 10, pp. 114629–114641, Oct. 2022, 10.1109/ACCESS.2022.3217480.
[11] S. Mei, Y. D. Wang, and G. J. Wen, "Automatic Fabric Defect Detection with a Multi-scale Convolutional Denoising Autoencoder Network Model," Sensors, vol. 18, no. 4, pp. 1064.1–18, Apr. 2018, 10.3390/s18041064.
[12] J. F. Jing, H. Ma, and H. H. Zhang, "Automatic fabric defect detection using a deep convolutional neural network," Color. Technol., vol. 135, no. 3, pp. 213–223, Mar. 2019, 10.1111/cote.12394.
[13] S. Ma, R. Zhang, Y. Dong, Y. H. Feng, and G. Zhang, "A Defect Detection Algorithm of Denim Fabric Based on Cascading Feature Extraction Architecture," J. Inf. Process. Syst., vol. 19, no. 1, pp. 109–117, Feb. 2023, 10.3745/JIPS.04.0265.
[14] F. Xu, Y. Liu, B. Zi, and L. Zheng, "Application of Deep Learning for Defect Detection of Paint Film," in Proc. of 6th Int. Conf. on Intell. Comput. and Signal Proc. (ICSP), Xi'an, China, Apr. 2021, pp. 1118–1121, 10.1109/ICSP51882.2021.9408956.
[15] B. Zhao, M. Dai, P. Li, and X. Ma, "Data Mining in Railway Defect Image Based on Object Detection Technology," in Proc. of 2019 Int. Conf. on Data Mining Workshops (ICDMW), Beijing, China, Nov. 2019, pp. 814–819, 10.1109/ICDMW.2019.00120.
[16] Y. Zhang, Z. Zhang, K. Fu, and X. Luo, "Adaptive Defect Detection for 3-D Printed Lattice Structures Based on Improved Faster R-CNN," IEEE Trans. Instrum. Meas., vol. 71, no. 5020509, pp. 1–9, Aug. 2022, 10.1109/TIM.2022.3200362.
[17] X. Gao, M. Jian, M. Hu, M. Tanniru, and S. Li, "Faster multi-defect detection system in shield tunnel using combination of FCN and faster RCNN," Adv. in Structural Eng., vol. 22, no. 13, pp. 2907–2921, May 2019, 10.1177/1369433219849829.
[18] B. Wei, K. Hao, X. Tang, et al., "Fabric Defect Detection Based on Faster RCNN," in Proc. of Artif. Intell. on Fashion and Textiles Conf., Shanghai, China, 2019, pp. 45–51.
[19] M. An, S. Wang, L. Zheng, and X. Liu, "Fabric defect detection using deep learning: An Improved Faster R-CNN approach," in IEEE Proc. of 2020 Int. Conf. on Comput. Vis., Image and Deep Learn. (CVIDL), Chongqing, China, Jul. 2020, pp. 319–324, 10.1109/CVIDL51233.2020.00-78.
[20] M. Chen, L. Yu, C. Zhi, et al., "Improved faster R-CNN for fabric defect detection based on Gabor filter with Genetic Algorithm optimization," Comput. Ind., vol. 134, pp. 103551, Jan. 2022, 10.1016/j.compind.2021.103551.
[21] Z. Liu, W. Wu, X. Gu, et al., "Application of combining YOLO models and 3D GPR images in road detection and maintenance," Remote Sens., vol. 13, no. 6, pp. 1081.1–19, Mar. 2021, 10.3390/rs13061081.
[22] X. Liao, S. Lv, D. Li, Y. Luo, Z. Zhu, and C. Jiang, "YOLOv4-MN3 for PCB Surface Defect Detection," Appl. Sci., vol. 11, no. 24, pp. 11701.1–17, Dec. 2021, 10.3390/app112411701.
[23] S. Teng, Z. Liu, and X. Li, "Improved YOLOv3-based bridge surface defect detection by combining high- and low-resolution feature images," Buildings, vol. 12, no. 8, pp. 1225.1–18, Aug. 2022, 10.3390/buildings12081225.
[24] Z. Cong, X. Li, and Z. Huang, "Research on Brake Pad Surface Defect Detection Method based on Deep Learning," in IEEE Proc. of 2023 Int. Conf. on Advances in Elect. Eng. and Comput. Appl. (AEECA), Dalian, China, Aug. 2023, pp. 813–818, 10.1109/AEECA59734.2023.00149.
[25] X. Yue, Q. Wang, L. He, Y. Li, and D. Tang, "Research on tiny target detection technology of fabric defects based on improved Yolo," Appl. Sci., vol. 12, no. 13, pp. 6823.1–16, Jul. 2022, 10.3390/app12136823.
[26] Y. Jin and L. Di, "Textile defect detection based on multi-proportion spatial attention mechanism and channel memory feature fusion network," IET Image Process., vol. 18, no. 2, pp. 412–427, Feb. 2024, 10.1049/ipr2.12957.
[27] R. Wang, R. Shivanna, D. Cheng, et al., "DCN V2: Improved deep & cross network and practical lessons for web-scale learning to rank systems," in Proc. of the Web Conf. 2021, New York, NY, USA, Jun. 2021, pp. 1785–1797, 10.1145/3442381.3450078.
[28] H. Liu, F. Liu, X. Fan, et al., "Polarized self-attention: Towards high-quality pixel-wise regression," 2021, arXiv:2107.00782.
[29] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proc. IEEE, vol. 86, no. 11, pp. 2278–2324, Nov. 1998, 10.1109/5.726791.
[30] R. Wang, B. Fu, G. Fu, et al., "Deep & cross network for ad click predictions," in Proc. of the ADKDD'17, New York, NY, USA, Aug. 2017, pp. 1–7, 10.1145/3124749.3124754.
[31] M. H. Guo, T. X. Xu, J. J. Liu, et al., "Attention mechanisms in computer vision: A survey," Comput. Vis. Media, vol. 8, no. 3, pp. 331–368, Mar. 2022, 10.1007/s41095-022-0271-y.
[32] A. Vaswani, N. Shazeer, N. Parmar, et al., "Attention is all you need," 2017, arXiv:1706.03762.
[33] C. Feng, Y. Zhong, Y. Gao, et al., "TOOD: Task-aligned one-stage object detection," in Proc. of 2021 IEEE Int. Conf. on Comput. Vis. (ICCV), Montreal, QC, Canada, Oct. 2021, pp. 3490–3499, 10.1109/ICCV48922.2021.00349.
[34] X. Li, W. Wang, L. Wu, et al., "Generalized focal loss: Learning qualified and distributed bounding boxes for dense object detection," Advances in Neural Inf. Proc. Syst., vol. 33, 2020, pp. 21002–21012.
[35] S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks," IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 6, pp. 1137–1149, Jun. 2017, 10.1109/TPAMI.2016.2577031.
[36] Z. Cai and N. Vasconcelos, "Cascade R-CNN: Delving into high quality object detection," in Proc. of the IEEE Conf. on Comput. Vis. and Pattern Recognit. (CVPR), Salt Lake City, USA, Jun. 2018, pp. 6154–6162.
[37] J. Redmon and A. Farhadi, "YOLOv3: An incremental improvement," 2018, arXiv:1804.02767.
[38] Z. Ge, S. Liu, F. Wang, Z. Li, and J. Sun, "YOLOX: Exceeding YOLO Series in 2021," 2021, arXiv:2107.08430.
[39] C. Li, L. Li, H. Jiang, et al., "YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications," 2022, arXiv:2209.02976.
[40] C. Y. Wang, I. H. Yeh, and H. Y. M. Liao, "YOLOv9: Learning what you want to learn using programmable gradient information," 2024, arXiv:2402.13616.
[41] A. Wang, H. Chen, L. Liu, et al., "YOLOv10: Real-Time End-to-End Object Detection," 2024, arXiv:2405.14458.
[42] R. Khanam and M. Hussain, "YOLOv11: An Overview of the Key Architectural Enhancements," 2024, arXiv:2410.17725.
[43] X. Ding, X. Zhang, N. Ma, et al., "RepVGG: Making VGG-style ConvNets great again," in Proc. of the IEEE/CVF Conf. on Comput. Vis. and Pattern Recognit., Nashville, TN, USA, Jun. 2021, pp. 13733–13742, 10.1109/CVPR46437.2021.01352.