2023 9th International Conference on Advanced Computing and Communication Systems (ICACCS)

Automated Test Case Generation Using T5 and GPT-3

Abstract— Test case generation for a given topic can be a challenging and time-consuming task, particularly when the conversation about the topic includes boundary conditions and requirements. This process requires a complete understanding of the system and its scenarios to cover all possible examples, both good and bad. This paper proposes an automated solution for generating test cases using natural language processing techniques. The proposed methodology uses the T5 and GPT-3 models to extract the context and topic of a conversation and generate test cases. The model identifies relevant keywords and generates test cases based on them, providing context-based meanings of specific words and defining new terms used in the conversation. The proposed approach eliminates the need for manual test case generation and allows for the testing of more complex systems with a larger number of scenarios. This reduces the dependency on the expertise of the test designer, resulting in complete coverage of all scenarios. Additionally, this automation saves time and resources, making the process of test case generation more manageable. By utilizing NLP techniques, it helps to ensure that all scenarios are taken into account and that the generated test cases are comprehensive and accurate.

Keywords—Natural Language Processing, T5 Model, GPT-3, Text Conversation, Test Cases, Text Summarisation.

I. INTRODUCTION

Generating test cases from a conversation can present various obstacles that make the task arduous and time-consuming. One of the major challenges is the need for a thorough comprehension of the system and its potential scenarios to cover all examples, both positive and negative. This can be difficult, as the conversation about the topic may involve boundary conditions and requirements that must be considered.

Another issue is the dependency on the test designer's expertise. The test designer must have a solid understanding of the system and its scenarios to create comprehensive and accurate test cases. However, this dependency can result in overlooked sub-systems when generating test cases, leading to incomplete coverage of all scenarios.

Additionally, the manual process of creating test cases can be time-consuming and prone to human error, resulting in wasted resources and inefficiencies in the testing process. Furthermore, the test designer may miss certain edge cases while creating the test cases due to a lack of attention or understanding of the system.

This paper intends to develop an automated solution for test case generation for a given topic using natural language processing techniques, with the goal of improving the efficiency, accuracy, and completeness of the test case generation process. It also reduces the dependency on the expertise of the test designer and ensures complete coverage of all scenarios. This will not only save time and resources but also make the process of test case generation more manageable.

A. Implications Of The Study And Benefits

There is a requirement to improve the efficiency, accuracy, and completeness of the test case generation process by reducing the dependency on the expertise of the test designer and ensuring complete coverage of all scenarios. The potential benefits of the proposed solution include saving time and resources, making the process of test case generation more manageable, and improving the quality of various software applications such as automated customer service, quality assurance for chatbots, language understanding for virtual assistants, user testing for mobile apps, and text-based game testing.

Automating test case generation can offer many potential benefits to organizations involved in software development and testing. These benefits include reduced time and effort required for testing, improved quality of testing, reduced costs, improved collaboration and communication within development teams, and improved overall quality of software products.

The literature survey provides examples of related research, including methods for generating short test cases using an ensemble technique, model-based approaches for generating test cases from business process architecture, and techniques for improving the performance of natural
language understanding models. This study implies that an automated solution for test case generation using natural language processing techniques could provide significant benefits for the software development industry, by improving the efficiency and effectiveness of the testing process.

B. Applications

Automated customer service: By analysing text conversations between customers and support representatives, test cases can be automatically generated to ensure that the customer service system is able to handle common issues and questions.

Quality assurance for chatbots: Test cases can be generated from text conversations between users and chatbots to ensure that the chatbot is able to understand and respond appropriately to user input.

Language understanding for virtual assistants: Test cases can be generated from text conversations between users and virtual assistants to evaluate the assistant's ability to understand natural language and perform tasks.

User testing for mobile apps: Test cases can be generated from text conversations between users and the mobile app to evaluate the app's usability and functionality.

Text-based game testing: Test cases can be generated from text conversations between players and the game's NPCs (non-player characters) to ensure that the NPCs are able to understand and respond to player input.

As an aid to manual test case generation: By analysing text conversations between users, test cases can be generated that cover the common scenarios and edge cases, which can then be reviewed and refined by manual testers.

II. LITERATURE SURVEY

Zhuang et al. (2018) describe a method for generating short test cases using an ensemble technique [1]. It utilizes multiple rankers and a generation-based approach, and the final response is chosen based on the rankings provided by an ensemble module. However, it is important to note that the output of an ensemble model is difficult to predict and understand, making it less interpretable. Additionally, assembling an ensemble model can be challenging and time-consuming, and may result in a model with lower prediction accuracy than a single model.

The paper by Yazdani et al. (2019) aims to address the issue of a significant proportion of software errors stemming from a lack of understanding during the early stages of development [2]. To achieve this, the paper presents a model-based approach for automatically generating test cases from business process architecture and process characterizations. Using process models is an efficient way to find test cases for business sectors. The proposed approach includes a process-model-to-state-graph conversion algorithm, which sets up the model for data creation. Subsequently, a tool is used to automatically build the test cases based on the identified preconditions. The proposed method provides an efficient way to identify and generate test cases, which can help reduce errors and improve the software development process.

The paper [3] by Katuri et al. (2022) shows the development of an interface that transforms speech and other auditory inputs using a digital filter. One of the primary benefits of this technology is that it helps to avoid software errors in the translation of one system to another. As a result, it can prevent issues such as gender recognition errors and speech recognition failures. This interface can improve the accuracy of speech and auditory input translation, leading to more effective communication and less confusion.

Liu et al. (2021) investigated the sensitivity of GPT-3 to the choice of in-context examples [4]. The paper proposes a non-parametric selection approach called KATE, which retrieves in-context examples based on their semantic similarity to test samples. The study shows that this method improves GPT-3's performance on several natural language understanding and generation tasks compared to a random sampling baseline. Additionally, it was found that fine-tuning sentence embeddings for retrieval on task-related datasets results in further performance gains. The study suggests that this work will aid in understanding GPT-3's behavior and could be a useful step toward improving its few-shot capabilities.

The study [5] by Ni et al. (2021) presents a two-stage contrastive learning strategy for fine-tuning a pre-trained text-to-text model, T5, to create sentence encoders (ST5). The study suggests three architectures and compares the performance of encoder-only and encoder-decoder designs as sentence encoders. The results of extensive SentEval benchmark trials show that encoder-only models have high transfer performance, while encoder-decoder models are more effective on textual similarity tasks. The study also finds that increasing the model size from millions to billions of parameters results in significant benefits for sentence embedding, describes efficient techniques for creating ST5 from trained models, and highlights the value of increasing model size. The results imply that future advancements in the size and quality of pre-trained text-to-text models could lead to further benefits for sentence encoder models.

The P-tuning method proposed by Liu et al. (2021) enhances the natural language understanding (NLU) of GPT-3 by automatically searching for better prompts in the continuous space [6]. This method reduces dependence on a large validation set, has fewer adversarial prompts, and reduces overfitting. P-tuning recovers 64% of world knowledge and enables GPT-style models to compete with similar-size BERT models in NLU on the SuperGLUE benchmark. It also outperforms state-of-the-art methods on the few-shot SuperGLUE benchmark, improving on bidirectional models.

The paper [7] by Zolotareva et al. (2020) addresses abstractive text summarization, which has recently made significant progress by moving from linear models with sparse, handcrafted features to nonlinear neural network models with dense inputs. This success is due to the ability of deep learning models to capture complex patterns in natural language data without relying on handcrafted features. In this study, the authors investigate text summarization using sequence-to-sequence recurrent neural networks and Transfer Learning with a Unified Text-
to-Text Transformer approach. The experimental results demonstrate that the Transfer Learning-based model achieves considerable improvement for abstractive text summarization. The authors provide a comprehensive review of related works on abstractive text summarization and discuss the advantages and limitations of various approaches. They also discuss the potential applications of abstractive text summarization in various domains, including news and social media.

The commentary in the paper [8] by Floridi et al. focuses on the nature of reversible and irreversible questions and their relevance to the analysis of GPT-3, a third-generation autoregressive language model that uses deep learning to produce human-like text. The authors discuss three tests based on mathematical, semantic, and ethical questions to demonstrate that GPT-3 is not designed to pass any of them, and caution against interpreting GPT-3 as a step toward general artificial intelligence. The authors provide a literature survey on the industrialization of automatic and cheap production of good semantic artifacts, emphasizing the potential consequences of this development. Overall, the commentary highlights the need for a critical examination of the capabilities and limitations of AI technologies such as GPT-3, and the impact they may have on society.

The paper by Vel et al. (2021) highlights the need for efficient ways to access and extract relevant information from the exponential growth of digital information [9]. Text mining has emerged as a solution by enabling the mining of relevant information from unstructured or semi-structured text documents. The paper provides a comprehensive overview of text mining, including various concepts, techniques, pre-processing steps, applications, and issues, making it a valuable resource for researchers and practitioners interested in the field of text mining.

With the advent of automation, there is an increasing need for automated systems in answer assessment to reduce the workload of manual evaluators [10]. However, current systems are limited to option-based questions, making it difficult to evaluate theory answers. This paper presents the AutoEval system, an automatic exam paper evaluation system that uses natural language processing (NLP) methods to assess written responses. By automating the paper correction process, the system provides a reliable and standardized approach to evaluation while reducing the time and resources required for manual assessment. The paper discusses the use of NLP for grammatical analysis, syntactic analysis, semantic similarity, and database storage to evaluate exam papers. The study highlights the limitations of manual evaluation and the benefits of automated systems in providing unbiased and efficient assessment.

III. PROPOSED METHODOLOGY

This paper proposes a methodology that uses the T5 and GPT-3 models together. The T5 model is an encoder-decoder model that converts all natural language processing (NLP) problems into a text-to-text format. It is trained using a method called teacher forcing, which requires an input sequence and a corresponding target sequence for training. The input text is given by the user, but unnecessary information such as timestamps, names, and background noise is removed [11].

The context and topic of the conversation are derived from the pre-processed text using the YAKE extractor. Key phrases are selected by identifying 1-3-gram candidates that do not contain punctuation marks and do not begin or end with a stop word; the candidates are then weighted using the YAKE weighting scheme, and the 10 highest-scoring candidates are chosen as the key phrases [12].

The context is then tokenized using the SentencePiece library, and relevant sentences are extracted using the keywords. The sentences are combined and given as input to the T5 model for sequence-to-sequence encoding. Attention masks and input IDs are obtained from the encoding and are used to generate test cases for the given context.

We leverage the natural language generation (NLG) capability of the GPT-3 model, which is trained on online text to produce natural human-like text and can generate relevant text in response to any input. This allows the generated test cases to be expressed in natural language, including computer code and summaries. By using this methodology, the efficiency and accuracy of test case generation are improved, while the dependency on human expertise is reduced and comprehensive coverage of all possible scenarios is ensured. Essentially, the main purpose of using GPT-3 is to check the output generated by T5 and refine it further. The basic flow chart of the implementation is shown in Fig. 1.

Fig. 1. Basic flow chart of the implementation

A. Pre-Processing

When dealing with text in the form of documents, pre-processing is an essential step to prepare the data for further analysis.

The first step in document pre-processing involves removing unnecessary information such as timestamps and names, which are irrelevant to the text analysis. This can be achieved using a text wrapper function that puts the text in a more viewable format.

Once the content has been extracted, the next step is to tokenize the sentences. Tokenization is the process of breaking down the text into smaller components, or tokens, such as words or phrases, that can be easily processed by a computer.

In addition to tokenization, it is also important to identify the key phrases or topics within the text. This can be achieved using an unsupervised algorithm such as Multipartite or YAKE, which analyses the text to identify the most important words and phrases. The specific algorithm used may depend on the type of document being analysed and the desired outcome of the analysis.
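The pre-processing and key-phrase selection steps described above can be sketched in Python. This is a minimal sketch, not the paper's implementation: the timestamp and speaker-name patterns are illustrative assumptions about the transcript format, the stop-word list is abbreviated, and candidates are scored by simple frequency as a stand-in for the full YAKE statistical weighting.

```python
import re
from collections import Counter

# Abbreviated stop-word list for illustration; a real system would use a full list.
STOP_WORDS = {"the", "a", "an", "of", "to", "and", "is", "in", "on",
              "should", "at", "least", "be", "it", "for", "with"}

def preprocess(transcript: str) -> str:
    """Strip timestamps and speaker names (assumed formats), keeping the utterances."""
    text = re.sub(r"\[\d{2}:\d{2}(:\d{2})?\]", " ", transcript)   # e.g. "[12:03]"
    text = re.sub(r"^\s*\w+:\s*", "", text, flags=re.MULTILINE)   # e.g. "Tom: "
    return re.sub(r"\s+", " ", text).strip()

def candidate_phrases(text: str, max_n: int = 3):
    """Yield 1-3-gram candidates that span no punctuation and neither
    start nor end with a stop word, mirroring the YAKE candidate rules."""
    for chunk in re.split(r"[^\w\s]+", text):   # punctuation breaks candidates
        words = chunk.lower().split()
        for n in range(1, max_n + 1):
            for i in range(len(words) - n + 1):
                gram = words[i:i + n]
                if gram[0] in STOP_WORDS or gram[-1] in STOP_WORDS:
                    continue
                yield " ".join(gram)

def top_key_phrases(text: str, k: int = 10):
    """Score candidates (here: raw frequency) and keep the k highest-scoring."""
    counts = Counter(candidate_phrases(text))
    return [phrase for phrase, _ in counts.most_common(k)]

transcript = """Tom: Password should be 8 characters long
Jane: Password should contain at least one uppercase letter"""
print(top_key_phrases(preprocess(transcript)))
```

On this fragment, "password" is the only repeated candidate and therefore ranks first; the real YAKE weighting additionally considers casing, position, and co-occurrence statistics.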
The pre-processing steps incorporated are shown in Fig. 2.

Fig. 2. Preprocessing steps incorporated

B. NLU Using T5 Model

The context produced after pre-processing is tokenized using the SentencePiece library, and relevant sentences are extracted using the keywords. The sentences are combined and given as input to the T5 model for sequence-to-sequence encoding. Attention masks and input IDs are obtained from the encoding and are used to generate test cases for the given context.

The T5 model is a text-to-text encoder-decoder model that is trained using teacher forcing, which requires input and target sequences. It is used to extract the context and topic of the conversation and generate the test cases. The input text is pre-processed before being fed to the model. The T5 model is considered one of the best models for natural language understanding. The processing done by the T5 model is shown in Fig. 3.

Fig. 3. Processing done by T5 Model

C. NLG Using GPT-3 Model

GPT-3 is trained to produce natural human text using online text and can generate relevant text in response to any input. The understanding from the T5 model, i.e., the attention mask and input IDs, is passed on to the GPT-3 model for its natural language generation capability. GPT-3 can generate relevant text for any input, including computer code and summaries. The processing done by the GPT-3 model is shown in Fig. 4.

Fig. 4. Processing done by GPT-3 Model

IV. DATASET

Training the T5 model with the WebNLG 2020 dataset has the potential to result in a high-quality natural language generation model. The dataset includes over 36,000 examples in two domains, with SPO triples and reference texts for each triple. However, training a high-quality model is challenging due to the complexity of the task, which requires the model to understand semantics and generate natural-sounding text. The quality of the resulting model depends on several factors, including the quality of the training data, the hyperparameters of the model, and the training process. Careful evaluation and hyperparameter tuning are crucial for achieving the best results.

After training the T5 model, GPT-3 is used to check its output. One way to do this is to use GPT-3 as a post-processing step, where the output of the T5 model is fed as input to GPT-3 to further refine the generated text. This can help improve the overall quality and coherence of the generated text.

V. EXPERIMENTAL RESULTS

The proposed approach of using T5 and GPT-3 for automated test case generation has the potential to significantly impact software testing. Traditionally, test case generation is a manual and time-consuming process, which can often result in inadequate test coverage and missed defects. With the use of machine learning models like T5 and GPT-3, it may be possible to automate the process of test case generation, thereby reducing the time and effort required for testing. The approach could potentially improve the accuracy and completeness of the generated test cases, leading to more effective testing and, ultimately, better quality software. It may also help to identify edge cases and corner cases that are often missed in manual test case generation. However, there are also potential limitations and challenges with this approach, such as the need for large amounts of training data and the risk of generating redundant or irrelevant test cases. Therefore, it is important to carefully evaluate and validate the effectiveness of this approach before implementing it in practice.

Consider a sample conversation. This can even be obtained as a transcript from one of the various meeting apps.

A. Testcase 1

The conversation here is between three people discussing passwords and the restrictions imposed on them. This is just an example; the conversation can be about anything and the proposed model will work on it, for instance a conversation about palindromes, or about identifying words starting with a particular letter and of fixed length. The output generated from the website using the proposed model is shown in Fig. 5.

Tom: Password should be 8 characters long
Jane: Password should contain at least one uppercase letter
Kjel: Password should contain at least one number
Jane: Password should contain at least one special character

Fig. 5. The output generated from the website using the proposed model
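For this conversation, the expected output amounts to checks on the four stated password rules, including boundary values such as a 7-character versus an 8-character password. The sketch below is hand-written to illustrate the kind of test cases the pipeline is expected to produce; it is not literal model output, and "8 characters long" is read here as a minimum length, which is an assumption.

```python
import re

def is_valid_password(password: str) -> bool:
    """Encodes the four rules stated by Tom, Jane, and Kjel."""
    return (
        len(password) >= 8                                  # Tom: at least 8 characters (assumed minimum)
        and re.search(r"[A-Z]", password) is not None       # Jane: at least one uppercase letter
        and re.search(r"[0-9]", password) is not None       # Kjel: at least one number
        and re.search(r"[^A-Za-z0-9]", password) is not None  # Jane: at least one special character
    )

# Test cases covering each rule and its boundary condition.
test_cases = [
    ("Passw0rd!", True),    # satisfies all four rules
    ("Pass0rd!", True),     # exactly 8 characters (length boundary)
    ("Pw0rd!!", False),     # 7 characters: too short
    ("passw0rd!", False),   # no uppercase letter
    ("Password!", False),   # no number
    ("Passw0rd", False),    # no special character
]

for password, expected in test_cases:
    result = is_valid_password(password)
    status = "PASS" if result == expected else "FAIL"
    print(f"{status}: {password!r} -> {result} (expected {expected})")
```

Each negative case violates exactly one rule, which is the edge-case coverage the paper argues an automated generator should provide.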
training data and the specific use case. It is important to carefully evaluate and compare different test case generation techniques before selecting an approach for a specific software testing project.

D. Limitations

Using GPT-3 as a post-processing step can be computationally expensive and time-consuming, as GPT-3 is a large and complex model that requires significant computational resources to run. Additionally, it may not always be necessary or practical to use GPT-3 to check the output of a T5 model, especially if the T5 model is performing well on its own.

Ultimately, the decision to use GPT-3 to check the output of a T5 model depends on the specific use case and the desired level of quality and refinement for the generated text.

Currently, the model also does not perform well when a large number of test cases is required. The model either cannot produce that many outputs, or the produced output may not be completely accurate.

VI. DISCUSSION

A. Explanation of Results

Consider a scenario where a company wants to test a new e-commerce platform they have developed. The platform has various features such as user registration, product search, shopping cart, checkout, and payment gateway. In order to test the platform, the testing team needs to generate a variety of test cases that cover all possible scenarios.

The traditional process of test case generation can be time-consuming and complex, requiring a deep understanding of the platform and its features. The testing team needs to create test cases for all possible scenarios, including boundary conditions and requirements. The manual process of test case generation can be prone to errors and may result in missed sub-systems, leading to incomplete coverage of scenarios.

To address these challenges, an automated solution using natural language processing techniques such as T5 and GPT-3 can be employed. By using these models, the system can generate test cases from text conversations with improved efficiency, accuracy, and completeness. This approach reduces the dependency on the expertise of the test designer and ensures that all scenarios are taken into account, resulting in comprehensive and accurate test cases.

For example, if a customer sends a text message to the customer support team saying "I am unable to complete the checkout process", the NLP models can be used to generate test cases that cover scenarios such as incorrect payment details, server downtime, or issues with the product itself. The generated test cases can be executed on the e-commerce platform, ensuring that all possible scenarios are covered and the platform is thoroughly tested.

The results of this study suggest that the use of NLP techniques in test case generation can be a promising solution, although the test cases in this paper are only a small sample of what is expected. The automated approach can save time and resources while improving the accuracy and completeness of test cases. However, there are limitations to this approach, such as the need for high-quality training data and the potential for model biases. Therefore, it is essential to validate the generated test cases and ensure that they cover all scenarios adequately.

Areas to be worked on could include developing a framework for validating and refining the generated test cases, evaluating the performance of different NLP models in test case generation, and exploring the potential of incorporating machine learning techniques for improving the efficiency and accuracy of the process. Additionally, the impact of the proposed automated approach on the overall testing process and its effectiveness in detecting faults could be evaluated in real-world scenarios.

B. Business Model

The business model provides a test case generation service for software development teams, quality assurance departments, and testing firms. Customers will provide conversation text, which will be pre-processed by removing irrelevant information. The model offers a subscription-based pricing system with different tiers and service level agreements. Various marketing channels will be used to promote the service, including digital ads and social media marketing. The business model aims to automate the test case generation process using NLP techniques to improve accuracy, reduce errors, and save time and resources.

VII. CONCLUSION AND FUTURE WORK

This paper proposes an automated solution for generating test cases from a given conversation. Our approach uses the T5 model for natural language understanding (NLU) of the conversation and GPT-3 for natural language generation (NLG) of the test cases. The T5 model is pre-trained on the WebNLG 2020 dataset and fine-tuned on the conversation. The GPT-3 model is fine-tuned using the knowledge obtained from the T5 model's NLU of the conversation's context. By combining the T5 model, which is well-suited for NLU tasks, and the GPT-3 model, which excels in NLG tasks, our proposed model can generate test cases without any human intervention. The model identifies relevant keywords and key phrases and generates test cases based on them. Additionally, it helps define new terms used in the conversation and provides context-based meanings of specific words. This model can be used for test-based development and will help in generating accurate and comprehensive test cases.

The proposed approach eliminates the need for manual test case generation, which can be tedious and error-prone. It also allows for the testing of more complex systems with a larger number of scenarios. By using natural language processing techniques, the methodology can quickly analyze large amounts of textual information and identify the most relevant information for generating test cases. This significantly reduces the time and resources required for test case generation.