Great choice!
“Using Large Language Models (LLMs) for Automated Test Case
Generation” is a cutting-edge and impactful research topic. It combines AI, software
engineering, and NLP, and there's a lot of scope for innovation—perfect for a university-level
team project.
🧠 Research Title:
"Exploring the Effectiveness of Large Language Models for Automated Software Test Case
Generation"
📝 Research Plan Outline
🎯 Objective:
Investigate how effective LLMs (like GPT-4, CodeLlama, or Codex) are at generating unit tests,
integration tests, or system tests for codebases in various programming languages (Python,
Java, etc.).
👥 Team Role Breakdown (for 5 Members)
1. Literature Reviewer: Research existing work on LLMs and test generation. Compare tools like Codex, ChatGPT, CodeT5, etc. Summarize findings.
2. Dataset Collector: Gather codebases from GitHub, Codeforces, or open-source projects. Prepare datasets for test generation.
3. Model Engineer: Prompt and fine-tune LLMs (or use their APIs) to generate test cases. Evaluate different prompting strategies.
4. Evaluation Lead: Design evaluation metrics (e.g., test coverage, correctness, mutation testing). Run the generated tests and analyze results.
5. Report & Presentation Lead: Coordinate documentation, write the final report, and prepare the presentation/slides. Assist others where needed.
🧪 Research Phases & Timeline
🔹 Phase 1: Background & Literature Review (Week 1–2)
Read 8–10 relevant research papers (see below).
Understand how LLMs are used for code-related tasks.
Study traditional vs. LLM-based test generation.
🔹 Phase 2: Dataset & Tool Setup (Week 2–3)
Collect code snippets or full projects (preferably in Python/Java).
Use GitHub repos, LeetCode/Codeforces problems with solutions, or open-source apps.
Set up tools: OpenAI API (for GPT), Hugging Face (CodeT5), or any open-source LLMs (a minimal setup sketch follows this list).
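A minimal setup sketch for Phase 2, assuming Python, the Hugging Face `datasets` package for HumanEval, and the OpenAI Python client; the model name `gpt-4o`, the `ask_llm` helper, and the use of an `OPENAI_API_KEY` environment variable are placeholder choices, not fixed requirements.

```python
# Setup sketch (assumptions: `pip install datasets openai`, an OPENAI_API_KEY
# environment variable, and "gpt-4o" as a placeholder model name).
from datasets import load_dataset
from openai import OpenAI

# HumanEval: 164 Python functions with docstrings and reference tests,
# hosted on the Hugging Face Hub under the "openai_humaneval" ID.
humaneval = load_dataset("openai_humaneval", split="test")
print(humaneval[0]["prompt"])  # function signature + docstring to generate tests for

# The OpenAI client reads OPENAI_API_KEY from the environment by default.
client = OpenAI()

def ask_llm(prompt: str) -> str:
    """Send a single prompt to the model and return its text response."""
    response = client.chat.completions.create(
        model="gpt-4o",  # placeholder; swap for whichever model/API the team uses
        messages=[{"role": "user", "content": prompt}],
        temperature=0.2,
    )
    return response.choices[0].message.content
```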
🔹 Phase 3: Test Case Generation (Week 3–5)
Try multiple prompting strategies, e.g.:
- “Write unit tests for the following function…”
- “Generate boundary test cases for this method…”
Compare prompting modes (see the sketch after this list):
- Zero-shot
- Few-shot (showing 1–2 examples)
- Chain-of-thought prompting
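A sketch of how zero-shot and few-shot prompts could be built and sent, reusing the `ask_llm` helper from the setup sketch above; the template wording and the `add` example pair are illustrative assumptions, not a prescribed format.

```python
# Prompting sketch (reuses ask_llm from the setup sketch; templates are examples only).

ZERO_SHOT = "Write pytest unit tests for the following Python function:\n\n{code}\n"

FEW_SHOT = (
    "You write pytest unit tests for Python functions.\n\n"
    "Example function:\n"
    "def add(a, b):\n    return a + b\n\n"
    "Example tests:\n"
    "def test_add_positive():\n    assert add(2, 3) == 5\n\n"
    "def test_add_negative():\n    assert add(-1, -1) == -2\n\n"
    "Now write pytest unit tests for this function:\n\n{code}\n"
)

def generate_tests(code: str, strategy: str = "zero_shot") -> str:
    """Return LLM-generated test code for `code` under the chosen prompting strategy."""
    template = ZERO_SHOT if strategy == "zero_shot" else FEW_SHOT
    return ask_llm(template.format(code=code))

# Usage example on a toy function
sample = "def is_even(n):\n    return n % 2 == 0\n"
print(generate_tests(sample, strategy="few_shot"))
```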
🔹 Phase 4: Evaluation & Analysis (Week 5–7)
Evaluate (see the evaluation sketch after this list):
- Code coverage (e.g., using coverage.py or pytest-cov)
- Correctness (do the tests pass, and do they catch real bugs?)
- Comparison with human-written tests
- Mutation testing (e.g., using MutPy or PITest)
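A rough sketch of the evaluation step, assuming the generated tests are saved as `tests/test_generated.py` and the module under test is `my_module.py` (both placeholder names). It shells out to pytest with pytest-cov for coverage and to MutPy's `mut.py` CLI for mutation testing; check MutPy's documentation for runner options if the generated tests are pytest-style rather than unittest-style.

```python
# Evaluation sketch (assumptions: pytest, pytest-cov, and MutPy installed;
# my_module.py and tests/test_generated.py are placeholder names).
import subprocess

# 1) Run the generated tests and measure line coverage of the target module.
subprocess.run(
    ["pytest", "tests/test_generated.py",
     "--cov=my_module", "--cov-report=term-missing"],
    check=False,  # a failing test is a result to record, not a pipeline error
)

# 2) Mutation testing: MutPy mutates my_module, re-runs the test suite,
#    and reports how many mutants the generated tests kill.
subprocess.run(
    ["mut.py", "--target", "my_module",
     "--unit-test", "tests.test_generated"],
    check=False,
)
```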
🔹 Phase 5: Reporting & Presentation (Week 7–8)
Summarize findings in a research report (6–10 pages).
Create visualizations (bar graphs, pie charts for test coverage, etc.).
Prepare a final presentation (15–20 min talk with slides).
📚 Key Research Questions
1. Can LLMs reliably generate syntactically and semantically correct test cases?
2. What kind of prompting techniques give the best results?
3. How do LLM-generated tests compare to those from traditional automated test-generation tools?
4. Can LLMs detect edge cases or just basic scenarios?
5. What are the limitations of using LLMs in real-world CI/CD pipelines?
🧰 Tools & Technologies
LLMs/APIs: OpenAI GPT-4 API, CodeLlama, CodeT5, StarCoder
Languages: Python (easiest for testing), Java
Test Frameworks: unittest, pytest, JUnit (an example pytest test file appears after this list)
Coverage Tools: coverage.py, pytest-cov, JaCoCo
Mutation Testing: MutPy, PITest
IDE: VS Code, PyCharm
Version Control: Git, GitHub
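To make the tooling concrete, here is the kind of artifact the pipeline produces and consumes: a toy target function and a pytest-style test file an LLM might generate for it. Both are invented examples for illustration, not output from any actual model.

```python
# my_module.py (toy target function; an invented example)
def clamp(value, low, high):
    """Clamp value into the inclusive range [low, high]."""
    if low > high:
        raise ValueError("low must not exceed high")
    return max(low, min(value, high))
```

```python
# tests/test_generated.py (the kind of pytest file an LLM might return)
import pytest
from my_module import clamp

def test_value_inside_range():
    assert clamp(5, 0, 10) == 5

def test_value_below_range():
    assert clamp(-3, 0, 10) == 0

def test_value_above_range():
    assert clamp(42, 0, 10) == 10

def test_invalid_bounds_raise():
    with pytest.raises(ValueError):
        clamp(1, 10, 0)
```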
📖 Suggested Papers & Resources
Research Papers:
[1] "Automated Unit Test Generation with OpenAI's GPT Models" (Arxiv)
[2] "Evaluating Large Language Models for Code Generation and Debugging"
(Google DeepMind)
[3] "LLM4Code: Exploring Test Generation Using Language Models"
[4] "Can ChatGPT Write Effective Unit Tests?"
[5] "CodeXGLUE Benchmark" – useful for datasets and evaluations.
Datasets (optional):
CodeXGLUE
HumanEval
GitHub repos with test folders (e.g., open-source Python projects)
✅ Final Deliverables
1. Research Paper / Report (6–10 pages):
- Introduction, methodology, results, analysis, conclusion
2. Presentation Slides (10–15 slides)
3. Codebase (test scripts, prompt templates, results)
4. Evaluation Metrics Summary (charts, tables)
5. Optional: a short demo video of test generation in action
🚀 Bonus Ideas (if you want to go deeper):
Fine-tune CodeT5 or similar models on your dataset.
Develop a mini GUI or CLI tool that lets users input code and see generated test cases (a minimal CLI sketch follows this list).
Compare performance across languages (e.g., Python vs Java).
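A minimal CLI sketch for the bonus idea, reusing the `generate_tests` helper from the prompting sketch above; the argument names and default output path are placeholders.

```python
# CLI sketch for the bonus idea (assumes generate_tests from the prompting
# sketch above; argument names and defaults are placeholders).
import argparse
import pathlib

def main():
    parser = argparse.ArgumentParser(
        description="Generate unit tests for a Python source file with an LLM."
    )
    parser.add_argument("source", help="path to the Python file to test")
    parser.add_argument("--out", default="test_generated.py",
                        help="where to write the generated tests")
    parser.add_argument("--strategy", choices=["zero_shot", "few_shot"],
                        default="zero_shot")
    args = parser.parse_args()

    code = pathlib.Path(args.source).read_text()
    tests = generate_tests(code, strategy=args.strategy)
    pathlib.Path(args.out).write_text(tests)
    print(f"Wrote generated tests to {args.out}")

if __name__ == "__main__":
    main()
```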
Would you like a starter GitHub template or help drafting the prompt templates for GPT-4 to
generate test cases?