Reference : Syntactic Vs. Semantic similarity of Artificial and Real Faults in Mutation Testing S...
E-prints/Working papers : Already available on another site
Engineering, computing & technology : Computer science
http://hdl.handle.net/10993/49190
Syntactic Vs. Semantic similarity of Artificial and Real Faults in Mutation Testing Studies
English
Ojdanic, Milos mailto [University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal >]
Garg, Aayush mailto [University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) >]
Khanfir, Ahmed mailto [University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal >]
Degiovanni, Renzo Gaston mailto [University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal >]
Papadakis, Mike mailto [University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) >]
Le Traon, Yves mailto [University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SerVal >]
2023
15
Yes
[en] Fault Injection ; fault seeding ; machine learning ; mutation testing ; semantic model ; syntactic distance
[en] Fault seeding is typically used in empirical studies to evaluate and compare test techniques. Central to these techniques lies the hypothesis that artificially seeded faults involve some form of realistic properties and thus provide realistic experimental results. In an attempt to strengthen realism, a recent line of re- search uses machine learning techniques, such as deep learning and Natural Language Processing, to seed faults that look like (syntactically) real ones, implying that fault realism is related to syntactic similarity. This raises the question of whether seeding syntactically similar faults indeed results in semantically similar faults and, more generally, whether syntactically dissimilar faults are far away (semantically) from the real ones. We answer this question by employing 4 state-of-the-art fault-seeding techniques (PiTest - a popular mutation testing tool, IBIR - a tool with manually crafted fault patterns, DeepMutation - a learning-based fault seeded framework and μBERT - a mutation testing tool based on the pre-trained language model CodeBERT) that operate in a fundamentally different way, and demonstrate that syntactic similarity does not reflect semantic similarity. We also show that 65.11%, 76.44%, 61.39% and 9.76% of the real faults of Defects4J V2 are semantically resembled by PiTest, IBIR, μBERT and Deep- Mutation faults, respectively.
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SerVal - Security, Reasoning & Validation
PayPal
Researchers ; Professionals ; Students ; General public
http://hdl.handle.net/10993/49190
10.1109/TSE.2023.3277564
https://ieeexplore.ieee.org/abstract/document/10136793
FnR ; FNR13646587 > Michail Papadakis > RASoRS > Risk Analysis Of Software Requirements Specification > 01/07/2020 > 30/06/2023 > 2019

File(s) associated to this reference

Fulltext file(s):

FileCommentaryVersionSizeAccess
Open access
Syntactic_Versus_Semantic_Similarity_of_Artificial_and_Real_Faults_in_Mutation_Testing_Studies.pdfPublisher postprint4.72 MBView/Open

Bookmark and Share SFX Query

All documents in ORBilu are protected by a user license.