An Active Learning Approach for Improving the Accuracy of Automated Domain Model Extraction

Arora, Chetan; Sabetzadeh, Mehrdad; Nejati, Shiva; Briand, Lionel

doi:10.1145/3293454

Article (Scientific journals)

2019 • In ACM Transactions on Software Engineering and Methodology, 28 (1)

Peer Reviewed verified by ORBi

Permalink
https://hdl.handle.net/10993/37054

Files

Full Text

TOSEM_ASNB.pdf

Author postprint (1.09 MB)

Details

Keywords :

Requirements Engineering; Active Learning; Natural-language Requirements; Domain Modeling; Case Study Research

Abstract :

Domain models are a useful vehicle for making the interpretation and elaboration of natural-language requirements more precise. Advances in natural language processing (NLP) have made it possible to automatically extract from requirements most of the information that is relevant to domain model construction. However, alongside the relevant information, NLP extracts from requirements a significant amount of information that is superfluous, i.e., not relevant to the domain model. Our objective in this article is to develop automated assistance for filtering the superfluous information extracted by NLP during domain model extraction. To this end, we devise an active-learning-based approach that iteratively learns from analysts’ feedback over the relevance and superfluousness of the extracted domain model elements, and uses this feedback to provide recommendations for filtering superfluous elements. We empirically evaluate our approach over three industrial case studies. Our results indicate that, once trained, our approach automatically detects an average of ≈ 45% of the superfluous elements with a precision of ≈ 96%. Since precision is very high, the automatic recommendations made by our approach are trustworthy. Consequently, analysts can dispose of a considerable fraction – nearly half – of the superfluous elements with minimal manual work. The results are particularly promising, as they should be considered in light of the non-negligible subjectivity that is inherently tied to the notion of relevance.
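
To make the reported figures concrete, the following minimal sketch shows how precision and recall relate for a filter that flags superfluous elements. It is an illustration only, not the authors' implementation: the function name `precision_recall` and all counts are hypothetical, chosen so that the arithmetic lands near the ≈ 96% precision and ≈ 45% detection rate quoted in the abstract.

```python
def precision_recall(true_positives, false_positives, false_negatives):
    """Return (precision, recall) for a filter that flags superfluous elements.

    true_positives:  flagged elements that really are superfluous
    false_positives: flagged elements that are actually relevant
    false_negatives: superfluous elements the filter did not flag
    """
    flagged = true_positives + false_positives
    actual = true_positives + false_negatives
    precision = true_positives / flagged if flagged else 0.0
    recall = true_positives / actual if actual else 0.0
    return precision, recall


# Hypothetical counts, NOT taken from the article:
# 200 superfluous elements exist; the filter flags 94 elements,
# 90 of which are truly superfluous (4 are wrongly flagged, 110 are missed).
precision, recall = precision_recall(
    true_positives=90, false_positives=4, false_negatives=110
)
print(f"precision={precision:.2f} recall={recall:.2f}")
# prints "precision=0.96 recall=0.45"
```

Under these assumed counts, an analyst who accepts every recommendation discards 90 of the 200 superfluous elements and wrongly discards only 4 relevant ones, which is the trade-off the abstract describes.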

Research center :

Interdisciplinary Centre for Security, Reliability and Trust (SnT) > Software Verification and Validation Lab (SVV Lab)

Disciplines :

Computer science

Author, co-author :

Arora, Chetan ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT) ; SES Networks > Systems Engineering

Sabetzadeh, Mehrdad ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT)

Nejati, Shiva ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT)

Briand, Lionel ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT)

Language :

English

Title :

An Active Learning Approach for Improving the Accuracy of Automated Domain Model Extraction

Publication date :

February 2019

Journal title :

ACM Transactions on Software Engineering and Methodology

ISSN :

1049-331X

Publisher :

Association for Computing Machinery (ACM), United States

Volume :

28

Issue :

1

Peer reviewed :

Peer Reviewed verified by ORBi

Focus Area :

Computational Sciences

European Projects :

H2020 - 694277 - TUNE - Testing the Untestable: Model Testing of Complex Software-Intensive Systems

FnR Project :

FNR11601446 - Reconciling Natural-language Requirements And Model-based Specification For Effective Development Of Critical Infrastructure Systems, 2017 (01/11/2017-31/10/2019) - Chetan Arora

Funders :

FNR - Fonds National de la Recherche [LU]
CER - Conseil Européen de la Recherche [BE]
CE - Commission Européenne [BE]

Available on ORBilu :

since 29 October 2018

Statistics

Number of views

622 (155 by Unilu)

Number of downloads

714 (79 by Unilu)
