Keywords :
Model Extraction; Natural-Language Requirements; Natural Language Processing; Case Study Research
Abstract :
Domain modeling is an important step in the transition from natural-language requirements to precise specifications. For large systems, building a domain model manually is laborious. Several approaches exist to assist engineers with this task, where Natural Language Processing is employed for automated extraction of domain model elements. Despite the existing approaches, important facets remain under-explored. Notably, there is limited empirical evidence about the usefulness of existing extraction rules in industry. Furthermore, important opportunities for enhancing the extraction rules are yet to be exploited. We develop a domain model extractor by bringing together existing extraction rules and proposing important enhancements. We apply our model extractor to four industrial requirements documents, reporting on the frequency of different extraction rules being applied. We conduct an expert study over one of these documents, investigating the accuracy and overall effectiveness of our domain model extractor.
Disciplines :
Computer science
Author, co-author :
Arora, Chetan ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Sabetzadeh, Mehrdad ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Briand, Lionel ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Zimmer, Frank
External co-authors :
no
Language :
English
Title :
Extracting Domain Models from Natural-Language Requirements: Approach and Industrial Evaluation
Publication date :
October 2016
Event name :
19th International Conference on Model Driven Engineering Languages and Systems
Event place :
Saint-Malo, France
Event date :
October 2-7, 2016
Main work title :
19th International Conference on Model Driven Engineering Languages and Systems, Saint-Malo 2-7 October 2016
Peer reviewed :
Peer reviewed
FnR Project :
FNR6911386 - Enhancing The Automation And Accuracy Of Requirements Quality Assurance Processes Via Disciplined Use Of Natural Language, 2013 (01/09/2013-31/10/2016) - Chetan Arora