Discourse Structure Analysis for Requirement Mining

  • Received : 2013.10.24
  • Accepted : 2013.12.28
  • Published : 2013.12.30

Abstract

In this work, we first introduce two main approaches to writing requirements and then propose a method based on Natural Language Processing to improve requirement authoring and the overall coherence, cohesion, and organization of requirement documents. We investigate the structure of requirement kernels and then the discourse structure associated with those kernels. This analysis enables a system to accurately extract requirements and their related contexts from texts, a task we call requirement mining. Finally, we report a first experiment on requirement mining based on texts from seven companies. We conclude with an evaluation that compares those results with manually annotated corpora of documents.
