Predicting Software Size and Effort From Code Using Natural Language Processing
Loading...

Date
2024
Journal Title
Journal ISSN
Volume Title
Publisher
CEUR-WS
Open Access Color
OpenAIRE Downloads
OpenAIRE Views
Abstract
Software Size Measurement (SSM) holds a crucial role in software project management by facilitating the acquisition of software size, which serves as the primary input for development effort and schedule estimation. However, many small and medium-sized companies encounter challenges in conducting objective SSM and Software Effort Estimation (SEE) due to resource constraints and a lack of expert workforce. This often leads to inaccurate estimates and projects exceeding planned time and budget. Hence, organizations need to perform objective SSM and SEE with minimal resources and without relying on an expert workforce. In this research, we introduce two exploratory case studies aimed at predicting the functional size (COSMIC and Event-based size) and effort of software projects from the code using a deep-learning-based NLP model: CodeBERT. For this purpose, we collected and annotated two datasets consisting of 4800 Python and 1100 C# functions. Then, we trained a classification model to predict COSMIC data movements (entry, exit, read, write) and four regression models to predict Event-based size (interaction, communication, process) and effort. Despite utilizing a relatively small dataset for model training, we achieved promising results with an 84.5% accuracy for the COSMIC size, 0.13 normalized mean absolute error (NMAE) for the Event-based size, and 0.18 NMAE for the effort. These findings are particularly insightful as they demonstrate the practical utility of language models in SSM and SEE. © 2024 Copyright for this paper by its authors.
Description
Keywords
Artificial Intelligence, Effort Estimation, Natural Language Processing, Software Size Measurement
Fields of Science
Citation
WoS Q
N/A
Scopus Q
Q4
Source
CEUR Workshop Proceedings -- Joint of the 33rd International Workshop on Software Measurement and the 18th International Conference on Software Process and Product Measurement, IWSM-MENSURA 2024 -- 30 September 2024 through 4 October 2024 -- Montreal -- 204467
Volume
3852
