Predicting Software Size and Effort From Code Using Natural Language Processing

Date

2024

Publisher

CEUR-WS

Abstract

Software Size Measurement (SSM) plays a crucial role in software project management because it yields software size, the primary input for development effort and schedule estimation. However, many small and medium-sized companies struggle to conduct objective SSM and Software Effort Estimation (SEE) because of resource constraints and a shortage of expert staff. This often leads to inaccurate estimates and to projects that exceed their planned time and budget. Hence, organizations need to perform objective SSM and SEE with minimal resources and without relying on experts. In this research, we present two exploratory case studies that predict the functional size (COSMIC and Event-based size) and effort of software projects from code using a deep-learning-based NLP model, CodeBERT. For this purpose, we collected and annotated two datasets consisting of 4800 Python and 1100 C# functions. We then trained a classification model to predict COSMIC data movements (entry, exit, read, write) and four regression models to predict Event-based size (interaction, communication, process) and effort. Despite the relatively small training dataset, we achieved promising results: 84.5% accuracy for the COSMIC size, a normalized mean absolute error (NMAE) of 0.13 for the Event-based size, and an NMAE of 0.18 for the effort. These findings demonstrate the practical utility of language models in SSM and SEE. © 2024 Copyright for this paper by its authors.
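
The evaluation measures named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not say how NMAE is normalized, so the sketch assumes the mean absolute error is divided by the range of the true values. It also assumes that each code unit receives exactly one predicted data-movement label. Under COSMIC, each data movement contributes one COSMIC Function Point (CFP), so the size is taken as the count of predicted movements. All function names and sample values are hypothetical.

```python
from collections import Counter

# The four COSMIC data movement types listed in the abstract.
MOVEMENTS = ("entry", "exit", "read", "write")


def accuracy(y_true, y_pred):
    """Fraction of predicted labels that match the annotated labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def nmae(y_true, y_pred):
    """Mean absolute error divided by the range of the true values.

    Assumption: range normalization. The source does not state which
    normalization the authors used.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    mae = sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)
    value_range = max(y_true) - min(y_true)
    if value_range == 0:
        raise ValueError("range of true values is zero; NMAE is undefined")
    return mae / value_range


def cosmic_size(predicted_movements):
    """Total CFP (one per data movement) and the count per movement type."""
    counts = Counter(m for m in predicted_movements if m in MOVEMENTS)
    return sum(counts.values()), dict(counts)


if __name__ == "__main__":
    # Made-up labels and values, for illustration only.
    labels_true = ["entry", "read", "write", "exit"]
    labels_pred = ["entry", "read", "read", "exit"]
    print(accuracy(labels_true, labels_pred))
    print(nmae([10, 20, 30], [12, 18, 33]))
    print(cosmic_size(labels_pred))
```

Range normalization keeps the error comparable across targets with different scales, such as size and effort. Dividing by the mean of the true values is an equally common choice and would give different numbers.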

Keywords

Artificial Intelligence, Effort Estimation, Natural Language Processing, Software Size Measurement

WoS Q

N/A

Scopus Q

Q4

Source

CEUR Workshop Proceedings -- Joint of the 33rd International Workshop on Software Measurement and the 18th International Conference on Software Process and Product Measurement, IWSM-MENSURA 2024 -- 30 September 2024 through 4 October 2024 -- Montreal -- 204467

Volume

3852
