Mining understandable state machine models from embedded code
- PDF / 2,092,497 Bytes
- 46 Pages / 439.642 x 666.49 pts Page_size
- 54 Downloads / 203 Views
Mining understandable state machine models from embedded code Wasim Said1 · Jochen Quante1
· Rainer Koschke2
© Springer Science+Business Media, LLC, part of Springer Nature 2020
Abstract Program understanding is a time-consuming and tedious activity for software developers. Manually building abstractions from source code requires in-depth analysis of the code. Automatic extraction of such models is possible, but cannot derive meaningful abstractions that are not already contained in the code. The automated extraction even has problems to decide which aspects of the code are important and which are not. Therefore, interactive semi-automatic approaches are the compromise of choice. In this article, we describe how state machines that describe the behaviour of a function can be extracted from code. The approach includes interaction – the user decides which aspects of the identified potentially relevant information is really relevant and which is not. This helps to reduce the resulting state machines to an understandable degree. However, these state machines in their raw form have transition conditions that are very complex and thus not understandable for humans. Therefore, we also introduce a technique to reduce these guards to an understandable form. The technique is a combination of heuristic logic minimization, exploitation of infeasible paths, and using transition priorities. We evaluate the approach on industrial embedded C code, first in a case study with hundreds of extracted state machines, and then in two experiments with professional developers. The results show that the approach is highly effective in making the guards understandable, and that guards reduced by our approach and presented with priorities are easier to understand than guards without priorities. We also show that the overall approach is beneficial for program comprehension. The guard reduction approach itself is quite generic and can also be applied to other problems. We demonstrate that for the simplification of mode switch logic. Keywords Software reengineering · Program understanding · Static analysis · Model mining · State machines · Boolean minimization · Embedded software · Legacy code
Communicated by: Massimiliano Di Penta and David D. Shepherd This article belongs to the Topical Collection: Software Analysis, Evolution and Reengineering (SANER) Wasim Said
[email protected]
Extended author information available on the last page of the article.
Empirical Software Engineering
1 Introduction Embedded systems have become more and more important in most industrial fields. Since this trend has been going on for decades, a lot of effort is nowadays spent for maintenance and evolution of legacy software. Legacy software contains accumulated expert knowledge from different people and domains, which makes it so valuable. This knowledge is often only available in the code. At the same time, aged software often is very complex (Parnas 1994). Understanding legacy software is a big challenge in software industry. It is a time consumi
Data Loading...