Statistical Genetic Programming: The Role of Diversity

In this chapter, a new GP-based algorithm is proposed. The algorithm, named SGP (Statistical GP), exploits statistical information, i.e. mean, variance and correlation-based operators, in order to improve the GP performance. SGP incorporates new genetic o

  • PDF / 591,868 Bytes
  • 12 Pages / 439.37 x 666.142 pts Page_size
  • 82 Downloads / 208 Views

DOWNLOAD

REPORT


Abstract In this chapter, a new GP-based algorithm is proposed. The algorithm, named SGP (Statistical GP), exploits statistical information, i.e. mean, variance and correlation-based operators, in order to improve the GP performance. SGP incorporates new genetic operators, i.e. Correlation Based Mutation, Correlation Based Crossover, and Variance Based Editing, to drive the search process towards fitter and shorter solutions. Furthermore, this work investigates the correlation between diversity and fitness in SGP, both in terms of phenotypic and genotypic diversity. First experiments conducted on four symbolic regression problems illustrate the goodness of the approach and permits to verify the different behavior of SGP in comparison with standard GP from the point of view of the diversity and its correlation with the fitness.

1 Introduction Maintaining diversity in the genetic programming is important, because it helps to prevent the GP process from a premature convergence. The lack of diversity may lead to convergence towards local optima or towards a not optimal behavior in dynamic environments. Therefore, experimental analysis of diversity can give us a better perspective about the population transition and the search process in GP. M. Amir Haeri (&)  M. M. Ebadzadeh Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran e-mail: [email protected] M. M. Ebadzadeh e-mail: [email protected] G. Folino ICAR-CNR, Rende, Italy e-mail: [email protected]

V. Snášel et al. (eds.), Soft Computing in Industrial Applications, Advances in Intelligent Systems and Computing 223, DOI: 10.1007/978-3-319-00930-8_4,  Springer International Publishing Switzerland 2014

37

38

M. Amir Haeri et al.

According to this, diversity in genetic programming is studied by many researchers working in the GP field. Some of them tried to define appropriate phenotypic or genotypic diversity measures. Rosca [9, 10] suggested a phenotypic measure based on the number of different fitness values in the population. Analogously, Langdon [7] defined genotypic diversity as the number of different structures in the population. Some of the genotypic diversity measures have been defined on the basis of the edit distance between structures in the GP population [2, 3]. Folino et al. [5] analyzed the effectiveness of parallel genetic programming models in maintaining diversity in a population, i.e. island and cellular GP, using phenotypic and genotypic entropy. Their study confirms that the considered parallel models help to promote diversity but the authors conclude no relation between diversity measures and goodness of the fitness can be obtained. Jackson [6] investigated the effects of mutation operator on enhancing the diversity in GP population. He reported that the role of mutation operator in enhancing the diversity depends on the nature of the problem. In three of his test problems mutation did not have a significant effect on any diversity measures, while in one case, mutation operator ha