A Novel Binary DE Based on the Binary Search Space Topology

Abstract

Differential evolution is known as a simple and well-argued evolutionary algorithm that demonstrates the high performance in many hard black-box optimization problems with continuous variables. The main feature of differential evolution is the difference-based mutation. The mutation explores the search space using the distribution of points in the population and usually can well adapt to the objective function landscape. There exist some modifications of differential evolution for the binary search space. The proposed approach involves the understanding of the binary space topology for developing a better analogue of the difference-based mutation. We have compared the proposed binary differential evolution algorithm with the standard binary genetic algorithm using a set of binary test problems, including hard deceptive problems. The preliminary experimental results shows that new binary differential evolution is a competitive search algorithm and outperforms the binary genetic algorithm in reliability for some problems but yields it in the required number of function evaluations. The proposed approach for developing mutation in DE can be expanded to all other mutation schemes for increasing the performance.

Keywords: Binary differential evolution, genetic algorithm, black-box optimization

Introduction

Differential evolution (DE) is known as a simple and well-argued evolutionary algorithm (EA) that demonstrates the high performance in many hard black-box optimization problems with continuous variables (Storn & Price, 1997). DE demonstrates the high performance even in its originally proposed form. Nevertheless, many modifications for different classes of optimization problems have been proposed. Many of DE-based approaches have become competition winners and are state-of-the-art (Molina et al., 2018, Stanovov et al., 2021). Unlike many EAs, DE uses another evolutionary cycle. In the general EA cycle, selection is used for defining the fittest individuals for mating. Then crossover produces an offspring that contains a recombination of parents’ genes. Mutation is used as an additional operator for preventing the premature convergence and makes random and usually weak changes in genes of the offspring. The main feature of DE is the difference-based mutation. The operator plays the most important role in the search process and is applied before crossover and selection for generating a new random solution based on the distribution of the population in the search space. Such mutation usually can well adapt to the landscape of the objective function, identifies regularities and separable components, can walk along ravines. The benefits make DE attractive to work with other search spaces, and especially with the binary space.

There exist some modifications of differential evolution for the binary search space. The standard approach uses DE with continuous variable in [ 0 , 1 ] , which are rounded to 0 or 1 . The simple rounding can be substituted by the algorithm from the binary PSO, which defines 0 or 1 by evaluating the sigmoid function from the velocity of changes in genes (Pampara et al., 2006). The angle modulated DE (AMDE) makes use of angle modulation, a technique derived from the telecommunications industry, to implement a mapping between binary and continuous spaces (Engelbrecht & Pampara, 2007). Many approaches operate the binary space directly and design new mutation schemes for the representation (Chen et al., 2015; Deng et al., 2011; Peng et al., 2016). Some alternative approaches don’t design analogs of operators in continuous DE, but develop new operators while maintaining the general DE’s scheme (Banitalebi et al., 2016; Wang et al., 2012).

In this study, we propose a new approach that involves the understanding of the binary space topology for developing a better analogue of the continuous difference-based mutation. The proposed binary difference-based mutation involves such elements of the binary space as the Boolean hypercube, the neighbourhood of a binary vector, and the shortest path. The proposed binary differential evolution algorithm has been compared with the standard binary genetic algorithm using a set of binary test problems, including hard deceptive problems. The experimental results shows that new binary differential evolution is a competitive search algorithm and outperforms the binary genetic algorithm in reliability for some problems but yields it in the required number of function evaluations. At the same time, the results are preliminary, because the proposed approach for developing mutation in DE can be expanded to all other mutation schemes and state-of-the-art self-adaptive approaches for increasing the performance.

The rest of the paper is organized as follows. Section 2 describes the problem statement and the proposed approach. Sections 3 presents experimental setups and the experimental results. In conclusion, the proposed methods and the obtained results are summarized, and some further research is suggested.

Problem Statement

The standard DE algorithm solves the continuous global optimization problem (1), where the objective function is assumed to be a black-box model:

f x m i n x R n , f : x R ,(1)

here x = ( x 1 , , x n ) and i : x i S R n , usually l b i x i r b i with left ( l b ) and right ( r b ) bounds.

Many real-world optimization problems use binary decision variables, but the quality of a solution is still estimated using a real value. Binary variables can be also the result of encoding of mixed-type variables or the result of the discretization of continuous variables. In this case, we deal with pseudo-Boolean optimization that uses the following formal problem statement is (2):

f x m i n x 0,1 n , f : x R ,(2)

here x = ( x 1 , x 2 , , x n ) is a Boolean vector of n coordinates, n N + , f is an objective function. There are no assumptions on the type and properties of the objective function, thus it is considered as a black-box model.

Research Questions

One of approaches for understanding and analysing the topology of the binary search space is the representation using an n -dimensional Boolean hypercube (Björklund et al., 2004; Boros & Hammer, 2002). Let’s make some important definitions.

Let d H x i , x j = k = 1 n x i k - x j k is the Hamming distance between two binary vectors x i and x j .

Let N 1 x i = x : d H x , x i = 1 is a set of 1-neighbour vectors. A set of k-neighbour vectors can be defined at the same manner: N k x i = x : d H x , x i = k .

All binary vectors are vertices of the Boolean hypercube. Each pair of 1-neighbour vectors defines an edge of the hypercube.

Let a set p a t h x 0 , x m = { x 0 , x 1 , , x m - 1 , x m } is a path from a vertex x 0 to a vertex x m , then x i , x i + 1 p a t h x 0 , x m : x i + 1 N 1 ( x i ) . The length of the path is equal to p a t h x 0 , x m = m .

If d H x 0 , x m = k and p a t h x 0 , x m = k , when p a t h s x 0 , x m is the shorted path .

If d H x 0 , x m > 1 , there are multiple shortest paths from x 0 to x m .

Now let’s consider the general scheme of a DE algorithm (see Algorithm 1). It contains three main steps, namely the difference-based mutation, crossover, and selection.

Algorithm 1. DE

1Set the population size N, the scaling factor F, and the crossover probability CR.

2Initialize population of random solutions {x_1,x_2,…,x_N } and evaluate their fitness values.

3while the termination criteria are not met do

4for i=1,…,N do

5Generate a mutant vector v_i using one of mutation schemes.

6Perform crossover of x_i and v_i and get a trial vector u_i.

7Perform selection. If the trial vector u_i is better than x_i, save u_i in the population.

8for end

9while end

The following notation for choosing specific DE’s operator is used: D E / M / D / C , where M is a mutation strategy, D is the number of difference vectors in the mutation scheme, C is a type of crossover. Let’s discuss the most popular and simple DE: D E / r a n d / 1 / b i n , which uses mutation with random vectors, only one difference vector, and the binominal crossover operator. The DE algorithm was proposed for continuous optimization problems and the mutation scheme for r a n d / 1 is defined as (3):

v i = x r 1 + F ( x r 2 - x r 3 ) ,(3)

here r 1 , r 2 , and r 3 are indexes of different random vectors from the population, i.e. r 1 r 2 r 3 i , and F is the scale factor. The difference-based mutation produces a new point v i in the search space by moving from x r 1 in the direction from x r 3 to x r 2 at the distance defined by F (if F = 1 , the distance is equal to the length of ( x r 2 - x r 3 ) ). Figure 1 demonstrates an example of such mutation.

Figure 1: The difference-based mutation operator in DE
The difference-based mutation operator in DE
See Full Size >

Purpose of the Study

The purpose of the study is the development of a way for building a mutant vector in the binary search space that uses the conception similar to mutation in the continuous search space.

First, we will define the direction from x r 3 to x r 2 in the binary space. By analogy with the continuous search space, we must define the shortest path from x r 3 to x r 2 . This can be done using definition 4. Because of several shortest paths in the binary space, we must choose one of them. We can do it at random because of the mutation conception. Second, we must evaluate a point along the chosen direction at the distance defined by the scaling factor F . Because of the discrete search space, the point on the path can be defined as a point that belongs to the neighbourhood N l x r 3 (definition 2), where l { 0,1 , , d H x r 2 , x r 3 } is defined using F . It is obvious that N 0 x r 3 = x r 3 and N d H x r 2 , x r 3 x r 3 = x r 2 . We can define l by rounding up the result of the following product (4):

l = F d H x r 2 , x r 3 , F ( 0,1 ] .(4)

Thereby, the F ( x r 2 - x r 3 ) component of the difference-based mutation can be defined as a random vector that belongs to the shortest path from x r 3 to x r 2 and is located on the distance l from x r 3 (5):

x d N l x r 3 p a t h S ( x r 2 , x r 3 ) .(5)

Finally, we need to make the movement from x r 1 in the direction defined by x d . For the binary space the movement means that a mutant vector v i will be in N l x r 1 and its l components will differ in the same positions as components differ in x r 3 and x d . Such a way will save components, which stay unchanged in the shortest path, by analogy with the movement in the continuous space if the direction ( x r 2 - x r 3 ) is parallel to some of coordinate axes. The whole algorithm for performing the r a n d / 1 mutation in the binary space can be defined using properties of the exclusive disjunction (XOR) operation (6):

v i = x r 1 x d x r 3 .(6)

In the same manner, one can evaluate all the rest of mutation schemes. In this study, we will also test the c u r r e n t - t o - b e s t / 1 scheme (7)-(8), which is reported as one of the best non-adaptive mutation schemes (Mohamed et al., 2021) and r a n d / 2 as it produces better diversity in the population (9)-(10).

v i = x i x d 1 x i x d 2 x r 2 ,(7)

x d 1 N l x i p a t h s ( x b e s t , x i ) , x d 2 N l x r 2 p a t h S ( x r 1 , x r 2 ) ,(8)

here x i is a vector from the population that is used for building a trial vector, x b e s t is the best-found solution, r 1 , r 2 are indexes of different random vectors from the population, i.e. r 1 r 2 i .

v i = x r 1 x d 1 x r 3 x d 2 x r 5 ,(9)

x d 1 N l x r 3 p a t h s ( x r 3 , x r 2 ) , x d 2 N l x r 5 p a t h S ( x r 5 , x r 4 ) ,(10)

here r 1 , …, r 5 are indexes of different random vectors from the population, i.e. r 1 r 2 r 3 r 4 r 5 i .

We don’t need to make any changes in the crossover and selection operators because they don’t depend on the search space properties.

Research Methods

The proposed approach has been implemented using C++ in the multi-thread mode for parallel computations using multicore CPUs. We have chosen the following set of test problems: the O n e M a x problem with n = 100 for evaluating the general ability of convergence, 6 hard massively multimodal and deceptive functions [we use the original notation f 11 , …, f 16 as in (Yu & Suganthan, 2010)] with n equal to 30 , 30 , 24 , 30 , 30 , and 40 .

Binary DE have been tested with various combinations of parameters F and C r from the following sets: F = { 0.1 , 0.25 , 0.5,0.75,0.9 } and C r = { 0.1 , 0.25 , 0.5,0.75,0.9 } . The performance of binary DE has been compared with the standard binary genetic algorithm (GA). Binary GA have been tested with various settings the following sets: the type of crossover = {one-point, two-point, uniform} and the probability of mutation = { 1 / 3 n , 1 / n , 3 / n }.

The budget of function evaluations ( m a x F E s ) and its distribution in generations are: m a x F E s = 20000 , the population size ( p o p S i z e ) is 100, and the number of generations ( m a x G e n s ) is 200 for O n e M a x ; m a x F E s = 50000 (as it is proposed by authors of the benchmark), and p o p S i z e / m a x G e n s = 100 / 500 , 250 / 200 for f 11 , …, f 16 . The number of independent runs for each combination of parameters for each test function is 100. The performance measures are the rate of successful runs ( S R ) when the global optimum was found (an estimation of the probability of finding the global solution) and the average number of F E s before the global optimum was found (only for successful runs).

Findings

The experimental results for the best settings of each algorithm are presented in Table 1. An example of the typical convergence plot for f 11 averaged over 100 runs is shown in Figure 2.

Table 1 - The experimental results
See Full Size >
Figure 2: The convergence plot for f11 in one run
The convergence plot for f11 in one run
See Full Size >

Conclusion

In the study, we have proposed a novel binary DE that uses the difference-based mutation in the binary search space by analogy how it operates in the continuous space. The mutation operator uses such definitions as the neighbourhood, the shortest path over the Boolean hypercube. The preliminary experimental results have shown that the proposed approach is able to solve hard global pseudo-Boolean optimization problems and for many of considered problems the success rate of finding the global optimum is 100%. At the same time, we can see that the approach is slower that GA, which uses random mutation. We assume that binary DE can find a more accurate solution for large-scale problem with the limited budget of FEs, because it monotonically converges instead of randomly generating new points. In this study, we have tested only three simple schemes of mutation. State-of-the-art DE algorithms use many additional mechanisms such as restarts, the control of the population size, the self-adaptive control of parameters F and C r , and self-adaptive mutation. In out further work, we will investigate one of the best DE approach SHADE in the binary space and will extend the test problem set with known benchmarks.

Acknowledgments

This research was funded by the Ministry of Science and Higher Education of the Russian Federation, Grant No. 075-15-2022-1121.

References

  • Banitalebi, A., Abd Aziz, M. I., & Aziz, Z. A. (2016). A self-adaptive binary differential evolution algorithm for large scale binary optimization problems. Information Sciences, 367, 487-511.

  • Björklund, H., Sandberg, S., & Vorobyov, S. (2004). Complexity of Model Checking by Iterative Improvement: The Pseudo-Boolean Framework. In M. Broy, & A. V. Zamulin (Eds), Lecture Notes in Computer Science (Vol. 2890, pp. 381-394). https://doi.org/10.1007/978-3-540-39866-0_38

  • Boros, E., & Hammer, P. L. (2002). Pseudo-Boolean optimization. Discrete Applied Mathematics, 123(1-3), 155-225.

  • Chen, Y., Xie, W., & Zou, X. (2015). A binary differential evolution algorithm learning from explored solutions. Neurocomputing, 149, 1038-1047.

  • Deng, C., Zhao, B., Yang, Y., Peng, H., & Wei, Q. (2011). Novel binary encoding differential evolution algorithm. In Y. Tan, Y. Shi, Y. Chai, & Wang, G. (Eds), Advances in Swarm Intelligence: Second International Conference, ICSI 2011, Chongqing, China, June 12-15, 2011, Proceedings, Part I 2 (pp. 416-423). Springer.

  • Engelbrecht, A. P., & Pampara, G. (2007). Binary differential evolution strategies. In 2007 IEEE congress on evolutionary computation (pp. 1942-1947). IEEE.

  • Mohamed, A. W., Hadi, A. A., & Mohamed, A. K. (2021). Differential Evolution Mutations: Taxonomy, Comparison and Convergence Analysis. IEEE Access, 9, 68629-68662.

  • Molina, D., LaTorre, A., & Herrera, F. (2018). SHADE with iterative local search for large-scale global optimization. In 2018 IEEE congress on evolutionary computation (CEC) (pp. 1-8). IEEE.

  • Pampara, G., Engelbrecht, A. P., & Franken, N. (2006, July). Binary differential evolution. In 2006 IEEE international conference on evolutionary computation (pp. 1873-1879). IEEE.

  • Peng, H., Wu, Z., Shao, P., & Deng, C. (2016). Dichotomous Binary Differential Evolution for Knapsack Problems. Mathematical Problems in Engineering, 1-12.

  • Stanovov, V., Akhmedova, S., & Semenkin, E. (2021). NL-SHADE-RSP Algorithm with Adaptive Archive and Selective Pressure for CEC 2021 Numerical Optimization. IEEE Congress on Evolutionary Computation (CEC) (pp. 809-816). IEEE.

  • Storn, R., & Price, K. (1997). Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces. Journal of Global Optimization, 11, 341-359.

  • Wang, L., Fu, X., Mao, Y., Menhas, M. L., & Fei, M. (2012). A novel modified binary differential evolution algorithm and its applications. Neurocomput, 98, 55-75.

  • Yu, E. L., & Suganthan, P. N. (2010). Ensemble of niching algorithms. Information Sciences, 180(15), 2815-2833.

Copyright information

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

About this article

Publication Date

27 February 2023

eBook ISBN

978-1-80296-960-3

Publisher

European Publisher

Volume

1

Print ISBN (optional)

-

Edition Number

1st Edition

Pages

1-403

Subjects

Cite this article as:

Sopov, E. (2023). A Novel Binary DE Based on the Binary Search Space Topology. In P. Stanimorovic, A. A. Stupina, E. Semenkin, & I. V. Kovalev (Eds.), Hybrid Methods of Modeling and Optimization in Complex Systems, vol 1. European Proceedings of Computers and Technology (pp. 328-335). European Publisher. https://doi.org/10.15405/epct.23021.40