Constraint Model for the Satellite Image Mosaic
Selection Problem
Manuel Combarro Simón # 
University of Luxembourg, Luxembourg
Interdisciplinary Centre for Security, Reliability and Trust (SnT), Luxembourg

Pierre Talbot # 
University of Luxembourg, Luxembourg
Interdisciplinary Centre for Security, Reliability and Trust (SnT), Luxembourg

Grégoire Danoy # 
University of Luxembourg, Luxembourg
Interdisciplinary Centre for Security, Reliability and Trust (SnT), Luxembourg

arXiv:2312.04210v1 [cs.AI] 7 Dec 2023

Jedrzej Musial # 
Poznan University of Technology, Poland

Mohammed Alswaitti # 
University of Luxembourg, Luxembourg
Interdisciplinary Centre for Security, Reliability and Trust (SnT), Luxembourg

Pascal Bouvry # 
University of Luxembourg, Luxembourg
Interdisciplinary Centre for Security, Reliability and Trust (SnT), Luxembourg

Abstract
Satellite imagery solutions are widely used to study and monitor different regions of the Earth.
However, a single satellite image can cover only a limited area. In cases where a larger area of
interest is studied, several images must be stitched together to create a single larger image, called a
mosaic, that can cover the area. Today, with the increasing number of satellite images available for
commercial use, selecting the images to build the mosaic is challenging, especially when the user
wants to optimize one or more parameters, such as the total cost and the cloud coverage percentage
in the mosaic. More precisely, for this problem the input is an area of interest, several satellite
images intersecting the area, a list of requirements relative to the image and the mosaic, such as
cloud coverage percentage, image resolution, and a list of objectives to optimize. We contribute to
the constraint and mixed integer lineal programming formulation of this new problem, which we call
the satellite image mosaic selection problem, which is a multi-objective extension of the polygon
cover problem. We propose a dataset of realistic and challenging instances, where the images were
captured by the satellite constellations SPOT, Pléiades and Pléiades Neo. We evaluate and compare
the two proposed models and show their efficiency for large instances, up to 200 images.
2012 ACM Subject Classification Computing methodologies → Discrete space search
Keywords and phrases constraint modeling, satellite imaging, set covering, polygon covering.
Digital Object Identifier 10.4230/LIPIcs.CP.2023.23
Category Short Paper
Supplementary Material Software (Source Code): https://github.com/mancs20/mosaic_image_
combination/tree/cp2023
archived at swh:1:dir:c776d54cd26c5685fd8a0a8ddfc2131771884b32
Funding Manuel Combarro Simón: This work is partially funded by the Luxembourg National
Research Fund (FNR)—ASTRAL Project, ref. 17043604, and by the joint research programme
UL/SnT-ILNAS on Technical Standardization for Trustworthy ICT, Aerospace, and Construction.
© Manuel Combarro Simón, Pierre Talbot, Grégoire Danoy, Jedrzej Musial, Mohammed Alswaitti, and
Pascal Bouvry;
licensed under Creative Commons License CC-BY 4.0
29th International Conference on Principles and Practice of Constraint Programming (CP 2023).
Editor: Roland H. C. Yap; Article No. 23; pp. 23:1–23:16
Leibniz International Proceedings in Informatics
Schloss Dagstuhl – Leibniz-Zentrum für Informatik, Dagstuhl Publishing, Germany

23:2

Satellite Image Mosaic Selection Problem

Figure 1 Optimization of the subset of images to make a mosaic of the Tokyo Bay region.

Pierre Talbot: This work is supported by the FNR—COMOC Project, ref. C21/IS/16101289.
Jedrzej Musial: This work is funded by the FNR—PolLux program under the SERENITY Project,
ref. C22/IS/17395419

1

Introduction

The space industry is continuously growing and is no longer an exclusive market for military
and government applications. According to the most recent report of the European Union
Agency for the Space Programme (EUSPA) [13], the global market for navigation systems
and earth observation (EO) had revenues of around €200 billion in 2022 and is expected to
reach €500 billion by 2031. As access to space has become cheaper, an increasing number
of private companies have entered the space business. Due to advances in satellite design
and high-resolution remote sensors, the number of satellite launches dedicated to EO in 2021
was greater than the sum of launches between 2012 and 2016 [37]. In 2020 more than 100
terabytes of satellite imagery was generated per day [28].
There are several EO-based applications that analyze a vast area of interest (AOI)
that can only be covered by combining several adjacent images into a larger one, called a
mosaic. Mosaics are crucial for applications such as crop classification [21, 14], environmental
monitoring [30, 15], and urban development analysis [38, 32]. The mosaicking of satellite
images is a complicated process that presents challenges, such as color balancing [39] and
image stitching [27].
In this work, we focus on the combinatorial problem of selecting the images to create
the mosaic by optimizing one or several criteria. This problem is an extended version of
the NP-hard problem of finding the minimum axis-parallel rectangle cover of a rectilinear
polygon without holes [11], where the axis-parallel rectangles can be seen as the satellite
images, and the rectilinear polygon as the AOI. In our problem, a cover is the subset of
images that can be used to generate a mosaic. In Figure 1 a particular example of this
problem is shown, where the objective is to build a mosaic using the smaller number of
images. There are 30 images to choose from, and the optimization algorithm finds an optimal
subset of four images.
In this paper, we present a multi-objective approach for this problem that seeks to
optimize four popular parameters of satellite images for mosaic generation: cloud cover,
incidence angle [1], resolution and cost of the images. In general, there might not be an
objective that is more important than the other, which is why we propose a multi-objective
approach, instead of a linear aggregation or lexicographic ordering of the objectives.
As the number of available satellite images has increased significantly, it is becoming
more challenging to select the optimal combination of images to build a mosaic. The number
of images covering one place can reach hundreds. This is even more difficult if the user is
interested in optimizing several parameters. Without a computational approach for this,

M. Combarro Simón, P. Talbot, G. Danoy, J. Musial, M. Alswaitti, and P. Bouvry

23:3

users have to select by hand the images they want for the cover. Providing a Pareto front
from which users can choose a cover is crucial to save money and time, considering that
high-resolution satellite images are expensive.
In this paper, we propose a constraint programming (CP) model and a mixed integer linear
programming (MILP) model to solve the problem presented. However, directly modeling this
geometric problem with constraints is challenging, as we would need to encode geometric
operations such as union and intersection of polygons. Instead, we preprocess each instance
by computing a discretization where the intersections of all images are first computed
(Section 2.1). We obtain a set of non-overlapping polygons where each polygon is simply an
integer and the geometric characteristics can be ignored. This problem is a multi-objective
extension of the well-known set covering problem (Section 3).
A unique aspect of this problem is to minimize the cloud coverage of the mosaic, which, in
contrast to other objectives, does not have the same value throughout the image. While any
part of the image has the same resolution, not all parts of the images have the same amount
of clouds, except for images with 0 or 100% of cloud coverage. Because of this particularity
of the problem, it is possible to reduce the cloud coverage percentage in the final mosaic by
choosing a specific combination of images in such a way that cloudy regions of an image are
overlapped by non-cloudy regions of other images.To the best of our knowledge, there is no
work taking this into account to reduce the cloud coverage in the final mosaic. We call this
problem the satellite image mosaic selection problem (SIMS) (Section 2).
The main contribution of this paper is to introduce the SIMS problem and present a
CP model, as well as MILP model that can successfully find solutions to real instances of
up to 200 images (Section 4). The constraint and mixed integer programming approaches
are part of a larger framework where the images are automatically retrieved from different
marketplaces and the solutions found by the solver can be visualized.
Although SIMS can be expressed as a linear problem (Section A), we choose to rely on
constraint programming for two reasons: to ease the formulation of the model—in particular,
it is convenient to use set variables—and because this problem aims to be extended for new
requirements, which can be non-linear. The flexibility of the model is of utmost importance
in this work, which is why constraint programming is our main choice.

2

Satellite Image Mosaic Selection Problem

The input for the SIMS problem is an area of interest (AOI) on Earth and a set of satellite
images that intersect it. Each image has a cost and a list of parameters including the
resolution, incidence angle and cloud coverage. The AOI is represented as a simple closed
polygon without holes, and the images are represented as quadrilaterals. For both the AOI
and satellite images, the corner coordinates are provided. With that information, the AOI
and the images can be represented in the plane.
As clouds are not usually even distributed in the images, having images with a certain
cloud coverage percentage does not guarantee that the final mosaic has less than that cloud
coverage. Depending on the cloud distribution in the images, the final mosaic can have a
lower or higher percentage of cloud coverage as depicted in Figure 2. This is not the case for
the other objectives because they have a unique value along the image. For example, if all
images have a determined resolution, the final mosaic will have the same resolution.

CP 2023

23:4

Satellite Image Mosaic Selection Problem

Figure 2 A cloudy region of an image can be covered by a non-cloudy region of another image,
impacting the cloud coverage percentage of the final mosaic.
Table 1 Number of intersections for the instances covering the Tokyo Bay region.
Images
30
50
100
150
200

2.1

Intersections
298
806
3278
8079
14855

Preprocessing of the Problem

To make the discretization, we first remove the parts of the images that are outside the AOI,
and then we find all the polygons resulting from the intersection of the images, we find the
polygons using the GEOS library [17]. The universe is partitioned into a set of polygon
elements, and each of them is assigned to its corresponding images. In Figure 3, we show
an example of this process, where we generate 254 polygons from 30 images. In Table 1,
we show the number of intersections for different cardinality of the images set to cover the
Tokyo Bay region.
The following step is to detect the clouds in the images and add them to the universe
and to the correspondig sets. The objective of doing this is to know whether a region of the
final mosaic, that is represented by one element of the universe, is free of clouds or not. We
consider that a region of the AOI is free of clouds if there is at least one image in the cover
in which that region does not have clouds.

Figure 3 298 polygons are obtained after preprocessing of 30 images. First the area of the images
outside the AOI is removed and then the polygons resulting from the images intersections are found.

M. Combarro Simón, P. Talbot, G. Danoy, J. Musial, M. Alswaitti, and P. Bouvry

I

I

1
4

5
5

7

II

6

3
6

23:5

1
4

II

7
2

2

(a) Before the cloud integration the universe (b) Regions represented by element 5 and 6 are
had 3 elements and 7 after the integration.
free of clouds in the final mosaic

Figure 4 Possible cases for the integration of the detected clouds to the universe and the
corresponding sets. Set I and II are represented by the rectangles with red and blue borders
respectively.

A cloudy region is an element in the universe that is present in all the sets that cover
that region. For the set that represents the image in which the cloud occurred, we make a
distinction and we say that the element is cloudy for that set. In this way, a set is composed
of cloudy elements and non-cloudy elements.
In real applications, clouds can be detected using cloud detectors [36, 19, 22]. As this
is a problem orthogonal to our work, we do not detect the clouds, but instead randomly
allocate them in the parts of the image. For each image, we have metadata indicating the
cloud coverage percentage of the image. With that information and knowing the elements
that belong to the image, we randomly set one of the elements as cloudy. We repeat this
operation until the cloud coverage percentage of the image is achieved.
In Figure 4a, all possible scenarios of how clouds are converted to elements of the universe
are shown for the general case. To facilitate the understanding of this process, only two
overlapping images are shown, but the procedure is the same when more than two images
overlap. Images I and II are represented as rectangles with red and blue borders, respectively.
The clouds in image I are colored red, and the ones in image II are colored blue. Initially,
both sets have in common its intersection, element 3 (I = {1, 3} and II = {2, 3}). We can see
that both clouds of I are partially covered by II; in one case is because one part of the cloud
is in the intersection, and in the other case is because one part of the cloud is overlapped by
a cloud from II, so it can not be completely covered by a non-cloudy region of II. From the
previous, we can see that three elements are created, 4, 5 and 7. Element 4 are the clouds
that are only present in I, element 5 is the cloudy region of I that is not cloudy in II, i.e.
covered by II, and element 7 is a cloudy region of I that is also cloudy in II. Element 6 is
similar to element 5, is a cloudy area in II that is not cloudy in I. When all the clouds are
detected and incorporate to the universe, the original three elements 1, 2 and 3 are modified
as follows: element 1 is the non-cloudy region of I that is not overlapped by any other image,
element 2 is equivalent to element 1 but for image II, and element 3 is the non-cloudy area
of the intersection between I and II. Finally, the universe has seven elements, and the sets
are I = {1, 3, 4, 5, 6, 7}, II = {2, 3, 5, 6, 7}. In Figure 4b, we show a resulting mosaic after
covering the clouds. Importantly, taking into account the clouds in this way increases the
cardinality of the universe, but the problem itself does not change, which is why we took a
simpler approach to randomly assign clouds to each element.

CP 2023

23:6

Satellite Image Mosaic Selection Problem

3

Constraint Model

Let U = {k1 , . . . , kn } ⊂ N be a set of n parts of the area of interest, called the universe.
The set U is a polygon partition of the area of interest, i.e. two parts do not overlap and
their union is exactly the area of interest. Each satellite image is represented by a collection
Pi ⊂ U of parts. We write I = {P1 , . . . , Pm } the set of all m satellite images. The goal is to
find a subset T ⊂ {1, . . . , m} of images that covers the area of interest. The parameters of
the model are U and I while T is the main decision variable. The set covering constraint is
captured by the following:
[
Pi = U
(1)
i∈T

A trivial solution to this constraint is to take all the images, but we usually consider
an optimization version where the cardinality of T is minimized. In our case, each image
i ∈ {1, . . . , m} has a cost Wi ∈ N that we seek to minimize:

min

X

Wi

(2)

i∈T

Along with Equation 1, this problem is called weighted set cover. Depending on the user
requirements, we can consider other objectives such as resolution and incidence angle. For
each part k ∈ U , we have its area Ak ∈ N. And for each image i ∈ {1, . . . , m}, we have its
resolution Ri ∈ N and its incidence angle Fi ∈ N. We seek to minimize (the resolution is
given in how many cm2 represents a pixel, the less the better) the best resolution obtained
for each part:
X
min
min{Ri | i ∈ T, k ∈ Pi }
(3)
k∈U

For the incidence angle, we seek to minimize the maximal angle, although other choices
would be possible such as minimizing the average.
min {max {Fi | i ∈ T }}

(4)

A more challenging aspect of this problem is to minimize the area covered by clouds.
To achieve that, we consider that each part is either cloudy or not. We leave the cloud
detection and the splitting of the image into cloudy and non-cloudy parts to a preprocessing
step. Let Ci ⊂ Pi the cloudy parts of the image i. For each part k ∈ U , we define
Dk := {i ∈ {1, . . . , m} | k ∈ Pi \ Ci } the set of all images containing a non-cloudy view of
the part k. For each part k ∈ U , we can now define the Boolean variable Vk to be true when
the part k is cloudy in the cover:
^
Vk ⇔
i∈
/T
(5)
i∈Dk

We can now minimize the area covered by clouds:
X
min
Vk ∗ Ak

(6)

k∈U

This objective can also be turned into a constraint if the user only wants covers with a
certain cloud coverage threshold.

M. Combarro Simón, P. Talbot, G. Danoy, J. Musial, M. Alswaitti, and P. Bouvry

23:7

The model introduced is actually linear, as shown in Appendix A, and can be solved
by mixed integer programming solvers. We simply represent the set T by m 0-1 variables
{x1 , . . . , xm } such that xi = 1 if we take the image and xi = 0 otherwise. We also use this
representation for constraint programming solvers, because it is not possible to represent the
set covering constraint otherwise—this is due to T having a non-fixed cardinality.

3.1

Search Strategy Based On Greedy Algorithm

A well-known greedy algorithm for the set covering problem consists in taking the images
covering the most uncovered parts of the universe first [9]. We model this heuristic as a
search strategy within the MiniZinc constraint model. This has the advantage of always
producing a solution that is at least as good as the greedy heuristics—since it is the first
solution found. To achieve that, we reuse an existing search strategy provided by MiniZinc.
A second advantage is that our search strategy can be reused with any constraint solver
compatible with MiniZinc. We select the variable using the anti_first_fail strategy—the
variable with the largest domain is selected first—and we take the highest value in its
domain (indomain_max). The trick is to model a set of variables {G1 , . . . , Gm } such that
Gi ∈ {0, . . . , |Pi |} is equal to the number of parts covered by the image i. Actually, in any
solution, we have Gi = |Pi | since the whole universe must be covered. What is interesting is
the value of Gi in partial assignments during the search. The difference between the upper
and lower bounds max(Gi ) − min(Gi ) is the number of parts that are currently uncovered
by the partial assignment, and that can be covered by the image i. Since the anti-first-fail
strategy selects the largest domain first, it effectively implements the greedy heuristics. We
model Gi as follows:
Gi =

X _
k ∈ Pi )
(

(7)

k∈Pi i∈T

We note that the new variables Gi are fully defined with the parts, and therefore once the
main decision variable T is assigned, the variables Gi must be assigned as well.

3.2

Multi-Objective Constraint Optimization Algorithm

The multi-objective constraint programming algorithm used in this work was pionneered by
Gavanelli [16] and has been frequently used in constraint optimization [23, 33, 18]. The main
idea is to run a satisfaction constraint solver iteratively and add new constraints representing
the Pareto front to ensure the next solution is not dominated by any point in the current
Pareto front. To illustrate this algorithm, suppose a biobjective maximization problem where
x and y are the two variables to optimize. We run the constraint solver which returns a first
satisfiable solution where x = 10 and y = 5. At that point the Pareto front is {(10, 5)}. We
add to the model the constraint x > 10 ∨ y > 5 which guarantees that the next solution
will not be dominated by the current points in the Pareto front. The solver might then
find the solution x = 2 and y = 6 which is incomparable to the previous solution and is
added to the Pareto front {(10, 5), (2, 6)}. The constraint generated from the Pareto front
is now (x > 10 ∨ y > 5) ∧ (x > 2 ∨ y > 6). This process continues until the solver finds an
unsatisfiable solution, in which case we are guaranteed to have found the optimal Pareto
front.

CP 2023

23:8

Satellite Image Mosaic Selection Problem

4

Evaluation

4.1

Dataset Description

For each experiment, there is an AOI and a number of images that cover the AOI. The
objective is to find the Pareto front, where each point in the front represents a subset of
images that must cover the AOI and optimize four objectives: cost, resolution, incidence
angle and cloud coverage.
To carry out this research, we developed a framework capable of retrieving image metadata
from different satellite marketplaces, preprocessing it (discretization and cloud integration to
the universe), calling a CP or a MILP solver, and visualizing the solutions from the Pareto
front.
We selected five AOIs from around the world: Mexico City (Mexico), Rio de Janeiro
(Brazil), Paris (France), Lagos (Nigeria), and Tokyo Bay (Japan). For each AOI, we obtained
all available images that were captured from 01-01-2021 to 01-01-2023 by the following
satellite constellations SPOT [4], Pléiades [2] and Pléiades Neo [3]. We opted for those
satellite constellations as they have all the metadata used in the experiments; other satellite
constellations lacked some parameters such as cloud coverage or incidence angle.
Five instances were generated for all AOIs, except Lagos. Each of these instances differs
from each other by the number of images given to cover the AOIs. The number of images for
the instances were 30, 50, 100, 150 and 200. For Lagos, the total number of images available
for the specified date range was 145, so the number of images for the Lagos instances were
30, 50, 100 and 145.

4.2

Experimental Setup

Each instance was solved using the CP model and the MILP model. We run the CP model
with two solvers, OR-tools [31] and Gecode 6.3.0 [34]. For each of these solvers, we run the
experiments twice; one with the default solver search strategy, and the other one with the
greedy search strategy proposed in Section 3.1. The MILP model was implemented using
the Gurobi solver [20]. We will refer to these five approaches as OR-tools default, OR-tools
greedy, Gecode default, Gecode greedy and Gurobi
For the MILP model, the algorithm used to obtain the exact Pareto front was SAUGMENCON [40] which is based on the AUGMECON [25, 26] algorithm and on the well-known
ϵ-constraint method. In the ϵ-constraint methods, one objective is optimized, and the others
are added as constraints to the model. The right-hand side of the objective constraints
gradually changes from the less restrictive values of the objectives to the most restricted
ones. This process continues until all combinations of values for the constraint objectives
have been explored. The SAUGMENCON method introduces two acceleration mechanisms
to improve the computational efficiency of the front generation.
The experiments were run on an AMD Epyc ROME 7H12 processor (64 cores, 280W).
All the solvers were configured to run in parallel with 8 cores and 16 threads. The running
time for each experiment was 1 hour.

4.3

Experimental Results

To compare the results, we used the hypervolume of the Pareto front, which is a standard
metric for comparing fronts in multiobjective optimization. For each instance, we score the
strategies, calculating how worst they are compared to the best. For example, a score of 1

M. Combarro Simón, P. Talbot, G. Danoy, J. Musial, M. Alswaitti, and P. Bouvry

23:9

14

Best hypervolume

12
10
8
6
4
2
0

OR-tools defaultOR-tools greedy Gecode default Gecode greedy

Gurobi

Figure 5 Number of times each approach had the best hypervolume.
0,9

1

0,87

0,9
0,8

0,847

0,7

Score average

0,81
0,78
0,75
0,72

0,781

0,775
0,73

0,69

0,713

0,921
0,844

0,787

0,768

0,743

Gecode greedy

Gurobi

0,6
0,5
0,4
0,3
0,2

0,66

0,1

0,63
0,6

Score average

0,84

OR-tools default OR-tools greedy Gecode default

Gecode greedy

Gurobi

(a) Considering the 24 instances.

0

OR-tools default OR-tools greedy Gecode default

(b) Considering the 21 instances where all strategies
found a solution.

Figure 6 Average score for each approach.

means that the strategy has the same hypervolume value as the best one, and a score of 0.5
means that the hypervolume of the strategy is half of the best hypervolume for that instance.
In Table 2 from Appendix C we can see the hypervolume values for each strategy for all
instances.
For only one instance, the complete Pareto front was found in the running time, the
strategies that found the complete front were OR-tools default, OR-tools greedy and Gurobi.
For the rest of the instances, the whole Pareto front was not found, and the hypervolume
corresponds to the partial front found during the running time. For 3 of the 4 instances with
200 images, the CP strategies could not find any point of the front; the entire running time
was employed by the FlatZinc submodule of Minizinc, to flatten the model with the data file.
However, Gurobi for 3 of these instances could find one point of the Pareto front. For the
rest of the instances, all the strategies could find at least one point of the Pareto front, and
generally the CP approaches obtained superior results compared to MILP.
As we can see in Figures 5 and 6a, for these experiments, the best approach was OR-tools
default, being the best for 13 out of 24 instances and with a score average of 0.847. The
second and third best strategies were OR-tools greedy and Gurobi, with a similar performance.
The fourth and fifth places were occupied by Gecode default and greedy, being really close.
If we do not consider the 3 instances in which the CP strategies could not find a solution,
the score averages change; see Figure 6b. OR-tools default has an average score very close
to 1, and OR-tools greedy has a much better average score than Gurobi, which for these
instances is the worst strategy on average. The difference in the average score between both
Gecode strategies remains very close.
It is interesting to note that Gurobi showed excellent performance for small instances
comprising 30 and 50 images. However, its performance was not as impressive for larger

CP 2023

23:10

Satellite Image Mosaic Selection Problem

instances. This could be related to the way solutions are discovered and added to the front.
For future research, it could be interesting to compare the hypervolume anytime behavior
for different approaches used for CP and MILP to get the exact Pareto front.

5

Related Work

Geometric set covering problems can be divided into two categories based on the requirements
of the covering shapes. In one category, the covering shapes do not have a fixed position
in the plane, for example, covering a polygonal region with the minimum amount of fixed
sized rectangles [24] or with a set of known rectangles that can freely move on the plane [35].
The other category is where the covering shapes have a fixed size and position, for example,
covering a polygonal region with discs with a fixed size and position on the plane [10]. SIMS,
belongs to the second category as the satellite images represent a fixed region on the Earth.
Considering the cloud coverage percentage in the final mosaic makes SIMS problem
different from polygon cover and other geometrical cover problems, where the covering shapes
only have to cover the polygon or the universe of points in the space. In this problem, the
covering shapes, besides covering the polygon, should also cover certain regions (clouds) that
are present in the shapes. Interestingly, this can be seen as solving two weighted set covering
problems; in one, the AOI must be covered and in the other, the clouds.
The main approaches to solving geometric set covering problems are local search [6, 12, 5]
and linear programming (LP) [7, 8]. In most of the papers, opposite to SIMS, the universe
is a set of points instead of a region. In [10], they provide an exact algorithm for the case
where the universe is a set of regions, and the covering objects are discs. The algorithm is
effective when the minimum number of discs to cover the space is low.
In [29], set covering problem is tackled using constraint programming. There, the authors
propose a way to prune the domain of possible solution using a lower and upper bound
for the objective value. The lower bound consists of determining the minimum number of
sets that can cover the space. This is equivalent to answering the following NP-complete
problem: does a cover of the universe exist with K sets. They propose a new strategy to
get an approximation of this lower bound and compare it against two other well-known
lower-bound values: the value of the LP relaxation problem and a greedy algorithm. The
proposed prune strategy is good for problems where the size of the sets is small, for bigger
subsets, they recommend alternating between the LP relaxation and the greedy algorithm.

6

Conclusion

In this paper, we introduce a novel geometrical NP-hard problem, SIMS, inspired by the
selection of satellite images for mosaic generation. CP and MILP models are provided for this
problem, together with a search strategy for the CP model, based on the well-known greedy
algorithm used for set covering problems. In the experiments performed, the CP solved
with OR-tools got the best result, evidencing the power of this solver. Our proposed search
strategy could not outperform the default search strategies, but in the case of the Gecode
solver, it produced similar results. Generally, the CP model outperformed the MILP model.
This could be related to the method used to generate the Pareto front. For future work, it
will be interesting to compare different approaches to generate the exact Pareto front for
the CP and MILP models, based on the metric anytime behavior for the hypervolume. We
also plan to propose heuristics to tackle larger instances and to evaluate their performance
against the proposed CP and MILP models.

M. Combarro Simón, P. Talbot, G. Danoy, J. Musial, M. Alswaitti, and P. Bouvry

23:11

References
1
2
3
4
5

6

7

8

9
10

11

12
13

14

15

16

17
18
19

Airbus.
Incidence angle.
URL: https://www.intelligence-airbusds.com/en/
8719-angle-conversion.
Airbus. Pléiades. URL: https://www.intelligence-airbusds.com/en/8692-pleiades.
Airbus.
Pléiades Neo.
URL: https://www.airbus.com/en/products-services/space/
earth-observation/earth-observation-portfolio/pleiades-neo.
Airbus. SPOT. URL: https://www.intelligence-airbusds.com/en/8693-spot-67.
Pradeesha Ashok, Aniket Basu Roy, and Sathish Govindarajan. Local search strikes again:
PTAS for variants of geometric covering and packing. Journal of Combinatorial Optimization,
39(2):618–635, jun 2019. doi:10.1007/s10878-019-00432-y.
Nikhil Bansal and Kirk Pruhs. Weighted geometric set multi-cover via quasi-uniform sampling.
In Algorithms – ESA 2012, pages 145–156. Springer Berlin Heidelberg, 2012. doi:10.1007/
978-3-642-33090-2_14.
Timothy M. Chan and Qizheng He. Faster approximation algorithms for geometric set cover.
In Leibniz International Proceedings in Informatics, LIPIcs, volume 164. Schloss DagstuhlLeibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing, 6 2020. doi:10.4230/LIPIcs.
SoCG.2020.27.
Chandra Chekuri, Sariel Har-Peled, and Kent Quanrud. Fast LP-based approximations for
geometric packing and covering problems. In Proceedings of the Fourteenth Annual ACMSIAM Symposium on Discrete Algorithms, pages 1019–1038. Society for Industrial and Applied
Mathematics, jan 2020. doi:10.1137/1.9781611975994.62.
V. Chvatal. A greedy heuristic for the set-covering problem. Mathematics of Operations
Research, 4(3):233–235, aug 1979. doi:10.1287/moor.4.3.233.
Claudio Contardo and Alain Hertz. An exact algorithm for a class of geometric set-cover
problems. Discrete Applied Mathematics, 300:25–35, 2021. doi:https://doi.org/10.1016/j.
dam.2021.05.005.
Joseph C. Culberson and Robert A. Reckhow. Covering polygons is hard. Annual Symposium
on Foundations of Computer Science (Proceedings), pages 601–611, 1988. doi:10.1109/SFCS.
1988.21976.
Minati De and Abhiruk Lahiri. Geometric Dominating-Set and Set-Cover via Local-Search.
Computational Geometry, 113:102007, 8 2023. doi:10.1016/j.comgeo.2023.102007.
European Union Agency for the Space Programme. EUSPA EO and GNSS Market Report.
Technical Report 1, European Union Agency for the Space Programme, 2022. URL: https:
//www.euspa.europa.eu/sites/default/files/uploads/euspa_market_report_2022.pdf.
Shilan Felegari, Alireza Sharifi, Kamran Moravej, Muhammad Amin, Ahmad Golchin, Anselme
Muzirafuti, Aqil Tariq, and Na Zhao. Integration of Sentinel 1 and Sentinel 2 Satellite Images
for Crop Mapping. Applied Sciences, 11:10104, 2021. doi:10.3390/app112110104.
Neil Flood, Fiona Watson, and Lisa Collett. Using a U-net convolutional neural network
to map woody vegetation extent from high resolution satellite imagery across Queensland,
Australia. International Journal of Applied Earth Observation and Geoinformation, 82:101897,
10 2019. doi:10.1016/J.JAG.2019.101897.
Marco Gavanelli. An algorithm for multi-criteria optimization in CSPs. In ECAI 2002: 15th
European Conference on Artificial Intelligence, July 21-26, 2002, Lyon France: Including
Prestigious Applications of Intelligent Systems (PAIS 2002): Proceedings, volume 77, page
136. IOS Press, 2002.
GEOS contributors. GEOS coordinate transformation software library. Open Source Geospatial
Foundation, 2021. URL: https://libgeos.org/.
Tias Guns, Peter J. Stuckey, and Guido Tack. Solution Dominance over Constraint Satisfaction
Problems, 2018. arXiv:1812.09207 [cs]. URL: http://arxiv.org/abs/1812.09207.
Yanan Guo, Xiaoqun Cao, Bainian Liu, and Mei Gao. Cloud detection for satellite imagery
using attention-based u-net convolutional neural network. Symmetry, 12(6):1056, jun 2020.
doi:10.3390/sym12061056.

CP 2023

23:12

Satellite Image Mosaic Selection Problem

20
21

22

23

24

25

26

27

28

29
30

31
32

33

34
35

Gurobi Optimization, LLC. Gurobi Optimizer Reference Manual, 2023. URL: https://www.
gurobi.com.
Ola Hall, Sigrun Dahlin, Håkan Marstorp, Maria Francisca Archila Bustos, Ingrid Öborn, and
Magnus Jirström. Classification of maize in complex smallholder farming systems using UAV
imagery. Drones, 2(3):1–8, 2018. doi:10.3390/drones2030022.
Jacob Høxbroe Jeppesen, Rune Hylsberg Jacobsen, Fadil Inceoglu, and Thomas Skjødeberg
Toftegaard. A cloud detection algorithm for satellite imagery based on deep learning. Remote
Sensing of Environment, 229:247–259, aug 2019. doi:10.1016/j.rse.2019.03.039.
Martin Lukasiewycz, Michael Glaß, Christian Haubelt, and Jürgen Teich. Solving Multiobjective Pseudo-Boolean Problems. In Theory and Applications of Satisfiability Testing –
SAT 2007, pages 56–69. Springer Berlin Heidelberg, Berlin, Heidelberg, 2007. doi:10.1007/
978-3-540-72788-0_9.
Sina Sharif Mansouri, George Georgoulas, Thomas Gustafsson, and George Nikolakopoulos.
On the covering of a polygonal region with fixed size rectangles with an application towards
aerial inspection. In 2017 25th Mediterranean Conference on Control and Automation (MED).
IEEE, jul 2017. URL: https://doi.org/10.1109%2Fmed.2017.7984284, doi:10.1109/med.2017.
7984284.
George Mavrotas. Effective implementation of the ϵ-constraint method in multi-objective
mathematical programming problems. Applied Mathematics and Computation, 213(2):455–465,
jul 2009. URL: https://doi.org/10.1016%2Fj.amc.2009.03.037, doi:10.1016/j.amc.2009.03.
037.
George Mavrotas and Kostas Florios. An improved version of the augmented ϵ-constraint
method (AUGMECON2) for finding the exact pareto set in multi-objective integer programming
problems. Applied Mathematics and Computation, 219(18):9652–9669, may 2013. doi:10.
1016/j.amc.2013.03.002.
V. Megha and K. K. Rajkumar. Automatic Satellite Image Stitching Based on Speeded
Up Robust Feature. In Proceedings - 2021 1st IEEE International Conference on Artificial
Intelligence and Machine Vision, AIMV 2021, pages 1–6, Gandhinagar, India, 2021. Institute
of Electrical and Electronics Engineers Inc. doi:10.1109/AIMV53313.2021.9670954.
Doug Mohney.
Terabytes From Space:
Satellite Imaging is Filling
Data
Centers,
2020.
URL:
https://datacenterfrontier.com/
terabytes-from-space-satellite-imaging-is-filling-data-centers/.
Sébastien Mouthuy, Yves Deville, and Grégoire Dooms. Global constraint for the set covering
problem. Journées Francophones de Programmation par Contraintes, pages 183–192, 2007.
Konstantinos G. Nikolakopoulos, Paraskevi Lampropoulou, Elias Fakiris, Dimitris Sardelianos,
and George Papatheodorou. Synergistic use of UAV and USV data and petrographic analyses
for the investigation of beachrock formations: A case study from Syros Island, Aegean sea,
Greece. Minerals, 8(11):534, 11 2018. doi:10.3390/min8110534.
Laurent Perron and Vincent Furnon. Or-tools. URL: https://developers.google.com/
optimization/.
Simone Piaggesi, Laetitia Gauvin, Michele Tizzoni, Natalia Adler, Stefaan Verhulst, Andrew
Young, Rihannan Price, Leo Ferres, Ciro Cattuto, and André Panisson. Predicting city poverty
using satellite imagery. In IEEE Computer Society Conference on Computer Vision and
Pattern Recognition Workshops, volume 2019-June, pages 90–96, 2019.
Pierre Schaus and Renaud Hartert. Multi-Objective Large Neighborhood Search. In Principles and Practice of Constraint Programming, volume 8124, pages 611–627. Springer Berlin
Heidelberg, Berlin, Heidelberg, 2013. doi:10.1007/978-3-642-40627-0_46.
Christian Schulte, Guido Tack, and Mikael Lagerkvist. Modeling and Programming with
Gecode, 2020.
Y. G. Stoyan, T. Romanova, G. Scheithauer, and A. Krivulya. Covering a polygonal region by
rectangles. Computational Optimization and Applications 2009 48:3, 48(3):675–695, 5 2009.
doi:10.1007/S10589-009-9258-1.

M. Combarro Simón, P. Talbot, G. Danoy, J. Musial, M. Alswaitti, and P. Bouvry

36

37
38

39

40

23:13

Lin Sun, Xu Yang, Shangfeng Jia, Chen Jia, Quan Wang, Xinyan Liu, Jing Wei, and Xueying
Zhou. Satellite data cloud detection using deep learning supported by hyperspectral data.
International Journal of Remote Sensing, 41(4):1349–1371, sep 2019. doi:10.1080/01431161.
2019.1667548.
Union of Concerned Scientists. UCS Satellite Database. URL: https://www.ucsusa.org/
resources/satellite-database.
Haibo Wang, Xueshuang Gong, Bingbing Wang, Chao Deng, and Qiong Cao. Urban development analysis using built-up area maps based on multiple high-resolution satellite data.
International Journal of Applied Earth Observation and Geoinformation, 103:102500, 12 2021.
doi:10.1016/J.JAG.2021.102500.
Lei Yu, Yongjun Zhang, Mingwei Sun, and Xinyu Zhu. Colour balancing of satellite imagery
based on a colour reference library. International Journal of Remote Sensing, 37(24):5763–5785,
12 2016. doi:10.1080/01431161.2016.1249306.
Weihua Zhang and Marc Reimann. A simple augmented ϵ-constraint method for multi-objective
mathematical integer programming problems. European Journal of Operational Research,
234(1):15–24, apr 2014. doi:10.1016/j.ejor.2013.09.001.

A

Linear Programming Model

For the mixed integer linear programming, we use the same nomenclature as for the constraint
model. We just add the necessary variables to linearize the model.
To linearize the cover constraint (1) it is necessary to associate each image Pi with a
decision variable xi that is equal to 1 if the image i is selected, otherwise it is 0. We rewrite
the constraint as follows:

X

xi ≥ 1, for all k ∈ U

(8)

i:k∈Pi

The previous constraint guarantees that all the parts are covered by at least one image.
To linearize the cost constraint (2) it is necessary to associate each image Pi with an
auxiliary variable wi that represents the cost of the image. The linear constraint can be
written as:

min

X

xi w i

(9)

Pi ∈I

The constraints (8) and (9) are the classical constraints used for set covering problems.
The resolution objective is a min-min problem, where the objective is to minimize the sum
of the min resolution of each part. The min resolution of a part is the minimum resolution
of the images that contain them and belong to a cover. We need to add an auxiliary decision
variable rk representing the best resolution of the part k and a big constant B, bigger than the
maximum image resolution. Also, we need to add an auxiliary binary decision variables zkj
for each image Pj that contains k. For each part k, we define Lk := {i ∈ {1, . . . , m} | k ∈ Pi }
as the set of all images containing the part k. For each part k we can now define a constraint
for the values that can take the variables zkj .
X

zkj = |Lk | − 1

(10)

j∈Lk

CP 2023

23:14

Satellite Image Mosaic Selection Problem

The constraint expressed above states that only one of the zkj variables can be 0, the
rest have to be 1. We define the minimum resolution of a part as rk . With the following two
constraints, we can linearize (3).

rk ≥ (xj Rj + B(1 − xi )) − 2Bzkj for all j ∈ Lk

min

X

rk

(11)

(12)

k∈U

The first term on the right-hand side of (11) affects how the part k perceives the resolution
of the images to which it belongs. If the image is in a cover, the resolution is equal to Rj .
If the image is not in a cover, then the resolution is equal to the big constant B. As we
minimize rk , the lower this first term, the better. This term forced the images with lower
resolution to be in a cover. The second term on the right-hand side of (11) is used to force rk
equal to the minimum value of the resolution of the images that contain the part k and are
in the cover. When zkj = 1 the right-hand side is negative and when 0 the value is positive,
and it is the value that rk takes.
To linearize the incidence angle objective, we need to minimize an auxiliary variable
maxf that represents the maximum incidence angle of the images in the cover.

min maxf ≥ xi Fi , for all i = 1, . . . , m

(13)

To minimize the area of the clouds, we can model this as a partial set cover problem,
where the universe C = {1, . . . c} is formed by all the clouds, and the sets are the images
that can cover the clouds. For example, if we have the following set P2c = {c1 , c2 , c5 } it
means that image 2 can cover clouds 1, 2 and 5, i.e. parts 1, 2 and 5 are not cloudy in image
2. For each cloud ci we have a variable yi that is 1 if the cloud is covered or 0 otherwise and
Ac indicating the area of the cloud. To maximize the covering of the cloudy areas, we will
minimize the following expression:

min −

X

y c Ac

(14)

c∈C

Subject to the following constraint, which forces yc to be 0 if none of the images that
cover the cloud c is selected to cover the AOI.
X

xi ≥ yc , for all c ∈ C

(15)

i:c∈Pic

B

MiniZinc Model

We describe the full MiniZinc constraint model implementing the mathematical model given
in Section 2.
int : num_images ;
int : universe ;
int : max_cloud_area ;

M. Combarro Simón, P. Talbot, G. Danoy, J. Musial, M. Alswaitti, and P. Bouvry

23:15

set of int : IMAGES = 1.. num_images ;
set of int : UNIVERSE = 1.. universe ;
array [ IMAGES ] of set of int : images ;
array [ IMAGES ] of set of int : clouds ;
array [ IMAGES ] of int : costs ;
array [ UNIVERSE ] of int : areas ;
array [ IMAGES ] of int : resolution ;
array [ IMAGES ] of int : incidence_angle ;
array [ IMAGES ] of var bool : taken ;
% Which images have a universe ‘u ‘ without cloud ?
% That is , uclear [ u ] = { i1 , i2 , ..} means that the images numbered i1 , i2
, ... contains ‘u ‘ without clouds .
array [ UNIVERSE ] of set of int : uclear = [{ i | i in IMAGES where not ( u
in clouds [ i ]) /\ u in images [ i ] } | u in UNIVERSE ];
% Set covering constraint .
constraint forall ( u in UNIVERSE ) (
exists ( i in IMAGES ) ( taken [ i ] /\ u in images [ i ]) ) ;
% cloudy [ u ] is true iff no image containing a version of ‘u ‘ without
clouds is taken .
array [ UNIVERSE ] of var bool : cloudy ;
array [ UNIVERSE ] of var int : num_clear_images ;
constraint forall ( u in UNIVERSE ) (
num_clear_images [ u ] = sum ( i in uclear [ u ]) ( taken [ i ])
);
constraint forall ( u in UNIVERSE ) ( cloudy [ u ] = ( num_clear_images [ u ] == 0) ) ;
var int : cloudy_area = sum ( u in UNIVERSE ) ( cloudy [ u ] * areas [ u ]) ;
var int : total_cost = sum ( i in IMAGES ) ( costs [ i ] * taken [ i ]) ;
var int : max_resolution = sum ( u in UNIVERSE ) ( min ( i in IMAGES where u in
images [ i ] /\ taken [ i ]) ( resolution [ i ]) ) ;
var int : max_incidence = max ( i in IMAGES ) ( taken [ i ] * incidence_angle [ i ]) ;
array [1..4] of var
constraint objs [1]
constraint objs [2]
constraint objs [3]
constraint objs [4]

int : objs ;
= total_cost ;
= cloudy_area ;
= max_resolution ;
= max_incidence ;

CP 2023

23:16

Satellite Image Mosaic Selection Problem

C

Experimental results detailed
Table 2 Hypervolume values for all the experiments.

Instance
lagos_nigeria_30
mexico_city_30
paris_30
rio_de_janeiro_30
tokyo_bay_30
lagos_nigeria_50
mexico_city_50
paris_50
rio_de_janeiro_50
tokyo_bay_50
lagos_nigeria_100
mexico_city_100
paris_100
rio_de_janeiro_100
tokyo_bay_100
lagos_nigeria_145
mexico_city_150
paris_150
rio_de_janeiro_150
tokyo_bay_150
mexico_city_200
paris_200
rio_de_janeiro_200
tokyo_bay_200

OR-tools default
5.46E+33
4.83E+32
1.95E+34
5.76E+33
6.35E+33
3.26E+34
2.88E+33
9.02E+34
3.94E+33
3.32E+34
1.86E+35
1.98E+35
4.73E+35
4.76E+35
1.23E+35
2.98E+36
1.52E+36
2.60E+36
6.59E+35
1.16E+36
2.66E+36
0.00E+00
0.00E+00
0.00E+00

OR-tools greedy
5.21E+33
4.82E+33
1.95E+34
5.65E+33
6.33E+33
3.15E+34
2.92E+34
8.85E+34
3.87E+34
3.22E+34
1.87E+35
1.97E+35
3.13E+34
4.14E+35
1.23E+35
3.78E+36
1.11E+36
2.83E+36
4.81E+34
8.68E+35
2.05E+35
0.00E+00
0.00E+00
0.00E+00

Gecode default
4.89E+32
4.62E+33
1.59E+34
5.65E+33
6.05E+33
2.19E+34
2.63E+34
7.12E+34
3.05E+34
1.65E+34
1.68E+35
2.11E+35
4.32E+35
2.28E+35
1.89E+35
2.32E+36
2.27E+36
1.11E+36
4.00E+35
8.36E+35
4.28E+36
0.00E+00
0.00E+00
0.00E+00

Gecode greedy
4.91E+33
4.59E+32
1.58E+34
5.65E+33
6.05E+33
2.18E+34
2.57E+34
7.24E+34
3.04E+34
1.78E+34
1.68E+35
2.03E+35
2.91E+35
1.83E+35
1.89E+35
2.32E+36
2.28E+36
1.11E+36
4.00E+35
8.36E+35
4.28E+36
0.00E+00
0.00E+00
0.00E+00

Gurobi
4.87E+33
4.83E+33
1.95E+33
5.76E+33
6.30E+33
2.77E+34
2.71E+34
7.85E+34
3.83E+34
1.98E+34
8.55E+34
1.58E+35
4.32E+35
3.39E+35
2.77E+35
9.30E+35
3.42E+35
2.87E+36
4.61E+34
1.12E+36
1.06E+36
1.60E+36
0.00E+00
2.46E+35