Traveling Salesperson Problem
This tutorial was generated using Literate.jl. Download the source as a .jl
file.
This tutorial was originally contributed by Daniel Schermer.
This tutorial describes how to implement the Traveling Salesperson Problem in JuMP using solver-independent lazy constraints that dynamically separate subtours. To be more precise, we use lazy constraints to cut off infeasible subtours only when necessary and not before needed.
It uses the following packages:
using JuMP
import Gurobi
import Plots
import Random
import Test
Mathematical Formulation
Assume that we are given a complete graph $\mathcal{G}(V,E)$ where $V$ is the set of vertices (or cities) and $E$ is the set of edges (or roads). For each pair of vertices $i, j \in V, i \neq j$ the edge $(i,j) \in E$ is associated with a weight (or distance) $d_{ij} \in \mathbb{R}^+$.
For this tutorial, we assume the problem to be symmetric, that is, $d_{ij} = d_{ji} \, \forall i,j \in V$.
In the Traveling Salesperson Problem, we are tasked with finding a tour with minimal length that visits every vertex exactly once and then returns to the point of origin, that is, a Hamiltonian cycle with minimal weight.
To model the problem, we introduce a binary variable, $x_{ij} \in \{0,1\} \; \forall i, j \in V$, that indicates if edge $(i,j)$ is part of the tour or not. Using these variables, the Traveling Salesperson Problem can be modeled as the following integer linear program.
Objective Function
The objective is to minimize the length of the tour (due to the assumed symmetry, the second sum only contains $i<j$):
\[\text{min } \sum_{i \in V} \sum_{j \in V, i < j} d_{ij} x_{ij}.\]
Note that it is also possible to use the following objective function instead:
\[\text{min } \sum_{i \in V} \sum_{j \in V} \dfrac{d_{ij} x_{ij}}{2}.\]
Constraints
There are four classes of constraints in our formulation.
First, due to the presumed symmetry, the following constraints must hold:
\[x_{ij} = x_{ji} \quad \forall i,j \in V.\]
Second, for each vertex $i$, exactly two edges must be selected that connect it to other vertices $j$ in the graph $G$:
\[\sum_{j \in V} x_{ij} = 2 \quad \forall i \in V.\]
Third, we do not permit loops to occur:
\[x_{ii} = 0 \quad \forall i \in V.\]
The fourth constraint is more complicated. A major difficulty of the Traveling Salesperson Problem arises from the fact that we need to prevent subtours, that is, several distinct Hamiltonian cycles existing on subgraphs of $G$.
Note that the previous constraints do not guarantee that the solution will be free of subtours. To this end, by $S$ we label a subset of vertices. Then, for each proper subset $S \subset V$, the following constraints guarantee that no subtour may occur:
\[\sum_{i \in S} \sum_{j \in S, i < j} x_{ij} \leq \vert S \vert - 1 \quad \forall S \subset V.\]
Problematically, we require exponentially many of these constraints as $\vert V \vert$ increases. Therefore, we will add these constraints only when necessary.
Implementation
There are two ways we can eliminate subtours in JuMP, both of which will be shown in what follows:
- iteratively solving a new model that incorporates previously identified subtours,
- or adding violated subtours as lazy constraints.
Data
The vertices are assumed to be randomly distributed in the Euclidean space; thus, the weight (distance) of each edge is defined as follows.
function generate_distance_matrix(n; random_seed = 1)
rng = Random.MersenneTwister(random_seed)
X = 100 * rand(rng, n)
Y = 100 * rand(rng, n)
d = [sqrt((X[i] - X[j])^2 + (Y[i] - Y[j])^2) for i in 1:n, j in 1:n]
return X, Y, d
end
n = 100
X, Y, d = generate_distance_matrix(n)
([9.913970137863682, 70.19797138879542, 50.3261785841856, 87.58412053070398, 95.34654118744876, 50.7810571056071, 78.97511635624403, 7.125413261100788, 13.837807897217225, 39.31891799217675 … 84.87369607977678, 61.680928138712, 5.665730912653899, 15.622563304879634, 36.90767228785501, 70.07597765092129, 79.43901471209098, 46.482254570311675, 68.59072330642508, 86.69884288310024], [96.78179466896867, 56.23453714649542, 67.44638756669107, 7.1115103002265645, 92.78034391338332, 34.57366887562756, 76.53412034001651, 33.078576899782796, 62.27235533684083, 31.31072581673351 … 1.4463814325218927, 42.253985947804495, 53.81635009641501, 76.50117040708963, 27.74238915740479, 60.20183753580153, 19.81346291572821, 90.60507365183767, 31.10234142135033, 21.085230265206945], [0.0 72.65150307747324 … 88.07242440917794 107.82340444001487; 72.65150307747324 0.0 … 25.183536454701592 38.829789264256306; … ; 88.07242440917794 25.183536454701592 … 0.0 20.69411777577625; 107.82340444001487 38.829789264256306 … 20.69411777577625 0.0])
For the JuMP model, we first initialize the model object. Then, we create the binary decision variables and add the objective function and constraints. By defining the x
matrix as Symmetric
, we do not need to add explicit constraints that x[i, j] == x[j, i]
.
function build_tsp_model(d, n)
model = Model(Gurobi.Optimizer)
set_silent(model)
@variable(model, x[1:n, 1:n], Bin, Symmetric)
@objective(model, Min, sum(d .* x) / 2)
@constraint(model, [i in 1:n], sum(x[i, :]) == 2)
@constraint(model, [i in 1:n], x[i, i] == 0)
return model
end
build_tsp_model (generic function with 1 method)
To search for violated constraints, based on the edges that are currently in the solution (that is, those that have value $x_{ij} = 1$), we identify the shortest cycle through the function subtour()
. Whenever a subtour has been identified, a constraint corresponding to the form above can be added to the model.
function subtour(edges::Vector{Tuple{Int,Int}}, n)
shortest_subtour, unvisited = collect(1:n), Set(collect(1:n))
while !isempty(unvisited)
this_cycle, neighbors = Int[], unvisited
while !isempty(neighbors)
current = pop!(neighbors)
push!(this_cycle, current)
if length(this_cycle) > 1
pop!(unvisited, current)
end
neighbors =
[j for (i, j) in edges if i == current && j in unvisited]
end
if length(this_cycle) < length(shortest_subtour)
shortest_subtour = this_cycle
end
end
return shortest_subtour
end
subtour (generic function with 1 method)
Let us declare a helper function selected_edges()
that will be repeatedly used in what follows.
function selected_edges(x::Matrix{Float64}, n)
return Tuple{Int,Int}[(i, j) for i in 1:n, j in 1:n if x[i, j] > 0.5]
end
selected_edges (generic function with 1 method)
Other helper functions for computing subtours:
subtour(x::Matrix{Float64}) = subtour(selected_edges(x, size(x, 1)), size(x, 1))
subtour(x::AbstractMatrix{VariableRef}) = subtour(value.(x))
subtour (generic function with 3 methods)
Iterative method
An iterative way of eliminating subtours is the following.
However, it is reasonable to assume that this is not the most efficient way: whenever a new subtour elimination constraint is added to the model, the optimization has to start from the very beginning.
That way, the solver will repeatedly discard useful information encountered during previous solves (for example, all cuts, the incumbent solution, or lower bounds).
Note that, in principle, it would also be feasible to add all subtours (instead of just the shortest one) to the model. However, preventing just the shortest cycle is often sufficient for breaking other subtours and will keep the model size smaller.
iterative_model = build_tsp_model(d, n)
optimize!(iterative_model)
@assert is_solved_and_feasible(iterative_model)
time_iterated = solve_time(iterative_model)
cycle = subtour(iterative_model[:x])
while 1 < length(cycle) < n
println("Found cycle of length $(length(cycle))")
S = [(i, j) for (i, j) in Iterators.product(cycle, cycle) if i < j]
@constraint(
iterative_model,
sum(iterative_model[:x][i, j] for (i, j) in S) <= length(cycle) - 1,
)
optimize!(iterative_model)
@assert is_solved_and_feasible(iterative_model)
global time_iterated += solve_time(iterative_model)
global cycle = subtour(iterative_model[:x])
end
Set parameter WLSAccessID
Set parameter WLSSecret
Set parameter LicenseID to value 722777
Set parameter GURO_PAR_SPECIAL
WLS license 722777 - registered to JuMP Development
Found cycle of length 3
Found cycle of length 3
Found cycle of length 3
Found cycle of length 3
Found cycle of length 3
Found cycle of length 3
Found cycle of length 3
Found cycle of length 3
Found cycle of length 3
Found cycle of length 4
Found cycle of length 3
Found cycle of length 3
Found cycle of length 3
Found cycle of length 4
Found cycle of length 5
Found cycle of length 6
Found cycle of length 5
Found cycle of length 3
Found cycle of length 4
Found cycle of length 6
Found cycle of length 4
Found cycle of length 8
Found cycle of length 5
Found cycle of length 10
Found cycle of length 10
Found cycle of length 15
Found cycle of length 11
Found cycle of length 4
Found cycle of length 22
Found cycle of length 3
Found cycle of length 5
Found cycle of length 21
objective_value(iterative_model)
744.6016576596794
time_iterated
4.155482053756714
As a quick sanity check, we visualize the optimal tour to verify that no subtour is present:
function plot_tour(X, Y, x)
plot = Plots.plot()
for (i, j) in selected_edges(x, size(x, 1))
Plots.plot!([X[i], X[j]], [Y[i], Y[j]]; legend = false)
end
return plot
end
plot_tour(X, Y, value.(iterative_model[:x]))
Lazy constraint method
A more sophisticated approach makes use of lazy constraints. To be more precise, we do this through the subtour_elimination_callback()
below, which is only run whenever we encounter a new integer-feasible solution.
lazy_model = build_tsp_model(d, n)
function subtour_elimination_callback(cb_data)
status = callback_node_status(cb_data, lazy_model)
if status != MOI.CALLBACK_NODE_STATUS_INTEGER
return # Only run at integer solutions
end
cycle = subtour(callback_value.(cb_data, lazy_model[:x]))
if !(1 < length(cycle) < n)
return # Only add a constraint if there is a cycle
end
S = [(i, j) for (i, j) in Iterators.product(cycle, cycle) if i < j]
con = @build_constraint(
sum(lazy_model[:x][i, j] for (i, j) in S) <= length(cycle) - 1,
)
MOI.submit(lazy_model, MOI.LazyConstraint(cb_data), con)
return
end
set_attribute(
lazy_model,
MOI.LazyConstraintCallback(),
subtour_elimination_callback,
)
optimize!(lazy_model)
Set parameter WLSAccessID
Set parameter WLSSecret
Set parameter LicenseID to value 722777
Set parameter GURO_PAR_SPECIAL
WLS license 722777 - registered to JuMP Development
@assert is_solved_and_feasible(lazy_model)
objective_value(lazy_model)
744.6016576596794
time_lazy = solve_time(lazy_model)
2.055448055267334
This finds the same optimal tour:
plot_tour(X, Y, value.(lazy_model[:x]))
The solution time is faster than the iterative approach:
Test.@test time_lazy < time_iterated
Test Passed