Simple, Fast and Provable Greedy Algorithms for the Goemans and Williamson Max-Cut Relaxation

This is the accompanying code for the paper "Simple, Fast and Provable Greedy Algorithms for the Goemans and Williamson Max-Cut Relaxation". In this work we propose simple and provable, fast greedy algorithms to solve the semidefinite programming relaxation proposed by Goemans and Williamson.

Analysis of the algorithm demonstrate that it always converges to the global optimum. Cuts found by the algorithms are monotonically non-decreasing, and will keep improving until no improvement can be made. The C++ implementation of algorithm 2 (low rank) can solve the Max-Cut relaxation for graphs with thousands of nodes in a matter of seconds in a desktop machine. Empirical results, which can be replicated with this implementation, supports our claims.

If you find this work useful please cite,

here goes the reference to our paper

Description

C++ implementation

For the G set graphs, the input graphs must be stored in the input folder and must follow the same format as the example shown for G11 (G11.txt file) in this repo. The G11.mat file is the input file for the python implementations of the algorithm.

For the SNAP set graphs, the input graphs must be stored in the input folder and must follow the same format as the example shown for the Amazon graph (amazon.txt file) in this repo. The main difference with respect to the input files for the G set graphs is that the input files for the SNAP set graphs does not include the weights of the edges, because the weight is equal to one for all edges. This allow us to save storage space for these graphs.

Four algorithms were implemented in C++:

algorithm1.cpp, implementation of Algorithm 1 (full rank) in the paper.
algorithm2.cpp, implementation of Algorithm 2 (low rank) in the paper.
algorithm2_snap.cpp, implementation of Algorithm 2 (low rank) in the paper, modified to read input graphs from the SNAP set (see previous paragraph).
algorithm2_snap_fixed_rank.cpp, implementation of Algorithm 2 (low rank) in the paper, modified to read input graphs from the SNAP set (see previous paragraph) and modified to set the value of p = 100 (initial rank of solution matrix) to handle graphs with more than 1 million of nodes (see paper).

To compile algorithm 1:

g++ -I /path/to/folder/eigen-3.3.9/ algorithm1.cpp -o algorithm1 -O2

To run algorithm 1:

./algorithm1 ./inputs/G11.txt 0.001 1000

where:

G11.txt is the name of the graph to solve (G1, G2, ...)
0.001 is the value of the tolerance used to evaluate convergence of algorithm
1000 is the maximum number of iterations for outer loop of the algorithm

The instructions to compile and run the rest of C++ implementations (algorithm2, algorithm2_snap, and algorithm2_snap_fixed_rank) are the same as for algorithm1 but with the name of the algorithm modified accordingly.

Very important: do not forget to use the flag -O2 to compile the codes, otherwise the implementations will be slow.

Python implementation

In addition to our implementation in C++, we implemented our algorithm in Python. This implementation is not as efficient in time and space as the implementation in C++, but allow us to easily plot the convergence of different quantities during the execution of the algorithm.

The input graphs must be stored in the input folder and must follow the same format as the G set graphs in this link. An example is shown for G11 (G11.mat file) in this repo.

We used an open source solver (SCS with CVXPY) to find the optimal solution for the G set graphs to compare our results.

To run algorithm 1:

python robust_maxcut_k2.py --input_graph G11 --max_iterations 1000 --tol 0.001 --random_seed 0 --fast_execution on

To run algorithm 2 (low rank):

python robust_maxcut_k2_reduced_rank.py --input_graph G11 --max_iterations 1000 --tol 0.001 --random_seed 0 --fast_execution on

To run SCS (with CVXPY):

python cvxpy_maxcut_k2.py --input_graph G11 --max_iterations 1000 --tol 0.001 --random_seed 0

where:

input_graph is the name of the graph to solve (G1, G2, ...)
max_iterations is the maximum number of iterations for outer loop of the algorithm
tol is the value of the tolerance used to evaluate convergence of algorithm
random_seed is the value of the random seed for numpy (to replicate results)
fast_execution is a value s.t., if True, some computations (solution matrix rank per iteration) are avoided (only available for algorithms 1 and 2, not for SCS)

Dependencies

C++

Eigen 3.3.9

Python

python 3.9.18
numpy 1.26.0
scipy 1.11.3
matplotlib 3.5.3
distutils 3.9.18
cvxpy 1.4.2 (only used for comparison, not necessary for algorithms' implementation)

Colab

You can also replicate our python implementation results with the following google colab notebooks:

Main example

License

Our implementation is open-sourced under the Apache-2.0 license. See the LICENSE file for details.

Contact

Bernardo Gonzalez <bernardo.gtorres@gmail.com>

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
cpp_solutions		cpp_solutions
imgs		imgs
inputs		inputs
python_solutions		python_solutions
solver_solutions		solver_solutions
LICENSE		LICENSE
README.md		README.md
algorithm1.cpp		algorithm1.cpp
algorithm2.cpp		algorithm2.cpp
algorithm2_snap.cpp		algorithm2_snap.cpp
algorithm2_snap_fixed_rank.cpp		algorithm2_snap_fixed_rank.cpp
cvxpy_maxcut_k2.py		cvxpy_maxcut_k2.py
robust_maxcut_k2.py		robust_maxcut_k2.py
robust_maxcut_k2_reduced_rank.py		robust_maxcut_k2_reduced_rank.py

License

b3r8/robust-greedy-maxcut

Folders and files

Latest commit

History

Repository files navigation

Simple, Fast and Provable Greedy Algorithms for the Goemans and Williamson Max-Cut Relaxation

Description

C++ implementation

Python implementation

Dependencies

C++

Python

Colab

License

Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Languages