Skip to content

A fast implementation of the Goemans-Williamson scheme for the prize-collecting Steiner tree / forest problem.

License

Notifications You must be signed in to change notification settings

fraenkel-lab/pcst_fast

Repository files navigation

Build Status

pcst_fast

A library for solving the prize-collecting Steiner forest (PCSF) problem on graphs. The underlying algorithm is based on the classical Goemans-Williamson approximation scheme. Our variant provably runs in nearly-linear time and has a factor-2 approximation guarantee. The following paper contains details about the algorithm:

A Nearly-Linear Time Framework for Graph-Structured Sparsity Chinmay Hegde, Piotr Indyk, Ludwig Schmidt ICML 2015

Installation

  • Option 1: pip

      pip install pcst_fast
    

    This may not work for some versions of python on some operating systems.

  • Option 2: manual

    The core C++ library has no dependencies besides a basic build system for C++11. Both g++ and clang are currently supported. The Python wrapper requires a functioning Python build system.

    Clone the repository and compile the python wrapper with the supplied makefile:

      make pcst_fast_py
    

    You can then import the package via import pcst_fast.

Usage

The pcst_fast package contains the following function:

vertices, edges = pcst_fast(edges, prizes, costs, root, num_clusters, pruning, verbosity_level)

The parameters are:

  • edges: a 2D int64 array. Each row (of length 2) specifies an undirected edge in the input graph. The nodes are labeled 0 to n-1, where n is the number of nodes.
  • prizes: the node prizes as a 1D float64 array.
  • costs: the edge costs as a 1D float64 array.
  • root: the root note for rooted PCST. For the unrooted variant, this parameter should be -1.
  • num_clusters: the number of connected components in the output.
  • pruning: a string value indicating the pruning method. Possible values are 'none', 'simple', 'gw', and 'strong' (all literals are case-insensitive). 'none' and 'simple' return intermediate stages of the algorithm and do not have approximation guarantees. They are only intended for development. The standard GW pruning method is 'gw', which is also the default. 'strong' uses "strong pruning", which was introduced in [JMP00]. It has the same theoretical guarantees as GW pruning but better empirical performance in some cases. For the PCSF problem, the output of strong pruning is at least as good as the output of GW pruning.
  • verbosity_level: an integer indicating how much debug output the function should produce.

The output variables are:

  • vertices: the vertices in the solution as a 1D int64 array.
  • edges: the edges in the output as a 1D int64 array. The list contains indices into the list of edges passed into the function.

Performance

The following paper contains many results on standard PCST benchmark instances:

A Fast, Adaptive Variant of the Goemans-Williamson Scheme for the Prize-Collecting Steiner Tree Problem Chinmay Hegde, Piotr Indyk, Ludwig Schmidt Workshop of the 11th DIMACS Implementation Challenge: Steiner Tree Problems, 2014

On instances with up to 350,000 edges, the algorithm typically runs in under 2 seconds on a standard laptop computer from 2010.