Chapter 15. Graph visitors

1. Breadth-first search
2. Depth-first search
3. Random walks

1. Breadth-first search

1.1. igraph_bfs — Breadth-first search

int igraph_bfs(const igraph_t *graph,
               igraph_integer_t root, const igraph_vector_t *roots,
               igraph_neimode_t mode, igraph_bool_t unreachable,
               const igraph_vector_t *restricted,
               igraph_vector_t *order, igraph_vector_t *rank,
               igraph_vector_t *father,
               igraph_vector_t *pred, igraph_vector_t *succ,
               igraph_vector_t *dist, igraph_bfshandler_t *callback,
               void *extra);

A simple breadth-first search, with a lot of different results and the possibility to call a callback whenever a vertex is visited. It is allowed to supply null pointers as the output arguments the user is not interested in, in this case they will be ignored.

If not all vertices can be reached from the supplied root vertex, then additional root vertices will be used, in the order of their vertex ids.

Consider using igraph_bfs_simple instead if you set most of the output arguments provided by this function to a null pointer.

Arguments: 

graph:

The input graph.

root:

The id of the root vertex. It is ignored if the roots argument is not a null pointer.

roots:

Pointer to an initialized vector, or a null pointer. If not a null pointer, then it is a vector containing root vertices to start the BFS from. The vertices are considered in the order they appear. If a root vertex was already found while searching from another one, then no search is conducted from it.

mode:

For directed graphs, it defines which edges to follow. IGRAPH_OUT means following the direction of the edges, IGRAPH_IN means the opposite, and IGRAPH_ALL ignores the direction of the edges. This parameter is ignored for undirected graphs.

unreachable:

Logical scalar, whether the search should visit the vertices that are unreachable from the given root node(s). If true, then additional searches are performed until all vertices are visited.

restricted:

If not a null pointer, then it must be a pointer to a vector containing vertex ids. The BFS is carried out only on these vertices.

order:

If not null pointer, then the vertex ids of the graph are stored here, in the same order as they were visited.

rank:

If not a null pointer, then the rank of each vertex is stored here.

father:

If not a null pointer, then the id of the father of each vertex is stored here.

pred:

If not a null pointer, then the id of vertex that was visited before the current one is stored here. If there is no such vertex (the current vertex is the root of a search tree), then -1 is stored.

succ:

If not a null pointer, then the id of the vertex that was visited after the current one is stored here. If there is no such vertex (the current one is the last in a search tree), then -1 is stored.

dist:

If not a null pointer, then the distance from the root of the current search tree is stored here.

callback:

If not null, then it should be a pointer to a function of type igraph_bfshandler_t. This function will be called, whenever a new vertex is visited.

extra:

Extra argument to pass to the callback function.

Returns: 

Error code.

Time complexity: O(|V|+|E|), linear in the number of vertices and edges.

Example 15.1.  File examples/simple/igraph_bfs.c

/* -*- mode: C -*-  */
/*
   IGraph library.
   Copyright (C) 2009-2021  The igraph development team

   This program is free software; you can redistribute it and/or modify
   it under the terms of the GNU General Public License as published by
   the Free Software Foundation; either version 2 of the License, or
   (at your option) any later version.

   This program is distributed in the hope that it will be useful,
   but WITHOUT ANY WARRANTY; without even the implied warranty of
   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
   GNU General Public License for more details.

   You should have received a copy of the GNU General Public License
   along with this program; if not, write to the Free Software
   Foundation, Inc.,  51 Franklin Street, Fifth Floor, Boston, MA
   02110-1301 USA

*/

#include <igraph.h>

igraph_bool_t bfs_callback(const igraph_t *graph,
                           igraph_integer_t vid,
                           igraph_integer_t pred,
                           igraph_integer_t succ,
                           igraph_integer_t rank,
                           igraph_integer_t dist,
                           void *extra) {
    printf(" %li", (long int) vid);
    return 0;
}

int main() {

    igraph_t graph, ring;
    igraph_vector_t order, rank, father, pred, succ, dist;

    /* Create a disjoint union of two rings */
    igraph_ring(&ring, 10, /*directed=*/ 0, /*mutual=*/ 0, /*circular=*/ 1);
    igraph_disjoint_union(&graph, &ring, &ring);
    igraph_destroy(&ring);

    /* Initialize the vectors where the result will be stored. Any of these
     * can be omitted and replaced with a null pointer when calling
     * igraph_bfs() */
    igraph_vector_init(&order, 0);
    igraph_vector_init(&rank, 0);
    igraph_vector_init(&father, 0);
    igraph_vector_init(&pred, 0);
    igraph_vector_init(&succ, 0);
    igraph_vector_init(&dist, 0);

    /* Now call the BFS function */
    igraph_bfs(&graph, /*root=*/0, /*roots=*/ 0, /*neimode=*/ IGRAPH_OUT,
               /*unreachable=*/ 1, /*restricted=*/ 0,
               &order, &rank, &father, &pred, &succ, &dist,
               /*callback=*/ 0, /*extra=*/ 0);

    /* Print the results */
    igraph_vector_print(&order);
    igraph_vector_print(&rank);
    igraph_vector_print(&father);
    igraph_vector_print(&pred);
    igraph_vector_print(&succ);
    igraph_vector_print(&dist);

    /* Cleam up after ourselves */
    igraph_vector_destroy(&order);
    igraph_vector_destroy(&rank);
    igraph_vector_destroy(&father);
    igraph_vector_destroy(&pred);
    igraph_vector_destroy(&succ);
    igraph_vector_destroy(&dist);

    igraph_destroy(&graph);

    return 0;
}


Example 15.2.  File examples/simple/igraph_bfs_callback.c

/* -*- mode: C -*-  */
/*
   IGraph library.
   Copyright (C) 2009-2021  The igraph development team

   This program is free software; you can redistribute it and/or modify
   it under the terms of the GNU General Public License as published by
   the Free Software Foundation; either version 2 of the License, or
   (at your option) any later version.

   This program is distributed in the hope that it will be useful,
   but WITHOUT ANY WARRANTY; without even the implied warranty of
   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
   GNU General Public License for more details.

   You should have received a copy of the GNU General Public License
   along with this program; if not, write to the Free Software
   Foundation, Inc.,  51 Franklin Street, Fifth Floor, Boston, MA
   02110-1301 USA

*/

#include <igraph.h>

igraph_bool_t bfs_callback(const igraph_t *graph,
                           igraph_integer_t vid,
                           igraph_integer_t pred,
                           igraph_integer_t succ,
                           igraph_integer_t rank,
                           igraph_integer_t dist,
                           void *extra) {
    printf(" %li", (long int) vid);
    return 0;
}

int main() {
    igraph_t graph, ring;

    /* Create a disjoint union of two rings */
    igraph_ring(&ring, 10, /*directed=*/ 0, /*mutual=*/ 0, /*circular=*/ 1);
    igraph_disjoint_union(&graph, &ring, &ring);
    igraph_destroy(&ring);

    /* Now call the BFS function */
    printf("(");
    igraph_bfs(&graph, /*root=*/0, /*roots=*/ 0, /*neimode=*/ IGRAPH_OUT,
               /*unreachable=*/ 1, /*restricted=*/ 0,
               /*order=*/ 0, /*rank=*/ 0, /*father=*/ 0, /*pred=*/ 0,
               /*succ=*/ 0, /*dist=*/ 0,
               /*callback=*/ bfs_callback, /*extra=*/ 0);
    printf(" )\n");

    /* Cleam up after ourselves */
    igraph_destroy(&graph);

    return 0;
}


1.2. igraph_bfs_simple — Breadth-first search, single-source version

int igraph_bfs_simple(igraph_t *graph, igraph_integer_t vid, igraph_neimode_t mode,
                      igraph_vector_t *vids, igraph_vector_t *layers,
                      igraph_vector_t *parents);

An alternative breadth-first search implementation to cater for the simpler use-cases when only a single breadth-first search has to be conducted from a source node and most of the output arguments from igraph_bfs are not needed. It is allowed to supply null pointers as the output arguments the user is not interested in, in this case they will be ignored.

Arguments: 

graph:

The input graph.

vid:

The id of the root vertex.

mode:

For directed graphs, it defines which edges to follow. IGRAPH_OUT means following the direction of the edges, IGRAPH_IN means the opposite, and IGRAPH_ALL ignores the direction of the edges. This parameter is ignored for undirected graphs.

vids:

If not a null pointer, then an initialized vector must be passed here. The ids of the vertices visited during the traversal will be stored here, in the same order as they were visited.

layers:

If not a null pointer, then an initialized vector must be passed here. The i-th element of the vector will contain the index into vids where the vertices that are at distance i from the root are stored. In other words, if you are interested in the vertices that are at distance i from the root, you need to look in the vids vector from layers[i] to layers[i+1].

parents:

If not a null pointer, then an initialized vector must be passed here. The vector will be resized so its length is equal to the number of nodes, and it will contain the index of the parent node for each visited node. The values in the vector are undefined for vertices that were not visited.

Returns: 

Error code.

Time complexity: O(|V|+|E|), linear in the number of vertices and edges.

Example 15.3.  File examples/simple/igraph_bfs_simple.c

/* -*- mode: C -*-  */
/*
   IGraph library.
   Copyright (C) 2009-2021  The igraph development team

   This program is free software; you can redistribute it and/or modify
   it under the terms of the GNU General Public License as published by
   the Free Software Foundation; either version 2 of the License, or
   (at your option) any later version.

   This program is distributed in the hope that it will be useful,
   but WITHOUT ANY WARRANTY; without even the implied warranty of
   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
   GNU General Public License for more details.

   You should have received a copy of the GNU General Public License
   along with this program; if not, write to the Free Software
   Foundation, Inc.,  51 Franklin Street, Fifth Floor, Boston, MA
   02110-1301 USA

*/

#include <igraph.h>

void vector_print(igraph_vector_t *v) {
    long int i;
    for (i = 0; i < igraph_vector_size(v); i++) {
        printf(" %li", (long int) VECTOR(*v)[i]);
    }
    printf("\n");
}

int main() {

    igraph_t g;
    igraph_vector_t vids, layers, parents;

    igraph_ring(&g, 10, IGRAPH_UNDIRECTED, 0, 0);

    igraph_vector_init(&vids, 0);
    igraph_vector_init(&layers, 0);
    igraph_vector_init(&parents, 0);

    igraph_bfs_simple(&g, 0, IGRAPH_ALL, &vids, &layers, &parents);

    vector_print(&vids);
    vector_print(&layers);
    vector_print(&parents);

    igraph_destroy(&g);

    igraph_vector_destroy(&vids);
    igraph_vector_destroy(&layers);
    igraph_vector_destroy(&parents);

    return 0;
}


1.3. igraph_bfshandler_t — Callback type for BFS function

typedef igraph_bool_t igraph_bfshandler_t(const igraph_t *graph,
        igraph_integer_t vid,
        igraph_integer_t pred,
        igraph_integer_t succ,
        igraph_integer_t rank,
        igraph_integer_t dist,
        void *extra);

igraph_bfs() is able to call a callback function, whenever a new vertex is found, while doing the breadth-first search. This callback function must be of type igraph_bfshandler_t. It has the following arguments:

Arguments: 

graph:

The graph that that algorithm is working on. Of course this must not be modified.

vid:

The id of the vertex just found by the breadth-first search.

pred:

The id of the previous vertex visited. It is -1 if there is no previous vertex, because the current vertex is the root is a search tree.

succ:

The id of the next vertex that will be visited. It is -1 if there is no next vertex, because the current vertex is the last one in a search tree.

rank:

The rank of the current vertex, it starts with zero.

dist:

The distance (number of hops) of the current vertex from the root of the current search tree.

extra:

The extra argument that was passed to igraph_bfs().

Returns: 

A logical value, if TRUE (=non-zero), that is interpreted as a request to stop the BFS and return to the caller. If a BFS is terminated like this, then all elements of the result vectors that were not yet calculated at the point of the termination contain NaN.

See also: 

2. Depth-first search

2.1. igraph_dfs — Depth-first search

int igraph_dfs(const igraph_t *graph, igraph_integer_t root,
               igraph_neimode_t mode, igraph_bool_t unreachable,
               igraph_vector_t *order,
               igraph_vector_t *order_out, igraph_vector_t *father,
               igraph_vector_t *dist, igraph_dfshandler_t *in_callback,
               igraph_dfshandler_t *out_callback,
               void *extra);

A simple depth-first search, with the possibility to call a callback whenever a vertex is discovered and/or whenever a subtree is finished. It is allowed to supply null pointers as the output arguments the user is not interested in, in this case they will be ignored.

If not all vertices can be reached from the supplied root vertex, then additional root vertices will be used, in the order of their vertex ids.

Arguments: 

graph:

The input graph.

root:

The id of the root vertex.

mode:

For directed graphs, it defines which edges to follow. IGRAPH_OUT means following the direction of the edges, IGRAPH_IN means the opposite, and IGRAPH_ALL ignores the direction of the edges. This parameter is ignored for undirected graphs.

unreachable:

Logical scalar, whether the search should visit the vertices that are unreachable from the given root node(s). If true, then additional searches are performed until all vertices are visited.

order:

If not null pointer, then the vertex ids of the graph are stored here, in the same order as they were discovered.

order_out:

If not a null pointer, then the vertex ids of the graphs are stored here, in the order of the completion of their subtree.

father:

If not a null pointer, then the id of the father of each vertex is stored here.

dist:

If not a null pointer, then the distance from the root of the current search tree is stored here.

in_callback:

If not null, then it should be a pointer to a function of type igraph_dfshandler_t. This function will be called, whenever a new vertex is discovered.

out_callback:

If not null, then it should be a pointer to a function of type igraph_dfshandler_t. This function will be called, whenever the subtree of a vertex is completed.

extra:

Extra argument to pass to the callback function(s).

Returns: 

Error code.

Time complexity: O(|V|+|E|), linear in the number of vertices and edges.

2.2. igraph_dfshandler_t — Callback type for the DFS function

typedef igraph_bool_t igraph_dfshandler_t(const igraph_t *graph,
        igraph_integer_t vid,
        igraph_integer_t dist,
        void *extra);

igraph_dfs() is able to call a callback function, whenever a new vertex is discovered, and/or whenever a subtree is completed. These callbacks must be of type igraph_dfshandler_t. They have the following arguments:

Arguments: 

graph:

The graph that that algorithm is working on. Of course this must not be modified.

vid:

The id of the vertex just found by the depth-first search.

dist:

The distance (number of hops) of the current vertex from the root of the current search tree.

extra:

The extra argument that was passed to igraph_dfs().

Returns: 

A logical value, if TRUE (=non-zero), that is interpreted as a request to stop the DFS and return to the caller. If a DFS is terminated like this, then all elements of the result vectors that were not yet calculated at the point of the termination contain NaN.

See also: 

3. Random walks

3.1. igraph_random_walk — Perform a random walk on a graph

int igraph_random_walk(const igraph_t *graph, igraph_vector_t *walk,
                       igraph_integer_t start, igraph_neimode_t mode,
                       igraph_integer_t steps,
                       igraph_random_walk_stuck_t stuck);

Performs a random walk with a given length on a graph, from the given start vertex. Edge directions are (potentially) considered, depending on the mode argument.

Arguments: 

graph:

The input graph, it can be directed or undirected. Multiple edges are respected, so are loop edges.

walk:

An allocated vector, the result is stored here. It will be resized as needed.

start:

The start vertex for the walk.

steps:

The number of steps to take. If the random walk gets stuck, then the stuck argument specifies what happens.

mode:

How to walk along the edges in directed graphs. IGRAPH_OUT means following edge directions, IGRAPH_IN means going opposite the edge directions, IGRAPH_ALL means ignoring edge directions. This argument is ignored for undirected graphs.

stuck:

What to do if the random walk gets stuck. IGRAPH_RANDOM_WALK_STUCK_RETURN means that the function returns with a shorter walk; IGRAPH_RANDOM_WALK_STUCK_ERROR means that an error is reported. In both cases walk is truncated to contain the actual interrupted walk.

Returns: 

Error code.

Time complexity: O(l + d), where l is the length of the walk, and d is the total degree of the visited nodes.

3.2. igraph_random_edge_walk — Perform a random walk on a graph and return the traversed edges

int igraph_random_edge_walk(const igraph_t *graph,
                            const igraph_vector_t *weights,
                            igraph_vector_t *edgewalk,
                            igraph_integer_t start, igraph_neimode_t mode,
                            igraph_integer_t steps,
                            igraph_random_walk_stuck_t stuck);

Performs a random walk with a given length on a graph, from the given start vertex. Edge directions are (potentially) considered, depending on the mode argument.

Arguments: 

graph:

The input graph, it can be directed or undirected. Multiple edges are respected, so are loop edges.

weights:

A vector of non-negative edge weights. It is assumed that at least one strictly positive weight is found among the outgoing edges of each vertex. Additionally, no edge weight may be NaN. If either case does not hold, an error is returned. If it is a NULL pointer, all edges are considered to have equal weight.

edgewalk:

An initialized vector; the indices of traversed edges are stored here. It will be resized as needed.

start:

The start vertex for the walk.

steps:

The number of steps to take. If the random walk gets stuck, then the stuck argument specifies what happens.

mode:

How to walk along the edges in directed graphs. IGRAPH_OUT means following edge directions, IGRAPH_IN means going opposite the edge directions, IGRAPH_ALL means ignoring edge directions. This argument is ignored for undirected graphs.

stuck:

What to do if the random walk gets stuck. IGRAPH_RANDOM_WALK_STUCK_RETURN means that the function returns with a shorter walk; IGRAPH_RANDOM_WALK_STUCK_ERROR means that an error is reported. In both cases, edgewalk is truncated to contain the actual interrupted walk.

Returns: 

Error code.