Detect Cycle in Directed Graph
Given a directed graph represented as an adjacency list, the task is to determine if the graph contains any cycles.
graph LR
1 --> 2
2 --> 3
3 --> 4
4 --> 2
Depth-First Search (DFS) Approach
In an undirected graph, we use a visited[]
array to track visited nodes and detect if a node is revisited while doing depth first search thus indicating a cycle. However, this approach does not work for directed graphs because nodes can be visited multiple times from different paths and yet count of multiple visit to a same node won't indicate the presence of a cycle.
For example, consider the following graph:
graph LR
1 --> 2
3 --> 2
Here, node 2 can be visited twice and , however a simple DFS keeping the track of visited nodes cannot indicate the presence of a cycle.
Observing Sub-Graphs
A directed graph can be seen as a collection of sub-graphs. In the graph above, there are two sub-graphs: 1 → 2
and 3 → 2
. Running a cycle detection algorithm independently on each sub-graph will detect cycles for that particular sub-graph. Once the algorithm is complete for one sub-graph, we can unmark all the nodes which were visisted in one sub-graph and prepare them for a revisit again from another sub-graph.
While this approach works, it is inefficient as it repeatedly runs DFS on nodes where the checks were already done previously.
Optimized DFS Algorithm
We can optimize the above approach by:
- Initializing the
visited[]
array once. - Introducing a
processed[]
array to track sub-graphs already checked for cycles.
Here’s the code:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 |
|
Breadth-First Search (BFS) Approach
To understand the BFS-based approach, it’s essential to first understand topological sorting of directed graphs.
Consider the graph below:
graph LR
1 --> 2
2 --> 3
3 --> 4
1 --> 4
This graph can be viewed as a dependency graph, where if you want to processing node 1
then it can be seen that it requires prior processing of nodes 2
and 4
first. Now, if we go on to process node 2
the we can also see that it then require processing of node 3
which then again require processing of node 4
. Hence in order to process node 1
we have to go in processing order as follows: . This order is know is topological order of the graph.
Topological Sorting Algorithm
Topological sorting processes nodes in decreasing order of their in-degrees. Below is the algorithm to print the topological order of a directed graph.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 |
|
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 |
|
Khan's Algorithm for Cycle Detection
There is an interesting observation that can be made. Let's suppose the graph looks like this:
graph LR
1 --> 2
2 --> 3
3 --> 1
1 --> 4
We can see that a cycle exist in the graph and also conclude that no matter how we process this graph, topological sort can never exist!
Khan's algorithm is a modification of the topological sorting algorithm. By counting the nodes added to the queue, we can determine if the graph contains a cycle. If the count of processed nodes equals the total number of nodes in the graph, it implies the absence of cycles.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 |
|