Backtracking is a systematic method for exploring all potential solutions by building candidates incrementally and abandoning a candidate as soon as it cannot lead to a valid solution.
Key insight: backtracking turns exponential brute force into something practical by cutting off invalid branches early. The earlier you prune, the faster the search.
A state space tree is a rooted tree representing all possible states of a backtracking algorithm. Each node is a partial solution; edges represent decisions.
Without pruning: tree has b^d leaves (b = branching factor, d = depth). With pruning: many subtrees are never explored.
```python
def backtrack(state, decisions):
    if is_solution(state):
        record_solution(state)
        return
    for choice in get_choices(state, decisions):
        if is_valid(state, choice):      # prune
            apply(state, choice)         # choose
            backtrack(state, decisions)  # recurse
            undo(state, choice)          # unchoose
```
Pattern recognition: if a problem asks "find all configurations satisfying constraints" or "does a valid arrangement exist", backtracking is the first technique to consider.
Rejects partial solutions that cannot lead to a valid complete solution. The earlier this fires, the more work is saved.
State mutation and reversal. The candidate is modified in place and restored after recursion — no copying needed.
Recognises when a complete, valid solution has been built. Records or returns the result.
Recursive backtracking uses the call stack to store state. Iterative backtracking uses an explicit stack — identical complexity, avoids stack overflow for very deep searches.
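As a sketch of the iterative form — the function name and frame layout are illustrative, not a canonical API — subset enumeration can be driven by an explicit stack of (next index, partial subset) frames:

```python
def iterative_subsets(nums):
    """Enumerate all subsets of nums with an explicit stack
    instead of the call stack (a sketch)."""
    results = []
    # each frame: (index of the next element to decide on, subset so far)
    stack = [(0, [])]
    while stack:
        index, subset = stack.pop()
        if index == len(nums):
            results.append(subset)
            continue
        # two branches: exclude or include nums[index]
        stack.append((index + 1, subset))
        stack.append((index + 1, subset + [nums[index]]))
    return results
```

The complexity is identical to the recursive version; the only difference is that the pending decisions live on the heap rather than the call stack.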
Place N queens on an N x N chessboard so that no two queens attack each other — no shared row, column, or diagonal.
| N | Solutions | Unique (symmetry) |
|---|---|---|
| 4 | 2 | 1 |
| 8 | 92 | 12 |
| 12 | 14,200 | 1,787 |
| 14 | 365,596 | 45,752 |
```python
def is_safe(board, row, col):
    for prev_row in range(row):
        prev_col = board[prev_row]
        # same column
        if prev_col == col:
            return False
        # same diagonal
        if abs(prev_col - col) == row - prev_row:
            return False
    return True
```
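To turn the check into a full solver, a minimal driver might look like this (a sketch: `board[r]` holds the column of the queen in row r, and `is_safe` is restated so the snippet stands alone):

```python
def is_safe(board, row, col):
    # restated from above: column and diagonal conflicts
    for prev_row in range(row):
        prev_col = board[prev_row]
        if prev_col == col or abs(prev_col - col) == row - prev_row:
            return False
    return True

def solve_n_queens(n):
    """Collect every solution as a list of column indices, one per row."""
    solutions, board = [], [0] * n

    def place(row):
        if row == n:
            solutions.append(board[:])
            return
        for col in range(n):
            if is_safe(board, row, col):
                board[row] = col  # choose
                place(row + 1)    # recurse
        # no explicit undo needed: board[row] is simply overwritten,
        # and is_safe never looks at rows >= the current one

    place(0)
    return solutions
```

For N = 4 this yields exactly the two solutions in the table above, one of which is the `[1, 3, 0, 2]` found by the trace below.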
Row 0: col 0 ✓
Row 1: col 0 ✗ (column)
Row 1: col 1 ✗ (diagonal)
Row 1: col 2 ✓
Row 2: col 0 ✗ (column)
Row 2: col 1 ✗ (diagonal with row 1)
Row 2: col 2 ✗ (col conflict row 1)
Row 2: col 3 ✗ (diagonal)
← backtrack to row 1
Row 1: col 3 ✓
Row 2: col 0 ✗ (column)
Row 2: col 1 ✓
Row 3: col 0 ✗ col 1 ✗ col 2 ✗ col 3 ✗
← backtrack
Row 2: col 2 ✗ ...
← backtrack to row 0
Row 0: col 1 ✓
Row 1: col 3 ✓
Row 2: col 0 ✓
Row 3: col 2 ✓ ★ SOLUTION [1,3,0,2]
Check all N^N arrangements — 4^4 = 256 for N=4
Typically explores only a small fraction of the tree
| Approach | 8-Queens nodes |
|---|---|
| Brute force (8^8) | 16,777,216 |
| Backtracking | ~114 |
| Speedup | ~147,000x |
The power of pruning: each invalid placement at row r eliminates an entire subtree of N^(N-r-1) nodes.
Fill a 9x9 grid so every row, column, and 3x3 box contains digits 1–9 exactly once.
```python
def solve_sudoku(board):
    cell = find_empty(board)
    if cell is None:
        return True  # solved
    row, col = cell
    for num in range(1, 10):
        if is_valid(board, row, col, num):
            board[row][col] = num
            if solve_sudoku(board):
                return True
            board[row][col] = 0  # undo
    return False  # trigger backtracking
```
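`solve_sudoku` assumes two helpers that are not shown. Minimal sketches, with 0 marking an empty cell:

```python
def find_empty(board):
    """Return (row, col) of the first empty cell, or None if full."""
    for r in range(9):
        for c in range(9):
            if board[r][c] == 0:
                return (r, c)
    return None

def is_valid(board, row, col, num):
    """Check that num appears nowhere in the row, column, or 3x3 box."""
    if num in board[row]:
        return False
    if any(board[r][col] == num for r in range(9)):
        return False
    br, bc = 3 * (row // 3), 3 * (col // 3)
    return all(board[r][c] != num
               for r in range(br, br + 3)
               for c in range(bc, bc + 3))
```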
If only one number is valid for a cell, place it immediately — no branching needed
If a number can only go in one cell in a row/col/box, place it
Reduce domains before branching — eliminates options proactively
Fill the cell with fewest remaining options first — fail faster
Sudoku with constraint propagation solves most newspaper puzzles without any backtracking at all. Hard puzzles (17-clue minimum) still require search.
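The first rule above ("naked singles") can be sketched as a propagation pass — the helper name is illustrative, and this is deliberately not a full solver:

```python
def naked_singles(board):
    """Repeatedly fill cells whose candidate set has exactly one value.
    A sketch of the simplest propagation rule; 0 marks an empty cell."""
    progress = True
    while progress:
        progress = False
        for r in range(9):
            for c in range(9):
                if board[r][c] != 0:
                    continue
                br, bc = 3 * (r // 3), 3 * (c // 3)
                used = (set(board[r])
                        | {board[i][c] for i in range(9)}
                        | {board[i][j]
                           for i in range(br, br + 3)
                           for j in range(bc, bc + 3)})
                candidates = set(range(1, 10)) - used
                if len(candidates) == 1:
                    board[r][c] = candidates.pop()
                    progress = True
```

Running passes like this before (and between) branching steps is what lets easy puzzles fall without any search at all.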
Given a set S = {s1, s2, ..., sn} and a target T, find all subsets of S that sum to T.
```python
def subset_sum(nums, target, start, curr, res):
    if target == 0:
        res.append(curr[:])
        return
    if target < 0:
        return  # prune: overshot
    for i in range(start, len(nums)):
        curr.append(nums[i])
        subset_sum(nums, target - nums[i], i + 1, curr, res)
        curr.pop()  # backtrack
```
If nums[i] > remaining target, skip all subsequent elements — they are larger
If nums[i] == nums[i-1] at same recursion level, skip to avoid duplicate subsets
Maintain sum of unused elements; if remaining_sum < target, prune — impossible to reach target
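The first two rules can be folded into the basic routine. A sketch that assumes the input is sorted first:

```python
def subset_sum_pruned(nums, target):
    """Subset sum with sorted-input pruning: stop once an element
    exceeds the remaining target, and skip duplicate values that
    recur at the same recursion level."""
    nums = sorted(nums)
    res = []

    def search(start, remaining, curr):
        if remaining == 0:
            res.append(curr[:])
            return
        for i in range(start, len(nums)):
            if nums[i] > remaining:
                break      # sorted: every later element is larger too
            if i > start and nums[i] == nums[i - 1]:
                continue   # duplicate value at this level
            curr.append(nums[i])
            search(i + 1, remaining - nums[i], curr)
            curr.pop()     # backtrack

    search(0, target, [])
    return res
```

Sorting costs O(n log n) once and turns the "skip larger elements" rule into a single `break`, cutting off the rest of the level in one step.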
Subset sum is NP-complete. Backtracking with pruning is practical for moderate inputs (n < ~40). For larger inputs, dynamic programming or meet-in-the-middle may be preferable.
```python
def permute(nums, start, result):
    if start == len(nums):
        result.append(nums[:])
        return
    for i in range(start, len(nums)):
        nums[start], nums[i] = nums[i], nums[start]
        permute(nums, start + 1, result)
        nums[start], nums[i] = nums[i], nums[start]
```
Plain permutations admit no pruning — all n! arrangements are valid. When constraints are added (e.g., "no two adjacent elements differ by more than 2"), pruning becomes effective.
For inputs with repeated elements (e.g., [1, 1, 2]):
Skip index i if nums[i] == nums[i-1] and index i-1 was not used at this level.

```python
def combine(n, k, start, current, result):
    if len(current) == k:
        result.append(current[:])
        return
    # pruning: we still need (k - len(current)) more values,
    # so only iterate while enough candidates remain
    for i in range(start, n - (k - len(current)) + 2):
        current.append(i)
        combine(n, k, i + 1, current, result)
        current.pop()
```
Pruning the search space: at each level, if not enough remaining elements to fill the combination, stop immediately.
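To see the size bound pay off, a small instrumented sketch (an illustrative helper, not part of any library) can count recursive calls with and without it:

```python
def count_calls(n, k, prune):
    """Count recursive calls made while enumerating C(n, k)
    combinations of 1..n, with or without the size-based bound."""
    calls = 0

    def go(start, chosen):
        nonlocal calls
        calls += 1
        if chosen == k:
            return
        # with pruning, stop once too few candidates remain
        stop = n - (k - chosen) + 2 if prune else n + 1
        for i in range(start, stop):
            go(i + 1, chosen + 1)

    go(1, 0)
    return calls
```

Both variants produce the same C(n, k) combinations; the pruned one simply never enters a prefix that cannot be completed.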
| Property | Permutations | Combinations |
|---|---|---|
| Order matters | Yes | No |
| Count | n! / (n-k)! | n! / (k!(n-k)!) |
| Start index | Fixed (swap) | Advances |
| Typical pruning | Constraint | Size + constraint |
Combinations are a strict subset of the permutation search space. Using the start index avoids generating [1,2] and [2,1] separately.
Without pruning: explore all 2^n subsets, filter by size k. With pruning: only generate the C(n,k) valid combinations directly.
Assign colours to vertices of an undirected graph such that no two adjacent vertices share the same colour, using at most k colours.
```python
def graph_colour(graph, k, colours, vertex):
    if vertex == len(graph):
        return True  # all coloured
    for c in range(1, k + 1):
        if is_safe(graph, colours, vertex, c):
            colours[vertex] = c
            if graph_colour(graph, k, colours, vertex + 1):
                return True
            colours[vertex] = 0  # backtrack
    return False
```
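`graph_colour` relies on an `is_safe` check that is not shown. A minimal sketch, assuming `graph` is an adjacency matrix and `colours[u] == 0` means vertex u is still uncoloured:

```python
def is_safe(graph, colours, vertex, c):
    """Colour c is safe for vertex if no already-coloured
    neighbour uses it (uncoloured vertices hold 0, never a colour)."""
    return all(not graph[vertex][u] or colours[u] != c
               for u in range(len(graph)))
```

With this check, a triangle is found to be 3-colourable but not 2-colourable, as expected.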
Four colour theorem guarantees k=4 suffices for planar graphs
Compilers assign variables to CPU registers (graph = interference graph)
Exams, tasks, or frequencies with conflict constraints
Courses sharing students cannot be in the same slot
The minimum k for which a valid colouring exists is the chromatic number χ(G). Finding χ(G) is NP-hard.
```python
def hamiltonian(graph, path, visited):
    if len(path) == len(graph):
        if graph[path[-1]][path[0]]:
            return True  # cycle found
        return False
    for v in range(len(graph)):
        if not visited[v] and graph[path[-1]][v]:
            visited[v] = True
            path.append(v)
            if hamiltonian(graph, path, visited):
                return True
            path.pop()
            visited[v] = False  # backtrack
    return False
```
If an unvisited vertex has no unvisited neighbours, prune immediately
If removing the current vertex disconnects the unvisited subgraph, prune
Choose the vertex with fewest unvisited neighbours next — reduces branching factor
Hamiltonian cycle is NP-complete. No polynomial algorithm is known. Backtracking is the standard exact solver, but impractical for graphs with hundreds of vertices.
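One detail the listing leaves implicit is the seeding: `path` must already contain a start vertex and `visited` must mark it. A self-contained usage sketch (the solver is restated so the snippet runs on its own):

```python
def hamiltonian(graph, path, visited):
    # restated from the listing above
    if len(path) == len(graph):
        return bool(graph[path[-1]][path[0]])  # can we close the cycle?
    for v in range(len(graph)):
        if not visited[v] and graph[path[-1]][v]:
            visited[v] = True
            path.append(v)
            if hamiltonian(graph, path, visited):
                return True
            path.pop()
            visited[v] = False  # backtrack
    return False

# a 4-cycle has a Hamiltonian cycle; a star graph does not
cycle4 = [[0, 1, 0, 1],
          [1, 0, 1, 0],
          [0, 1, 0, 1],
          [1, 0, 1, 0]]
path, visited = [0], [True, False, False, False]
found = hamiltonian(cycle4, path, visited)
```

On success, `path` holds the cycle's vertex order starting from the seed vertex.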
A CSP is defined by three components:
Variables X1, X2, ..., Xn — the unknowns to assign
Domains D1, D2, ..., Dn — possible values for each variable
Constraints — restrictions on which combinations of values are allowed
Why CSP framing matters: provides a uniform representation — one algorithm solves many problems. Separates problem modelling from search strategy.
| Problem | Variables | Domains |
|---|---|---|
| N-Queens | Queen per row | {1..N} |
| Sudoku | Each cell | {1..9} |
| Graph colour | Vertex colours | {1..k} |
| Map colour | Region colours | {R,G,B,Y} |
| Scheduling | Task slots | Available slots |
Constraints: N-Queens — no shared col/diag. Sudoku — row/col/box uniqueness. Graph colouring — adjacent vertices differ. Scheduling — no resource conflicts.
| Strategy | Effect |
|---|---|
| Plain backtracking | Explores many dead-end branches |
| + MRV | Detects failures earlier — fail-first |
| + Forward checking | Prunes domains proactively after each assignment |
| + MAC (AC-3) | Near-optimal pruning; solves most CSPs efficiently |
Without pruning, backtracking degenerates to brute force. Effective pruning is the difference between practical and intractable.
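The first two rows of the table can be combined in a compact solver. A sketch of MRV plus forward checking — the function names and the `conflicts` interface are illustrative, not a standard API:

```python
def solve_csp(domains, conflicts):
    """Backtracking with MRV and forward checking (a sketch).
    domains: {var: set of values}.
    conflicts(v1, x1, v2, x2) -> True if assigning x1 to v1
    forbids value x2 for variable v2."""

    def search(domains, assignment):
        unassigned = [v for v in domains if v not in assignment]
        if not unassigned:
            return assignment
        # MRV: branch on the variable with the fewest options left
        var = min(unassigned, key=lambda v: len(domains[v]))
        for val in sorted(domains[var]):
            # forward checking: drop values that conflict with val
            new_domains = {
                v: ({val} if v == var
                    else {x for x in domains[v]
                          if not conflicts(var, val, v, x)})
                for v in domains
            }
            # dead end if any variable loses its whole domain
            if any(not new_domains[v] for v in unassigned):
                continue
            result = search(new_domains, {**assignment, var: val})
            if result is not None:
                return result
        return None

    return search(domains, {})
```

Because forward checking removes conflicting values the moment a variable is assigned, no later assignment can clash with an earlier one — the explicit consistency check of plain backtracking disappears.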
Reject partial solutions that already violate a constraint
In optimisation, reject branches that cannot improve the best known solution
Avoid exploring configurations that are rotations/reflections of already-explored solutions
Skip a partial solution if another is provably at least as good
Proactively reduce variable domains rather than waiting for conflicts
Rule of thumb: the cost of the pruning check must be less than the cost of exploring the pruned subtree. Cheap checks that eliminate large subtrees are the sweet spot.
For problems with small state spaces (N-Queens with N ≤ 30), represent sets as bitmasks for O(1) constraint checks.
```python
def solve(row, cols, diag1, diag2, n):
    if row == n:
        return 1
    count = 0
    available = ((1 << n) - 1) & ~(cols | diag1 | diag2)
    while available:
        bit = available & (-available)  # lowest set bit
        count += solve(row + 1,
                       cols | bit,
                       (diag1 | bit) << 1,
                       (diag2 | bit) >> 1, n)
        available ^= bit
    return count
```
Combine DFS with depth limits. Useful when solution depth is unknown and memory is constrained.
For hard CSPs, restart from a random initial state if the search stalls. Avoids unproductive subtrees.
Distribute independent subtrees across threads. Work stealing for load balancing. Near-linear speedup for many-branch problems.
Branch and bound adds a bounding function that estimates the best possible solution achievable from a partial solution.
| Aspect | Backtracking | Branch & Bound |
|---|---|---|
| Goal | Feasible solutions | Optimal solution |
| Pruning | Constraint violations | Bound vs best-so-far |
| Bounding fn | Not required | Essential |
| Problems | CSPs, enumeration | TSP, knapsack, scheduling |
Branch and bound is the standard approach for integer linear programming (ILP) solvers like CPLEX and Gurobi. The quality of the bounding function determines solver performance.
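At a much smaller scale, the same idea can be sketched for 0/1 knapsack. The fractional (LP-relaxation) bound used here is one common choice; names are illustrative, and weights are assumed positive:

```python
def knapsack_bb(items, capacity):
    """0/1 knapsack by branch and bound (a sketch).
    items: list of (value, weight) pairs with positive weights."""
    # sort by value density so the fractional bound is the greedy optimum
    items = sorted(items, key=lambda it: it[0] / it[1], reverse=True)
    best = 0

    def bound(i, value, room):
        # optimistic estimate: fill the remaining room fractionally
        for v, w in items[i:]:
            if w <= room:
                value += v
                room -= w
            else:
                return value + v * room / w
        return value

    def search(i, value, room):
        nonlocal best
        best = max(best, value)
        if i == len(items) or room == 0:
            return
        if bound(i, value, room) <= best:
            return  # prune: this branch cannot beat the best-so-far
        v, w = items[i]
        if w <= room:
            search(i + 1, value + v, room - w)  # take item i
        search(i + 1, value, room)              # skip item i

    search(0, 0, capacity)
    return best
```

The bounding function is exactly what distinguishes this from plain backtracking: branches are cut not because they violate a constraint, but because their optimistic estimate cannot beat the incumbent.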
Generate all possible candidates, then check each for validity. Always explores the full search space. Time: O(b^d).
Generate candidates incrementally, pruning invalid branches early. Explores only a fraction of the space. Worst case same; best case dramatically faster.
| Problem | Brute Force | Backtracking | Speedup |
|---|---|---|---|
| 8-Queens | 16,777,216 | ~114 nodes | ~147,000x |
| Sudoku (easy) | 9^51 | ~100 nodes | astronomical |
| Graph colour (sparse) | k^n | ~O(k·n) | exponential |
Backtracking is not a different complexity class — it is a constant-factor (often enormous) improvement within the same exponential class. For polynomial-time solutions, you need a fundamentally different algorithm.
| Problem | Worst Case | Typical with Pruning | Notes |
|---|---|---|---|
| N-Queens | O(N!) | Much less in practice | One queen per row cuts the naive space to N^N; column checks cut it to N! |
| Sudoku | O(9^81) | Near-instant for most | Constraint propagation dominates |
| Subset Sum | O(2^n) | Depends on target | DP may be better for small targets |
| Permutations | O(n!) | O(n!) unconstrained | No pruning for plain case |
| Combinations C(n,k) | O(C(n,k)) | O(C(n,k)) | Size pruning only |
| Graph Colouring | O(k^n) | Problem-dependent | Sparse graphs prune well |
| Hamiltonian Cycle | O(n!) | Problem-dependent | NP-complete |
| TSP (B&B) | O(n!) | Manageable for n<25 | Bounding fn quality is key |
Recursive backtracking: O(d) stack space where d = max recursion depth. Storing all solutions: O(S * d) where S = number of solutions.
Backtracking complexity is problem-instance dependent. Analyse the constraint structure, not just the worst case, to predict real-world performance.
| Source | Description |
|---|---|
| Cormen et al. | Introduction to Algorithms (CLRS) — backtracking and branch-and-bound |
| Skiena | The Algorithm Design Manual — practical strategies and war stories |
| Russell & Norvig | AI: A Modern Approach — CSP chapter with AC-3, MRV, MAC |
| Knuth | TAOCP Vol. 4 — exhaustive enumeration and Dancing Links |
| LeetCode | Backtracking tag — curated practice problems |
| Sedgewick | Algorithms — permutations, combinations, constraint search |