This is an in progress blog post about my sudoku solver: nsudoku-solve

N-Sudoku Solver

What is a Sudoku Puzzle?

Sudoku is a fun puzzle, if you somehow are reading this without knwing how it works you can check the Wikipedia Page about it, or see my short explanation on it below, otherwise you can skip until the next section.

Explanation

A Sudoku puzzle is typically composed of square cells forming a bigger square, the normal puzzle is a 9×9 square composed of 3×3 cells, but it can have any cell size, starting at 1×1, and going up to N×N.

The 1×1 Sudoku is trivial, it looks like this when solved:

1×1 Sudoku
+---+
| 1 |
+---+

A more interesting, but still simple example is a solved 2×2 Sudoku:

2×2 Sudoku
+-----+-----+
| 1 2 | 3 4 |
| 3 4 | 1 2 |
+-----+-----+
| 2 1 | 4 3 |
| 4 3 | 2 1 |
+-----+-----+

The important details are that every row, column and cell must have the numbers 1 up to 4 only once:

2×2 Row
| 1 2 | 3 4 |

2×2 Column
-
1
3
-
2
4
-

2×2 Cell
+-----+
| 1 2 |
| 3 4 |
+-----+

This remains valid for all Sudokus; a 9×9 Sudoku has numbers 1 through 9 in its cells, a 100×100 Sudoku has numbers 1 through 100 in its cells.

Keeping this invariant makes it possible to solve Sudokus; for example, solving a 2×2 cell can be done like this:

2×2 Cell
+-----+
| 1 . |
| 3 4 |
+-----+

We can see there is a 2 missing, thus we can fill the empty spot:

2×2 Cell
+-----+
| 1 2 |
| 3 4 |
+-----+

That is the basics, we will go through more advanced techniques in the implementation section.

Implementation

Representing a Sudoku in Rust

In order to solve the Sudokus, we first need to have an in memory representation of the puzzles, that is what we will be going over in this section, if you are familiar with Rust and can imagine how to represent a Sudoku then you can probably skip over this section.

Rust Arrays

A simple way to represent a 2×2 Sudoku would be using Arrays inside Arrays, commonly called 2D Arrays, which are part of the N-Dimensional Array family.

Representing our example 2×2 Sudoku using 2D Arrays would look like this:

// 2×2 Sudoku
// +-----+-----+
// | 1 2 | 3 4 |
// | 3 4 | 1 2 |
// +-----+-----+
// | 2 1 | 4 3 |
// | 4 3 | 2 1 |
// +-----+-----+
let sudoku = [
        [1, 2, 3, 4],
        [3, 4, 1, 2],
        [2, 1, 4, 3],
        [4, 3, 2, 1],
    ];

Contrary to your intuition if you come from a garbage collected language like Java, C# or Python, this is not incredibly inefficient; 2D arrays in most languages are very inefficient as you end up having an array of pointers/references, an array of references is not ideal as you have to read two references in order to access one element (the reference to the array, and the second reference because its an array of references).

Here it is not a problem as arrays are stack allocated in Rust, therefore this is a contiguous chuk of memory, manipulating it will make your CPU very happy. It does have some pretty big drawbacks though:

Stack space is limited.

We plan to support 225×225 Sudokus, which are pretty big (our future implementation would use 50,625B just to store the values).

The stack in Linux has a 8MiB size by default which would fill after ~165 Sudokus, we could easily have more Sudokus in memory than that.
You need to know the size of stack elements beforehand.

In Rust you can't have dynamically sized types on the stack, therefore you must have a generic Sudoku struct in order to support this, sadly you need const generics in order to have an ergonomic implementation, this is currently (2023-03-21) an unstable Rust feature, so we will not be using it.

It also requires us to use a different type for each Sudoku size, which would be a pain to implement a CLI around.

For the above reasons we will instead be using the heap to store the Sudoku values, this is not as limiting as the Stack, but tends to be slower.

I could implement a relatively generic 2D Array on the heap myself, but we want to imlpement a Sudoku solver, not an efficient 2D Array, so we will use an off the shelf solution: ndarray, if you are familiar with numpy, then ndarray is just the efficient arrays, most other linear algebra/statistics operation are build on top of ndarray and published as separate crates (Rust packages).

Sudoku Values

Our values are dependent on the size of the Sudoku: for a N×N Sudoku, it allows values between 1 and N inclusive (in Rust syntax: 1..=N), as it is never 0, we can try using one of the NonZero* Rust integer types, the smallest one, NonZeroU8, fits our needs:

A NonZeroU8 represents values between 1..=255, this means that the biggest Sudoku we can represent is a 15×15 Sudoku, I will accept this as big enough.
Bigger Sudokus use O(N²) more memory, so it easily becomes impractical to run most algorithms in my Laptop's 8GiBs of RAM.
NonZero* has the added benefit of making some Rust enums smaller; specifically Option<NonZero*> is guaranteed to have the same size as NonZero*.

Using the newtype idiom we can create a SudokuValue type:

pub struct SudokuValue(NonZeroU8);

This prevents our type from being treated as a NonZeroU8, it makes no sense to support most operations on integers on SudokuValues so we might as well stop ourselves from using them at the type level.

We can bring back some operations that we do want though:

#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
pub struct SudokuValue(NonZeroU8);

If you are not familiar with Rust, these are derive macros, in simple terms, they automatically implement some functions on our types:

Debug: Allows debug formatting of our types: println!("{my_debug_type:?}").
Clone: Allows our types to be cloned (create an in memory copy of the type).
Copy: Signals that clone is cheap (no additional logic/small type), and that the values of this type should be copied, not moved.
PartialEq/Eq: Allows our type to be compared to itself (my_type == my_type_too).
Hash: Allows our type to be hashed/used in a hashed collection (HashSets/HashMaps).

Full Sudoku Puzzle

With this background we can now represent the full Sudoku Puzzle:

#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Sudoku {
    grid_w: usize,
    values: Array2<Option<SudokuValue>>,
}

We keep the grid width inside the structure, as a Sudoku has three important values:

The grid width

Used to compute the size of the grids (a 9×9 Sudoku has a 3×3 grid of 3×3 cell chunks)
The order of the Sudoku, ie. 9 for a 9×9 Sudoku

Used to compute the valid values (ie. 1..=9 for a 9×9 Sudoku).
The number of cells in the Sudoku (ie. 81 for a 9×9).

Used to iterate over all the cells/create the Sudoku itself.

If you stare at these values for a bit, you can see their relationship: the order is just the grid width squared, and the number of cells, is just the order squared, therefore:

order = grid_w²
num_cells = order² = grid_w⁴

An interesting fact about computers is that calculating the square of a number is much faster than calculating the square root, especially for integers, Rust also doesn't have a way to express the integer square root of a number so you'd have to cast the number to a floating point value calculate the square root, and then cast it back to an integer, this is prone to errors for big integers and is awkward to write:

let grid_w = (num_cells as f64).sqrt().sqrt() as usize;

Therefore, storing the grid_w makes it easy to derive the order and number of cells, without having to resort to casting values to floats.

Lets also implement some nice methods on our Sudoku struct:

// Helpful methods
impl Sudoku {
    pub fn empty(grid_w: usize) -> Self { todo!() }
    pub fn try_empty(grid_w: usize) -> Result<Self> { todo!() }
    pub unsafe fn empty_unchecked(grid_w: usize) -> Self { todo!() }

    pub fn new(grid_w: usize, values: Vec<Option<SudokuValue>>) -> Self { todo!() }
    pub fn try_new(grid_w: usize, values: Vec<Option<SudokuValue>>) -> Result<Self> { todo!() }
    pub unsafe fn new_unchecked(grid_w: usize, values: Vec<Option<SudokuValue>>) -> Self { todo!() }
}

Implementing empty_unchecked should be the easiest one to check off:

/// Create an empty Sudoku with grid width `grid_w`
///
/// # Safety
///
/// The grid width (`grid_w`) must be valid (`Self::valid_grid_width`)
pub unsafe fn empty_unchecked(grid_w: usize) -> Self {
    // debug: Make sure invariants are met
    debug_assert!(Self::valid_grid_width(grid_w), "Invalid grid width");
    Self {
        grid_w,
        values: Array2::default((grid_w * grid_w, grid_w * grid_w)),
    }
}

We assume that grid_w is valid and create a 2D Array of size grid_w²×grid_w², we mark this function as unsafe because the rest of the code assumes that grid_w is valid, so we might do something wrong if we don't check it.

Sometimes we can be sure that grid_w is right (ie. if we already have a Sudoku and are using its grid_w to create a new empty Sudoku). This way we can skip checking invariants which are upheld elsewhere. Checking the invariant is simple though so we do still check it in debug builds to help with debugging if you do end up using this function wrong.

try_empty is the next one we will implement:

/// Create an empty Sudoku with grid width `grid_w`
///
/// Returns `Err` if the `grid_w` is invalid (`Self::valid_grid_width`)
pub fn try_empty(grid_w: usize) -> Result<Self> {
    Self::valid_grid_width(grid_w)
        .then(|| {
            // Safety: we check that the `grid_w` is valid
            unsafe { Self::empty_unchecked(grid_w) }
        })
        .ok_or(SudokuError::InvalidGridWidth { grid_w })
}

We first check the grid width, if it's valid we can call new_unchecked safely, otherwise we return an error.

This makes the implementation of empty dead simple:

/// Create an empty Sudoku with grid width `grid_w`
///
/// **Panics** if `grid_w` is not valid (`Self::valid_grid_width`)
pub fn empty(grid_w: usize) -> Self {
    Self::try_empty(grid_w).unwrap()
}

We just unwrap the value in try_empty, this exists the program with an error message if there is an error.

Having such a flexible API is overkill for this program, but as it is just a learning process, doing it right is also part of the process.

Now that we have an idea of how to do this for empty, I will only show you how to do try_new, and you can probably implement the panicking and the unsafe version yourself:

/// Create a Sudoku with grid width `grid_w` with the provided values
///
/// **Fails**
///
/// - `grid_w` is invalid
/// - `values.len()` is not `grid_w⁴`
/// - The values are not valid for the size of the Sudoku
pub fn try_new(grid_w: usize, values: Vec<Option<SudokuValue>>) -> Result<Self> {
    Self::validate_grid_width(grid_w)?;
    // Make sure all values are in the correct range and comply with the Sudoku invariants
    Self::validate_values(grid_w, &values)?;
    // Safety: we check invariants beforehand
    Ok(unsafe { Self::new_unchecked(grid_w, values) })
}

This function is very boring as all the interesting stuff happens inside Self::validate_values:

/// Checks if there are enough values and all values are valid for the specified Sudoku size
pub fn validate_values(grid_w: usize, values: &[Option<SudokuValue>]) -> Result<()> {
    // Correct number of values
    let expected = grid_w * grid_w * grid_w * grid_w;
    if values.len() != expected {
        return Err(SudokuError::InvalidValuesAmount {
            len: values.len(),
            expected,
        });
    }
    // Values are Valid
    if let Some(value) = values
        .iter()
        .copied()
        .flatten()
        .find(|&value| !Self::valid_value(grid_w, value))
        .map(|value| value.0.get())
    {
        return Err(SudokuError::InvalidValue {
            value,
            max: grid_w * grid_w,
        });
    }
    let mut vals = Vec::with_capacity(grid_w * grid_w);
    let xs = ArrayView2::from_shape((grid_w * grid_w, grid_w * grid_w), values).unwrap();
    // Verify Rows
    Self::validate_rows_scratch(xs, &mut vals)?;
    // Verify Columns
    Self::validate_columns_scratch(xs, &mut vals)?;
    // Verify cells
    Self::validate_cells_scratch(grid_w, xs, &mut vals)
}

The code is prtty linear:

We assert that the number of values is grid_w⁴.
We assert that the values are in the valid range (1..=grid_w²).
We assert that there are no duplicate values in the rows, columns or cells.

Using the ? operator we forward the errors returned by the Self::validate_* functions, these unctions also take a buffer which we initialize appropriately to make sure it is only allocated once, this prevents many small allocations.

All three validate_* functions are very similar so we will only look at the Self::validate_rows_scratch function:

/// Validate the Sudoku invariants on the rows
///
/// More Efficient as it doesn't need an extra allocation if you already have a buffer
fn validate_rows_scratch(
    values: ArrayView2<Option<SudokuValue>>,
    vals: &mut Vec<Option<SudokuValue>>,
) -> Result<()> {
    if let Some((y, x)) = Self::invalid_sudoku_axis(values.rows(), vals) {
        return Err(SudokuError::WrongValueSet { pos: (x, y) });
    }
    Ok(())
}

/// Check if a Sudoku axis has an invalid value
///
/// Returns the index of the axis, and the index of the offending element in a tuple
///
/// An axis could be a row, column or cell
///
/// Passing an appropriately sized (grid_w²) vector as scratch, makes this function not allocate
/// any extra space
fn invalid_sudoku_axis<'a, T, I>(
    axis: impl IntoIterator<Item = I>,
    scratch: &'a mut Vec<T>,
) -> Option<(usize, usize)>
where
    I: IntoIterator<Item = &'a T>,
    T: PartialEq + Copy,
{
    for (i, a) in axis.into_iter().enumerate() {
        scratch.clear();
        scratch.extend(a.into_iter().copied());
        if let Some(j) = Self::duplicate_value_position(scratch) {
            return Some((i, j));
        }
    }
    None
}

/// If there is a duplicate value, return its index
fn duplicate_value_position<T: PartialEq>(vals: &[T]) -> Option<usize> {
    vals.iter()
        .enumerate()
        .position(|(i, val)| vals[i + 1..].contains(val))
}

Self::validate_rows_scratch is a soft wrapper around Self::invalid_sudoku_axis that returns a nice error message, invalid_sudoku_axis is generic over the axis, whether it is the rows, columns or cells, it will still find duplicate values in them, to do this it takes a single row/column/cell and collects it into a vector, then it goes over the vector searching for the position of a duplicate value, returning the index of the axis and the duplicate value.

With this we can finally represent a Sudoku in memory.

You can look at the full code up to this point in this repository (it has extra tests and helper functions which were not discussed, ie. a function to pretty print Sudokus).

The code we have discussed lives in libnsudoku-solver/src/lib.rs.

I will continue expanding this post in the near future once I add the CLI and solver code, but I will leave it here for now.