Testing Guide¶

MultiGen has a comprehensive test suite with 1353 tests covering all aspects of code generation.

Running Tests¶

# Run all tests
make test

# Or with uv
uv run pytest

# Run specific test file
uv run pytest tests/test_backend_c_basics.py

# Run specific test
uv run pytest tests/test_backend_c_basics.py::TestCBasics::test_basic_function

# Run with verbose output
uv run pytest -v

Test Organization¶

Tests are organized by backend and feature:

tests/
  test_backend_c_*.py           # C backend tests
  test_backend_cpp_*.py         # C++ backend tests
  test_backend_rust_*.py        # Rust backend tests
  test_backend_go_*.py          # Go backend tests
  test_backend_haskell_*.py     # Haskell backend tests
  test_backend_ocaml.py         # OCaml backend tests
  test_backend_llvm_*.py        # LLVM backend tests
  test_pipeline.py              # Pipeline tests
  test_type_inference.py        # Type inference tests
  test_exception_handling.py    # Exception handling tests
  test_context_managers.py      # Context manager tests
  test_generators.py            # Generator/yield tests
  test_check_slice_fspec_finally.py  # Slicing, format specs, finally/else tests
  benchmarks/                   # Benchmark test cases

Benchmark Suite¶

MultiGen includes 7 comprehensive benchmarks:

Algorithms: fibonacci, quicksort, matmul, wordcount

Data Structures: list_ops, dict_ops, set_ops

# Run all benchmarks
make benchmark

# Generate benchmark report
make benchmark-report

Current status: 49/49 passing (100%) across all 7 backends.

Writing Tests¶

Test template:

class TestNewFeature:
    """Test suite for new feature."""

    def setup_method(self) -> None:
        self.converter = MultiGenPythonToCppConverter()

    def test_basic_case(self) -> None:
        """Test basic functionality."""
        code = '''
def example(x: int) -> int:
    return x + 1
'''
        result = self.converter.convert_code(code)
        assert "x + 1" in result

    def test_edge_case(self) -> None:
        """Test edge conditions."""
        # ...

Test Quality Standards¶

ALL tests must pass -- zero tolerance
No flaky tests
Unit tests: <100ms each
Full suite: <25s
Test happy path, edge cases, and error conditions

Debugging Tests¶

# Verbose output
pytest -vv

# Show print statements
pytest -s

# Drop into debugger on failure
pytest --pdb

# Run single test with full output
pytest tests/test_file.py::test_name -vv -s

Troubleshooting¶

Import errors:

export PYTHONPATH=src
pytest

Z3 not available:

pip install z3-solver
# Or skip Z3 tests
pytest -m "not verification"

Next Steps¶

Contributing Guide -- Contributing guidelines
Architecture -- Understanding architecture