I was reading Ned Bachelder’s blog “Why your mock doesn’t work”1 and it triggered an epiphany in me about a testing pattern that I’ve been using for a while without being aware that there might be an aphorism on the practice.
Patch where the object is used; not where it’s defined.
To understand it, consider the example below. Here, you have a module containing a function that fetches data from some fictitious database.
# db.py
from __future__ import annotations
import random
def get_data() -> list[int]:
# ...run some side effects and return data
# from a fictitous database.
return [random.randint(100, 200) for _ in range(4)]
Let’s say another module named service.py
imports the get_data
function and calls that
inside of a function named process_data
:
# service.py
from __future__ import annotations
from db import get_data
def process_data() -> list[int]:
data = get_data()
# ... do some processing.
return data
Now, let’s say we want to write a test for the service.process_data
function. Since the
function depends on db.get_data
, we’ll patch the get_data
function and replace it with a
mock object that returns a canned response. This will make sure that calling process
doesn’t invoke the real get_data
which might have side effects that we don’t want to
trigger during test runs. Also, in this case, instead of returning a list of pseudo-random
integers, the replaced get_data
function will deterministically return a list of known
integers.
You could patch get_data
in multiple ways. Here’s the first attempt:
# test_service.py
from unittest.mock import patch
from service import process
# Patching happens here!
@patch("db.get_data", return_value=[1, 2, 3, 4], autospec=True)
def test_process(mock_get_data):
# Call the target function.
result = process()
# Check the result.
assert result == [1, 2, 3, 4]
# Check that get_data was called.
mock_get_data.assert_called_once()
Since get_data
is defined in the db.py
module, we pass db.get_data
to the patch
decorator. Unfortunately, if you run the above test with pytest2, you’ll see that the
test fails with the following error:
test_service.py F [100%]
========== FAILURES ==========
__________ test_process __________
mock_get_data = <function get_data at 0x7fc8b04d6440>
@patch(
"db.get_data",
return_values=[1, 2, 3, 4], autospec=True
)
def test_process(mock_get_data):
# Call the target function.
result = process()
# Check the result.
> assert result == [1, 2, 3, 4]
E assert [184, 112, 189, 135] == [1, 2, 3, 4]
E At index 0 diff: 184 != 1
E Use -v to get more diff
test_src.py:13: AssertionError
========== short test summary info ==========
FAILED test_src.py::test_process
- assert [184, 112, 189, 135] == [1, 2, 3, 4]
========== 1 failed in 0.14s ==========
The original implementation of get_data
returns a list of 4 pseudo-random integers where
the values lie between 100 and 200 whereas our patched version of get_data
always returns
[1, 2, 3, 4]
. So, the test is failing because the get_data
function didn’t get patched
properly and it’s calling the original get_data
function during the test run.
While the function get_data
is defined in the db.py
module, it’s actally used in the
service.py
module. So, we can avoid this missing target issue by patching get_data
in
the location where it’s used; not where it’s defined. Here’s how to do it:
# test_service.py
# Notice how we're patching 'get_data' in the 'service.py' module.
@patch("service.get_data", return_value=[1, 2, 3, 4], autospec=True)
def test_process(mock_get_data):
# ...rest of the test implementation is the same as before.
This time, when you run the tests, pytest doesn’t complain.
Recent posts
- Hierarchical rate limiting with Redis sorted sets
- Dynamic shell variables
- Link blog in a static site
- Running only a single instance of a process
- Function types and single-method interfaces in Go
- SSH saga
- Injecting Pytest fixtures without cluttering test signatures
- Explicit method overriding with @typing.override
- Quicker startup with module-level __getattr__
- Docker mount revisited