API endpoint

To begin with we are going to need an HTTP API endpoint that our users can send a request to. This endpoint should return the calculated income tax that they owe based on the information they provided in their request to the endpoint.

For now we're going to keep things nice and simple and assume that there are no other sources of income to be considered. So this will be a pretty simple calculation of, given a salary figure how much of that figure will be owed in income tax.

Note that this will be for UK income tax calculations. If you are following along from a different jurisdiction you can just adjust the calculation to suit your needs.

Install the test runner

So before we go any further, we are going to need to install the test runner. If you cast your mind back to the environment setup section, we have done this before.

So lets install the pytest library:

pip install pytest

And then update the dependencies again:

pip freeze > requirements.txt

Now we're ready to start writing our first test.

Note that we've actually missed another dependency that we're gonna need. But we're gonna let our first test guide us and tell us exactly what we need.

Writing our first test

We currently have source code to test. A blank slate if you will. So an integration test is gonna give us the most bang for our buck. Given that we're looking to write an endpoint for our API, it makes sense to write a test against this endpoint.

So lets create a new directory to hold all of our tests:

mkdir -p tests/integration

And then lets create our test file in our new integration folder:

touch tests/integration/test_main.py

In our new test file we can begin to write our very first test for our application.

from http import HTTPStatus

from starlette.testclient import TestClient

from main import app


class TestIncomeTaxesEndpoint:
    def test_calculates_correct_tax(self):
        """
        Given a salary of £33,000
        When a GET request is made to the `income-taxes/` endpoint
        Then the calculated tax owed is £4,086
        """
        # Given
        test_client = TestClient(app=app)
        salary = 33_000

        # When
        response = test_client.get("/income-taxes", params={"salary": salary})

        # Then
        assert response.status_code == HTTPStatus.OK
        assert response.json() == {"tax owed": 4_086}

We probably shouldn't be relying on raw integers or floating point numbers but we can put this to one side for now.

On line 3, we import the TestClient class from the starlette library which was installed within our virtual environment when installed fastapi. Most modern frameworks come bundled with a client that we can use to interact with our own API endpoints under test conditions. For this particular client, the TestCient class takes an argument of the FastAPI application instance itself, which we have not yet defined.

On line 20, we are saying that we want to use our TestClient to make a GET request to the income-taxes/ endpoint. Which has also not been defined yet. We are also saying that we are including the "salary" query parameter in the request. So the URL path of this request would look like the following:

/income-taxes?salary=33000

On line number 23, we verify that we received an HTTP 200 OK response. Note that the enum HTTPStatus.OK is equivalent in this case to 200. The use of the enum is purely for readability purposes.

And finally on line 24, we check the output calculation provided by our endpoint.

In writing the test first, we focus on the design of the endpoint. The interface to our code and how we want to interact with it. This frees us from thinking about the technical details until later.

If we run this test as is, we'd be expecting to see our old friend the ImportError since the app has not been defined yet. But that is not the case here:


test_main.py:None (test_main.py)
../../.venv/lib/python3.12/site-packages/starlette/testclient.py:33: in <module>
    import httpx
E   ModuleNotFoundError: No module named 'httpx'

During handling of the above exception, another exception occurred:
test_main.py:3: in <module>
    from starlette.testclient import TestClient
../../.venv/lib/python3.12/site-packages/starlette/testclient.py:35: in <module>
    raise RuntimeError(
E   RuntimeError: The starlette.testclient module requires the httpx package to be installed.
E   You can install this with:
E       $ pip install httpx

On line number 12 of the stacktrace we can see the error message that tells us exactly what the problem is. The TestClient class depends on the httpx package which we don't yet have.

In fact, we also missed this in the documentation.

So lets run the command as dictated to us by the stracktrace to get the httpx package:

pip install httpx

Once this is installed, we should freeze our pip dependencies again and commit them to version control.

Getting to green as quick as possible

To begin with we should create the file for our source code. For now, we won't too much about project structure. That will come later.

touch main.py

In this file we can set up the app instance which is consumed by the TestClient as well as the endpoint that we have already written the test against.

So our goal here is to get to the green state i.e. where the test is passing as quick as possible. We're not actually too worried about implementation right now. That will also come later. For now we just want the minimal and least path of resistance back to the safe ground of a passing test.

So in our new main.py:

from fastapi import FastAPI

app = FastAPI()


@app.get(path="/income-taxes")
def calculate_income_taxes(salary: float) -> dict[str, int]:
    return {"tax owed": 4_086}

On line 1 we import the FastAPI class and instantiate it on line number 3. This is the app instance which encompasses our server and is the same one that is also used by the TestClient.

FastAPI has its own specifc way of adding endpoints. On line 6 we use the app instance in a decorator with the get method. This indicates that this function represents the logic to be used for this GET endpoint. The path argument as you might have guessed represents the new endpoint path name.

FastAPI is a little different to other frameworks whereby it uses our function definition as part of the OpenAPI documentation and as part of the endpoint itself. So the arguments to the function form the query parameters to the endpoint.

And finally, on line 7 we do something pretty unusual. We simply always return the answer which would satisfy our test. Of course, if we used any input other than the one declared by the test this endpoint wouldn't be fit for purpose. But remember, we just want to see the test pass first. We'll come back to rethink this implementation once we're on the safe ground of passing tests.

If we now run this test as is, we will see it passes!

Subject matter expertise

Now that we're back on safe ground, we can think about how we want to actually implement our calculation.

This is going to require some subject matter expertise. As of March 2024, the income tax bands in the UK look like the following:

Band

Taxable income

Tax rate

Personal Allowance

Up to £12,570

Basic rate

£12,571 to £50,270

20%

Higher rate

£50,271 to £125,140

40%

Additional rate

over £125,140

45%

There are some gotchyas to be aware of when it comes to dealing with personal allowance at the higher income rates, but we'll deal with that later.

For now we can take the information given to us and translate that into some business logic that we can apply.

Refactoring

Armed with our subject matter knowledge we can go back and write something which will calculate the income tax for us for our given input salary of £33,000 instead of the somewhat useless hardcoded value we are currently returning:

from fastapi import FastAPI

app = FastAPI()


@app.get(path="/income-taxes")
def calculate_income_taxes(salary: float) -> dict[str, float]:
    tax_free_allowance = 12_570
    tax_owed = (salary - tax_free_allowance) * 0.2

    return {"tax owed": tax_owed}

This is of course still a little naive but it does provide us with basis to work from.

If we run our test now we will see it passes again.

Brownie points if you can spot the bug we've introduced above!

Seperating responsibilities

Before we start writing more test cases and developing our endpoint further, we should stop and take stock of the hole we've just dug ourselves into.

Our API endpoint is doing too much by itself. Right now we cannot test the business logic of the calculation by itself. To test it we are forced to hit the endpoint. Ideally the API layer of our system should only be handling serialization and validation. It should take some given input, validate it and then pass that input over to some other part of our system for the actual calculation. This way we would be free to make changes to the calculation without having to change things at the API layer.

from fastapi import FastAPI

app = FastAPI()


@app.get(path="/income-taxes")
def calculate_income_taxes(salary: float) -> dict[str, float]:
    tax_owed: float = calculate_income_tax_owed(salary=salary)
    return {"tax owed": tax_owed}


def calculate_income_tax_owed(salary: float) -> float:
    tax_free_allowance = 12_570
    return (salary - tax_free_allowance) * 0.2

See the difference now?

The endpoint code doesn't need to know about the details of how to do the calculation, that is the responsibility of the new function we've just defined on lines 12-14.

This means we can also write tests directly against this new function and not have to pass through the API layer every time. You can imagine if we had some form of authentication in front of the API, then this would be even more sluggish than it already is right now.

In a bigger project we would probably move the calculate_income_tax_owed() function into a seperate file in another location too. Perhaps in a dedicated part of the project where domain-level logic lived. Our project is a little too small to warrant this for now.

Layering in more functionality

So we know that our current calculation approach will handle the basic rate banding for us. We also know that we've completely neglected the next few bandings. And we've left a bug in place! But lets take this 1 step at a time. We should now write some tests for the higher rate banding. This is for salaries between £50k and £125k. This time we are going to introduce the idea of parametrization to our tests.

Writing a parametrized test

We can provide parameters in the form of expected inputs as well as outputs. Other languages also have this capability for example Golang calls it table-driven tests. Parametrization allows us to send any combination of values into our test:

from http import HTTPStatus

import pytest
from starlette.testclient import TestClient

from main import app, calculate_income_tax_owed

...

class TestCalculateIncomeTaxOwed:
    @pytest.mark.parametrize(
        "salary, expected_tax_owed",
        (
            [60_000, 11_432],
            [65_000, 13_432],
        ),
    )
    def test_calculates_for_higher_rate_banding(
        self, salary: int, expected_tax_owed: float
    ):
        """
        Given a salary which is in the higher rate banding
        When `calculate_income_tax_owed()` is called
        Then the correct calculated tax owed is returned
        """
        # Given
        input_salary = salary

        # When
        calculated_tax_owed: float = calculate_income_tax_owed(salary=input_salary)

        # Then
        assert calculated_tax_owed == expected_tax_owed

In line 2 we use the pytest.mark.parametrize decorator from the pytest library. The first positional argument on line 3 is the argnames this represents the name of the arguments which will be passed into the test. And the 2nd positional argument provided to the decorator which starts on line 4 and ends on line 7 are the argvalues. These are the values associated with each argument which gets passed into our test.

On line 19, we define the parameter names to the test function so that they are made available to the test.

So instead of writing 2 seperate tests we can reduce duplication and quickly fire as many combinations as we want.

Now you're probably wondering why we didn't just use this technique earlier so that we could have 1 test with every combination across every salary banding we could ever need. And that would be a valid concern, lots of engineers would favour that approach. This can feel as though it favours deduplication over structure and readibility. Testing how the calculation performs against say the higher rate banding is a different problem than the lower rate banding. Even our source code for that function might end up with a conditional branch to handle that logic.

The other added benefit to keeping the tests seperate is that the final result will read more like a specification.

We could end up with 1 test for lower rate bandings, another for higher rate taxpayers and another for additional rate taxpayers.

This feels much more aligned with the actual source code and provides a clear distinction and dilenation between the salary bandings.

You might argue that this would result in missed opportunity to keep our test code deduplicated, but that's okay. Reducing duplication in our test code shouldn't be at the top of our agenda when writing tests.

The other issue with grouping them all together under 1 large parametrized test would be that debugging it would become trickier. Lets say a test case for an additional rate banding started to fail. The test as a whole would report a failure and we would have to pick out the combination which incurred the failure.

Keeping tests seperated by logical grouping helps provide granularity to our test suite. If a piece of functionality stops working as expected then the failure is localized and easier to pinpoint.

There is however nothing to stop us from peppering our parametrized test with combinations which sit within the same contextual group. i.e. different higher rate salaries. If anything that would be encouraged.

Implementing the solution

With our parametrized test in place, we can head back over to the source code and take the next step of the implementation:

...

def calculate_income_tax_owed(salary: float) -> float:
    tax_free_allowance = 12_570
    basic_rate_threshold = 50_270

    if salary <= basic_rate_threshold:
        return (salary - tax_free_allowance) * 0.20

    return (basic_rate_threshold - tax_free_allowance) * 0.20 + (
        salary - basic_rate_threshold
    ) * 0.40

This time around we want to maintain the logic we had before. We can do a conditional check first to see if the given salary sits in the lower tax payer threshold or the higher banding. If it is the latter, we can perform the next calculation required for the higher banding. This is the conditional branch of logic we discussed earlier.

Again, this is not perfect. It still has a major bug that we've not yet resolved and it doesn't meet all our possible cases. But it gives us enough to get back to the safety of passing tests where we can plot our next move.

If we run this as is, we can see both of our test cases in our parametrized test will pass:

============================= test session starts ==============================
collecting ... collected 2 items

test_main.py::TestCalculateIncomeTaxOwed::test_calculates_for_higher_rate_banding[60000-11432] PASSED [ 50%]
test_main.py::TestCalculateIncomeTaxOwed::test_calculates_for_higher_rate_banding[65000-13432] PASSED [100%]

============================== 2 passed in 0.28s ===============================

Process finished with exit code 0

You can find the code for this chapter at the Github repo.

References

PreviousFramework NextApplication server

Last updated 1 year ago