django-auto-prefetch

Automatically prefetch foreign key values as needed.

Purpose

When you access a ForeignKey or OneToOneField (including in reverse) on a model instance and the field’s value has not yet been loaded, auto-prefetch fetches that field for all model instances loaded by the same QuerySet as the current instance. This is enabled at the model level and is fully automatic and transparent to users of the model.

Requirements

Python 3.9 to 3.13 supported.

Django 4.2 to 5.1 supported.

Usage

Install with python -m pip install django-auto-prefetch.

Change all these imports from django.db.models to auto_prefetch:

ForeignKey
Manager
Model - including inheriting Meta from auto_prefetch.Model.Meta
OneToOneField
QuerySet

If you use custom subclasses of any of these classes, you should be able to swap for the auto_prefetch versions in your subclasses’ bases.

For example, if you had:

from django.db import models


class Book(models.Model):
    author = models.ForeignKey("Author", on_delete=models.CASCADE)

    class Meta:
        verbose_name = "Book"

…swap to:

import auto_prefetch
from django.db import models


class Book(auto_prefetch.Model):
    author = auto_prefetch.ForeignKey("Author", on_delete=models.CASCADE)

    class Meta(auto_prefetch.Model.Meta):
        verbose_name = "Book"

Run python manage.py makemigrations to generate migrations for all the models you modified. These migrations will set the Meta.base_manager_name option to prefetch_manager for every model that you’ve converted. This change ensures that auto-prefetching happens on related managers. Such migrations do not change anything in the database.

(If you instead set Meta.base_manager_name on your models yourself, make sure the manager it names inherits from auto_prefetch.Manager.)

Background and Rationale

Currently when accessing an uncached foreign key field, Django will automatically fetch the missing value from the database. When this occurs in a loop it creates 1+N query problems. Consider the following snippet:

for choice in Choice.objects.all():
    print(choice.question.question_text, ":", choice.choice_text)

This will do one query for the choices and then one query per choice to get that choice’s question.

This behavior can be avoided with correct application of prefetch_related() like this:

for choice in Choice.objects.prefetch_related("question"):
    print(choice.question.question_text, ":", choice.choice_text)

This has several usability issues, notably:

Less experienced users are generally not aware that it’s necessary.
Seemingly cosmetic changes to things like templates can change the fields that should be prefetched.
Related to that, the code that requires the prefetch_related() (e.g. the template) may be quite removed from where the prefetch_related() needs to be applied (e.g. the view).
Subsequently finding where prefetch_related() / select_related() calls are missing is non-trivial and needs to be done on an ongoing basis.
Excess entries in prefetch_related() calls are even harder to find and result in unnecessary database queries.
It is very difficult for libraries like the admin and Django REST Framework to automatically generate correct prefetch_related() clauses.

On the first iteration of the loop in the example above, when we first access a choice’s question field, instead of fetching the question for just that choice, auto-prefetch will speculatively fetch the questions for all the choices returned by the QuerySet. This change results in the first snippet having the same database behavior as the second while reducing or eliminating all of the noted usability issues.
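
The effect on query counts can be modeled without Django. The following is a toy sketch, not the library’s implementation: FakeDB, Choice, load_choices, and count_queries are hypothetical stand-ins, and the list of peers is a simplified assumption about how instances from one QuerySet might know about each other.

```python
class FakeDB:
    """Hypothetical stand-in for the database that counts queries."""

    def __init__(self, questions):
        self.questions = questions  # maps question id -> question text
        self.queries = 0

    def get_many(self, ids):
        self.queries += 1  # one query, however many ids are requested
        return {i: self.questions[i] for i in ids}


class Choice:
    def __init__(self, db, text, question_id, auto_prefetch):
        self.db = db
        self.text = text
        self.question_id = question_id
        self.auto_prefetch = auto_prefetch
        self.peers = [self]  # replaced by load_choices()
        self._question = None  # not loaded yet

    @property
    def question(self):
        if self._question is None:
            # With auto-prefetch, load the value for every peer at once.
            targets = self.peers if self.auto_prefetch else [self]
            missing = [c for c in targets if c._question is None]
            found = self.db.get_many({c.question_id for c in missing})
            for c in missing:
                c._question = found[c.question_id]
        return self._question


def load_choices(db, rows, auto_prefetch):
    db.queries += 1  # the query that loads the choices themselves
    choices = [Choice(db, text, qid, auto_prefetch) for text, qid in rows]
    for c in choices:
        c.peers = choices  # everything loaded together is a peer
    return choices


def count_queries(auto_prefetch):
    db = FakeDB({1: "Favourite colour?", 2: "Favourite animal?"})
    rows = [("Red", 1), ("Blue", 1), ("Cat", 2), ("Dog", 2)]
    for choice in load_choices(db, rows, auto_prefetch):
        _ = (choice.question, choice.text)
    return db.queries


plain_queries = count_queries(auto_prefetch=False)  # 1 + one per choice
prefetch_queries = count_queries(auto_prefetch=True)  # 1 + one prefetch
```

In this model the plain loop issues five queries for four choices, while the peer-aware version issues two, matching the prefetch_related() snippet.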

Some important points:

ManyToManyFields are not changed at all.
Because these are ForeignKey and OneToOneFields, the generated queries can’t have more result rows than the original query and may have fewer. This eliminates any concern about a multiplicative query size explosion.
This feature will never result in more database queries, as a prefetch will only be issued where the ORM was already going to fetch a single related object.
Because it is triggered by fetching missing related objects it will not at all change the DB behavior of code which is fully covered by prefetch_related() and/or select_related() calls.
This will inherently chain across relations like choice.question.author. The conditions above still hold under such chaining.
In some rare situations it may result in larger data transfer between the database and Django (see below).

An example of that last point is:

qs = Choice.objects.all()
list(qs)[0].question

Such examples generally seem to be rarer and more likely to be visible during code inspection (vs {{ choice.question }} in a template). And larger queries are usually a better failure mode than producing hundreds of queries. For this to actually produce inferior behavior in practice you need to:

fetch a large number of choices
filter out basically all of them
…in a way that prevents garbage collection of the discarded ones

If any of those aren’t true, then automatic prefetching will still produce database behavior equivalent to or better than what you would get without it.
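
The role of garbage collection can be illustrated with a small model. This sketch assumes that peers are tracked through weak references, which is consistent with the garbage collection remark above but is an assumption here, not a description of the library’s code. Row, load, and prefetch_size are hypothetical names.

```python
import gc
import weakref


class Row:
    """Hypothetical stand-in for a model instance."""

    def __init__(self, pk, question_id):
        self.pk = pk
        self.question_id = question_id


def load(n):
    """Simulate evaluating a QuerySet: n rows sharing one weak peer set."""
    rows = [Row(pk, question_id=pk) for pk in range(n)]
    peers = weakref.WeakSet(rows)  # does not keep the rows alive
    for row in rows:
        row.peers = peers
    return rows


def prefetch_size(row):
    """Number of related objects a prefetch triggered from `row` would request."""
    return len({peer.question_id for peer in row.peers})


# Case 1: the whole result list is still referenced.
rows = load(1000)
first = rows[0]
size_while_alive = prefetch_size(first)

# Case 2: only one instance is kept; the others can be collected.
del rows
gc.collect()
size_after_gc = prefetch_size(first)
```

While the full list is alive, a prefetch in this model would request all 1000 related objects; once the other rows are collected it would request only one.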

P.S.

If you have concerns, go look at the code; it’s all in auto_prefetch/__init__.py and is fairly short.

Release history

1.11.0 (this version): Oct 29, 2024
1.10.0: Oct 8, 2024
1.9.0: Jun 19, 2024
1.8.0: Oct 11, 2023
1.7.0: Jul 10, 2023
1.6.0: Jun 14, 2023
1.5.1: Mar 29, 2023
1.5.0: Feb 25, 2023
1.4.0: Nov 9, 2022
1.3.0: Jun 5, 2022
1.2.0: May 10, 2022
1.1.0: Jan 10, 2022
1.0.0: Oct 5, 2021
0.1.1: Sep 28, 2021
0.1.0: Sep 28, 2021
0.0.6: Jul 29, 2020
0.0.5: Apr 5, 2020
0.0.4: Mar 26, 2020
0.0.3: Mar 26, 2020
0.0.2: Mar 25, 2020
0.0.1: Mar 24, 2020
