
Add a ResourceBudget mechanism which keeps disk usage in check during syncs#7649

Open
dralley wants to merge 1 commit intopulp:mainfrom
dralley:resource-budget

Conversation


@dralley dralley commented Apr 27, 2026

📜 Checklist

  • Commits are cleanly separated with meaningful messages (simple features and bug fixes should be squashed to one commit)
  • A changelog entry or entries has been added for any significant changes
  • Follows the Pulp policy on AI Usage
  • (For new features) - User documentation and test coverage has been added

See: Pull Request Walkthrough

Comment thread docs/admin/reference/settings.md
@dralley dralley force-pushed the resource-budget branch from 3b86771 to 7ba48f8 Compare May 1, 2026 03:13

dralley commented May 8, 2026

@balasankarc - Relevant to your previous pull requests


Defaults to 5120 (5 GB)

### SYNC_MAX_IN_FLIGHT_ITEMS
Contributor Author

Actually the unit of measurement here is content, not artifacts, so this description is a little misleading - but it's a little difficult to describe.

@dralley dralley force-pushed the resource-budget branch from 7ba48f8 to 2b98d9b Compare May 8, 2026 04:09
@dralley dralley marked this pull request as ready for review May 8, 2026 04:10
@dralley dralley requested review from mdellweg and pedro-psb May 8, 2026 04:10
Comment thread pulpcore/app/settings.py
```diff
 DOMAIN_ENABLED = False

-MAX_CONCURRENT_CONTENT = 25
+MAX_CONCURRENT_CONTENT = 200
```
Contributor Author

Reset this back to where it was before we reduced it to limit worst-case scenarios

Artifacts stay on disk between the ArtifactDownloader stage and the
ArtifactSaver stage. If too many large files build up, it can exceed the
allotted filesystem space of the working directory.

Previously we used unnecessarily small batch sizes by default in order to
ensure the worst case was avoided.

This approach dynamically controls how much disk space is being used by
the task and provides backpressure when the limit is exceeded, flushing
batches and preventing new artifacts from being downloaded.

closes pulp#7559
Assisted-By: claude-opus-4.6
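The mechanism the commit message describes could be sketched roughly as follows. This is a hypothetical reconstruction, not the PR's actual code: the names (`max_bytes`, `max_items`, `used`) and the `acquire()`/`release()` signatures are assumptions. `acquire()` blocks new downloads while the budget is exhausted and sets a `pressure` event to ask batching stages to flush; `release()` returns space and clears the event.

```python
import asyncio

class ResourceBudget:
    """Disk-usage budget with backpressure (hypothetical sketch; the
    real class in this PR may differ in names and details)."""

    def __init__(self, max_bytes, max_items=200):
        self.max_bytes = max_bytes
        self.max_items = max_items  # batch-size hint for downstream stages
        self.used = 0
        # Set while the budget is exceeded; batching stages listen on
        # this event to flush early and free up disk space.
        self.pressure = asyncio.Event()
        self._freed = asyncio.Condition()

    async def acquire(self, nbytes):
        """Reserve nbytes before downloading; blocks while over budget."""
        async with self._freed:
            # Always admit at least one item, so a single oversized
            # artifact cannot deadlock the pipeline.
            while self.used > 0 and self.used + nbytes > self.max_bytes:
                self.pressure.set()  # ask downstream stages to flush
                await self._freed.wait()
            self.used += nbytes

    async def release(self, nbytes):
        """Return nbytes once the artifact has been saved and cleaned up."""
        async with self._freed:
            self.used -= nbytes
            if self.used <= self.max_bytes:
                self.pressure.clear()
            self._freed.notify_all()
```

Clearing `pressure` inside `release()` rather than in the batching stage keeps the event's lifecycle owned by the budget itself.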
@dralley dralley force-pushed the resource-budget branch from 2b98d9b to 1c0e255 Compare May 8, 2026 04:17
"""
async for batch in self.batches(minsize=settings.MAX_CONCURRENT_CONTENT):
flush_event = self.resource_budget.pressure if self.resource_budget else None
minsize = self.resource_budget.max_items if self.resource_budget else 200
@dralley dralley (Contributor Author) May 8, 2026

This is a little weird: on one hand, the resource budget ought to manage this completely; on the other hand, it's hard to see any real downside to setting a reasonably large batch size by default. The batch size is going to exist anyway (it defaults to 500 if you don't set it manually).

```python
        await self.put(d_content)


class GenericDownloader(Stage):
```
@dralley dralley (Contributor Author) May 8, 2026

There's one open question, which is whether max_concurrent_content handling in GenericDownloader ought to be removed entirely.

ResourceBudget accomplishes the same goal but slightly differently. max_concurrent_content blocks items from being pulled from the queue, before the asyncio.Task is created, while ResourceBudget blocks on acquire() after the task has been created but before it does work.

So I suppose ResourceBudget might accumulate asyncio.Task objects, which might waste a little bit of memory but probably doesn't matter in practice? I'm not that familiar with these aspects of the pipeline or asyncio, though.

Another option is to just set a fairly permissive cap as a safety net and otherwise let ResourceBudget handle it.
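The two gating styles under discussion can be illustrated side by side. This is a hypothetical sketch, not pulpcore's code: `gated_spawn` and `eager_spawn` are invented names. Style 1 acquires a slot before the `asyncio.Task` is created, so Task objects never pile up; style 2 creates the Task eagerly and blocks inside it, so idle Tasks can accumulate while waiting (a small memory cost).

```python
import asyncio

async def gated_spawn(sem, coro_fn, item):
    """Style 1 (max_concurrent_content): acquire before creating the Task.
    Nothing is pulled off the queue until a slot is free."""
    await sem.acquire()
    task = asyncio.ensure_future(coro_fn(item))
    task.add_done_callback(lambda _t: sem.release())
    return task

def eager_spawn(sem, coro_fn, item):
    """Style 2 (ResourceBudget): create the Task immediately, then
    block on acquire() inside it before doing any work."""
    async def run():
        async with sem:
            return await coro_fn(item)
    return asyncio.ensure_future(run())
```

Both bound the number of items doing work at once; they differ only in where the waiting happens, which is why a permissive outer cap as a safety net composes cleanly with either.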

Member

And this is not even the same as max_concurrent_download (handled inside aiohttp), right?

@dralley dralley (Contributor Author) May 8, 2026

I assume you mean either download_concurrency or rate_limit, but yes.

Lots of these primitives are very closely related but not 100% identical and we could probably do with simplifying them.

I think, IIRC, rate_limit throttles the number of total simultaneous downloads and download_concurrency is the number of simultaneous connections allowed to any single host.

MAX_CONCURRENT_CONTENT / max_concurrent_content is "don't buffer too many items inside the download stage at once"

Also, the former operate on individual downloads (artifacts), while the latter operates on content, which may be multi-artifact.

The net impact of these is similar but not 100% identical.
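The distinction described above (a global cap on simultaneous downloads versus a per-host cap on simultaneous connections) could be illustrated like this. This is a hypothetical sketch for illustration only; `DownloadThrottles` is an invented name, and the real pulpcore downloader wires these limits differently (partly inside aiohttp).

```python
import asyncio
from collections import defaultdict
from urllib.parse import urlparse

class DownloadThrottles:
    """Two stacked limits: a global cap on total simultaneous
    downloads, and a per-host cap on connections to any single host."""

    def __init__(self, rate_limit=20, download_concurrency=5):
        self._global = asyncio.Semaphore(rate_limit)  # total downloads
        self._per_host = defaultdict(
            lambda: asyncio.Semaphore(download_concurrency)  # per host
        )

    async def download(self, url, fetch):
        host = urlparse(url).netloc
        # Both semaphores must be held for the duration of the download.
        async with self._global, self._per_host[host]:
            return await fetch(url)
```

Neither limit says anything about how many items are buffered inside the stage, which is the separate job of `MAX_CONCURRENT_CONTENT` / the resource budget.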

Comment on lines +145 to +149
```python
if flush_event_listener and flush_event_listener in done:
    # Don't re-arm until after we yield a batch, to avoid a spin loop
    # when the event stays set but the batch is empty.
    flush_event_listener = None
    no_block = True
```
Member

What is the fundamental difference here between the two events?
If I read it correctly, the thaw_event is cleared right before yielding.
Can these events be consolidated into one?

Contributor Author

I'm not sure they can; I think they have different lifetimes, and the way they get re-armed is different.

```python
no_block = False
# Re-arm the flush listener after yielding
if flush_event and flush_event_listener is None:
    flush_event_listener = asyncio.ensure_future(flush_event.wait())
```
Member

Don't you need to .clear() the event to "rearm" it?

Contributor Author

It gets cleared in release(); self.pressure and flush_event are the same event.
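The re-arm protocol discussed in this thread can be sketched as a standalone batching loop. This is a hypothetical reconstruction of the pattern shown in the diff hunks, not the PR's actual `batches()` implementation: yield a batch when it reaches `minsize`, or early when the flush (pressure) event fires; drop the listener once it fires and only re-arm after a batch is yielded, so the loop cannot spin while the event stays set but no new items arrive.

```python
import asyncio

async def batches(queue, minsize, flush_event):
    """Yield batches from queue; a None item is the end-of-stream sentinel."""
    batch = []
    listener = asyncio.ensure_future(flush_event.wait())
    while True:
        getter = asyncio.ensure_future(queue.get())
        waiters = {getter} | ({listener} if listener else set())
        done, _ = await asyncio.wait(
            waiters, return_when=asyncio.FIRST_COMPLETED
        )
        flushed = listener is not None and listener in done
        if flushed:
            # Don't re-arm until after we yield a batch, to avoid a spin
            # loop when the event stays set but the batch is empty.
            listener = None
        if getter.done():
            item = getter.result()
            if item is None:  # sentinel: end of stream
                if batch:
                    yield batch
                if listener:
                    listener.cancel()
                return
            batch.append(item)
        else:
            getter.cancel()  # woken only by the flush event
        if batch and (len(batch) >= minsize or flushed):
            yield batch
            batch = []
            # Re-arm the flush listener after yielding.
            if listener is None:
                listener = asyncio.ensure_future(flush_event.wait())
```

In this sketch the consumer (standing in for the budget's `release()`) is responsible for clearing the event, which is why the listener and the event have different lifetimes: the event outlives each firing, while the listener future is consumed and recreated per flush.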


3 participants