CoW Developer docs#1305

Open

jsturtevant wants to merge 2 commits intohyperlight-dev:mainfrom

jsturtevant:CoW-docs

Contributor

jsturtevant commented Mar 12, 2026 •

edited

Loading

Part of #1261

Docs updated from @syntactically's original proposal 3d59400

Signed-off-by: James Sturtevant jsturtevant@gmail.com

jsturtevant requested review from danbugs, dblnz, devigned, jprendes, ludfjig, marosset, simongdavies and syntactically as code owners

March 12, 2026 22:07

jsturtevant added the kind/documentation label

syntactically and others added 2 commits

March 12, 2026 15:09


          Add doc on guest-aided CoW design

cda532c

Signed-off-by: James Sturtevant <jsturtevant@gmail.com>


          Update docs to include some diagrams and current behaviors

f1c6500

Signed-off-by: James Sturtevant <jsturtevant@gmail.com>

jsturtevant force-pushed the CoW-docs branch from 585daca to f1c6500 Compare

March 12, 2026 22:09

syntactically requested changes

View reviewed changes

Member

syntactically left a comment

Sorry, I have some comments here. Mostly fairly minor/nits, but a couple of slightly more significant things. Happy to do a draft as well for some of the ones where I've asked for subsections to be restructured, if that would be useful?

docs/paging-development-notes.md

-              top of memory, is simply a large bag of blank pages: scratch memory
-              into which this VM can write.
+              physical address space, which are always populated: one, near the
+              bottom of memory (starting at GPA `0x1000`), is a

Member

syntactically Mar 16, 2026

The specific base address is sort of a nasty implementation detail that might go away at some point anyway. Morally, it doesn't matter where it is and "at the bottom" is really just a general location. I would rather not have this in the high-level docs because it feels like noise to me.

docs/paging-development-notes.md

+              ```
+                Guest Physical Address Space (GPA)
+                +-------------------------------+ MAX_GPA

Member

syntactically Mar 16, 2026

I think this is duplicating a diagram from src/hyperlight_host/src/mem/layout.rs. Maybe we can merge them. I'm ambivalent about whether the merged diagram ends up here or there.

docs/paging-development-notes.md

+                +-------------------------------+ MAX_GPA
+                |    Exn Stack, Bookkeeping     |
+                |    (scratch size, allocator   |

Member

syntactically Mar 16, 2026

Rather than explicitly listing metadata/bookkeeping items, I would just in text reference the layout.rs files (primarily the hyperlight_common one) that have fairly detailed comments on each item along with its offsets.

docs/paging-development-notes.md

+                +-------------------------------+
+                |      Free Scratch Memory      |
+                +-------------------------------+
+                |         Output Data           |

Member

syntactically Mar 16, 2026

Similarly, the io data being here right now is sort of a stupid hack because we don't have the better phys alloc / buffer management from #1112. I would like to avoid having that in this diagram if possible and we can talk about the fact that that is the stopgap in the section below which I see was not fully written in the original commit (sorry).

docs/paging-development-notes.md

               ready supply of already-mapped scratch pages to use for replacement
-              page tables is needed. Currently, the guest accomplishes this by
-              keeping an identity mapping of the entire scratch memory around.
+              page tables is set up by the Host. The guest keeps a mapping of the entire scratch

Member

syntactically Mar 16, 2026

I don't fully understand what this change is trying to say. I prefer the original wording I think.

docs/paging-development-notes.md

-              to be able to take exceptions to it.  The remainder of the top page of
-              the scratch memory is used for this.
+              to be able to take exceptions to it. The exception stack begins at
+              offset `0x20` from the top of the scratch region (below the metadata

Member

syntactically Mar 16, 2026

Again, I would avoid duplicating the actual offset here (too easy to get out of date) and just say below the metadata.

docs/paging-development-notes.md

+              to be able to take exceptions to it. The exception stack begins at
+              offset `0x20` from the top of the scratch region (below the metadata
+              fields described above) and grows downward through the remainder of
+              the top page.

Member

syntactically Mar 16, 2026

Technically I think it's actually 2 pages used for this now.

docs/paging-development-notes.md

-              ignore because we only need 1-2 buffers inflight at a time. We can
-              emulate the current paradigm by recreating a new buffer out of the
-              free space in the original buffer on call, etc etc.
+              I/O buffers are statically allocated at the bottom of the scratch

Member

syntactically Mar 16, 2026

Let's rewrite this to be a little bit more high-level about how buffer management should work, plus details on how it is done right now (I believe there is also some discussion about buffer management in #1112).

docs/paging-development-notes.md

+              +-------------------------------------------+ (scratch base)
+              ```
+              The minimum scratch size (`min_scratch_size()`) accounts for these

Member

syntactically Mar 16, 2026

I'm not sure if this is a useful comment to have in this file. If it's useful to have the details of the min scratch size calculation duplicated in here (my first instinct would be to just link to the layout.rs file that has the calculation explained in detail), it should either be a general statement (e.g. "some architecture-specific processor structures that will be written during initialisation" or explicitly split out for each supported architecture).

docs/paging-development-notes.md

-              collisions between the guest stack and the scratch physical page
-              allocator.
+              In the current startup path, the host enters the guest with
+              `RSP` pointing to the exception stack. Early guest init then

Member

syntactically Mar 16, 2026

Please retain the less-architecture-dependent language of "the stack pointer" rather than "rsp", and consider whether or not it makes sense to retain any useful points/implications of the note about exception stack overflows being difficult to detect (thanks for the rest of the rewrite of this section, which is good).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

syntactically syntactically requested changes

danbugs Awaiting requested review from danbugs danbugs is a code owner

ludfjig Awaiting requested review from ludfjig ludfjig is a code owner

dblnz Awaiting requested review from dblnz dblnz is a code owner

devigned Awaiting requested review from devigned devigned is a code owner

marosset Awaiting requested review from marosset marosset is a code owner

jprendes Awaiting requested review from jprendes jprendes is a code owner

simongdavies Awaiting requested review from simongdavies simongdavies is a code owner

Requested changes must be addressed to merge this pull request.

Labels

kind/documentation