docs(trustless-gateway): Add p2p usage section #520
base: main
Conversation
🚀 Build Preview on IPFS ready
Clarify caveats around utilizing trustless gateways within p2p networks
Force-pushed from ab86146 to f7daba8
…nale

Expand the P2P usage section with detailed rationale for HTTP/2 and TLS requirements. Since this is in "Appendix: Notes for implementers", we provide extra context explaining why these are SHOULD requirements:

- HTTP/2: explain head-of-line blocking, connection overhead, and impact on DAG fetching with many block requests
- TLS: clarify privacy vs integrity (multihash provides integrity, TLS provides confidentiality and is required for HTTP/2 in browsers)
- Add security considerations for both gateway operators and clients
- Include practical defaults (30s timeout, 2MiB blocks)
- Reference RFC 9113 for HTTP/2 specifications

Addresses #519 request to document why HTTP/2 is recommended rather than just stating the requirement.
> :::note
> Blocks larger than 2MiB can cause memory pressure on resource-constrained clients and increase the window for incomplete transfers. Since blocks must be validated as a unit, smaller blocks allow for more granular verification and easier retries on failure.
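The "validated as a unit" point can be made concrete: a client fetching raw blocks cannot trust any byte of a block until the whole block has been received and hashed. A minimal sketch in Python, assuming sha2-256 digests and the 2MiB limit from the note above (the helper name and the hard error on oversize blocks are illustrative choices, not spec requirements):

```python
import hashlib

MAX_BLOCK_SIZE = 2 * 1024 * 1024  # 2 MiB soft limit discussed in the note

def verify_block(block: bytes, expected_sha256: bytes) -> bool:
    """Verify a raw block against its expected sha2-256 digest.

    Blocks are validated as a unit: the full block must be buffered
    and hashed before any of it can be trusted, which is why smaller
    blocks mean more granular verification and cheaper retries.
    """
    if len(block) > MAX_BLOCK_SIZE:
        # Illustrative policy: refuse oversize blocks outright rather
        # than risk memory pressure on constrained clients.
        raise ValueError("block exceeds 2 MiB limit")
    return hashlib.sha256(block).digest() == expected_sha256
```

On a failed comparison the client discards the whole block and retries, so with smaller blocks a corrupted transfer wastes proportionally less bandwidth.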
As alluded to above, there are a whole host of issues beyond just these, some of which are bigger deals. Notably, working with larger blocks is, in general, unfortunately not yet ecosystem-safe. If you don't know what you're doing (i.e. you fall under the category of overriding the RECOMMENDED / SHOULD), you're likely to cause yourself problems by building tooling that doesn't work with much else in the ecosystem.
Also, the DoS risks and the latency / excessive-bandwidth-consumption tradeoffs generally become more difficult.
Given how often the large-block discussion comes up, I improved this :::note with the extra context you suggested and added a link to https://discuss.ipfs.tech/t/supporting-large-ipld-blocks/15093/ in 41a766d
> Trustless Gateways serve two primary deployment models:
>
> 1. **Verifiable bridges**: Gateways that provide trustless access from HTTP clients into IPFS networks, where the gateway operator is distinct from content providers
> 2. **P2P retrieval endpoints**: Gateways embedded within P2P networks where they serve as HTTP interfaces to peer-operated block stores
nit: I get we're stuck with this phrasing to some extent, but IMO if we can shy away from referring to implementations of the non-recursive trustless IPFS HTTP gateway API used in a p2p network as gateways that'd be great. They're not really gateways into the network as much as part of it 😅.
I've been using "HTTP providers" while talking with FIL folks, to make it clear when I mean non-recursive gateway, maybe introduce that term here?
Pushed in 5fabd52
> To work around this limitation, clients must open multiple parallel TCP connections to achieve concurrent requests. However, each additional connection incurs significant overhead: TCP handshake latency, memory buffers, bandwidth competition, and increased implementation complexity. Browsers limit concurrent connections per origin (typically 6-8) to manage these costs, but this limitation affects all HTTP/1.1 clients, not just browsers, as the overhead of maintaining many connections becomes prohibitive.
>
> When fetching a DAG that requires many block requests, HTTP/1.1's lack of multiplexing creates a critical bottleneck. Clients face a difficult trade-off: either serialize requests (severely limiting throughput) or maintain many parallel connections (incurring substantial overhead). Users may experience acceptable performance with small test cases, but real-world IPFS content with deep DAG structures will encounter significant slowdowns. HTTP/2's stream multiplexing (:cite[rfc9113]) eliminates this bottleneck by allowing many concurrent requests over a single connection without head-of-line blocking at the application layer.
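The trade-off described in that paragraph can be put in back-of-the-envelope terms. A rough model, assuming every request costs one round trip and at most `max_parallel` requests are in flight at once (the 6-connection figure is the browser-typical limit mentioned above; the 600-block DAG and 100-stream HTTP/2 budget are hypothetical numbers for illustration):

```python
def fetch_rounds(num_blocks: int, max_parallel: int) -> int:
    """Sequential round trips needed when at most `max_parallel`
    requests can be outstanding simultaneously."""
    return -(-num_blocks // max_parallel)  # ceiling division

# Hypothetical DAG requiring 600 block requests:
#   HTTP/1.1, browser-typical 6 connections  -> fetch_rounds(600, 6)   = 100 rounds
#   HTTP/2, e.g. 100 concurrent streams on
#   a single multiplexed connection          -> fetch_rounds(600, 100) = 6 rounds
```

At 50ms RTT that hypothetical difference is roughly 5s versus 0.3s of pure latency, before any connection-setup overhead is counted, which is why deep DAGs expose the bottleneck that small test cases hide.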
This is true, but even when not relying on block requests (e.g. CARs with queries sufficient to describe what the client needs), there are other constraints within p2p networks that can cause issues here too (e.g. optimistic queries, which also eat up requests).
Hm.. added some extra context in 41a766d
> Trustless Gateways operating in P2P contexts SHOULD NOT recursively search for content.
>
> In P2P networks, gateways typically serve as block stores for specific peers or content, rather than attempting to locate content across the entire network. Recursive content discovery is handled by the P2P layer (e.g., Amino DHT, IPFS routing), not by individual HTTP gateways.
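For context on what "not recursive" means in practice: a block fetch against such an endpoint is a single plain HTTP request for one CID, using the raw-block response type from the trustless gateway spec. A sketch, with the helper function and example hostname being illustrative:

```python
def raw_block_request(gateway_base: str, cid: str) -> tuple[str, dict]:
    """Build the URL and headers for fetching one raw block.

    The endpoint returns exactly the block named by the CID; it does
    not walk the DAG or search for content elsewhere. Discovering
    *which* peer to ask is the routing layer's job, not this
    endpoint's.
    """
    url = f"{gateway_base.rstrip('/')}/ipfs/{cid}"
    headers = {"Accept": "application/vnd.ipld.raw"}
    return url, headers
```

Fetching a whole DAG is then the client's loop: request a block, verify it, parse out child links, and repeat, which is where the HTTP/2 multiplexing discussion above comes in.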
> Recursive content discovery

This isn't quite right. Recursion implies it happens again (e.g. a gateway that returns data itself doing data lookups, or a routing endpoint itself doing routing lookups); the Amino DHT, etc., don't do that. Additionally, as this is a longer justification, it should probably move into the note section as well.
Does this sentence need to exist, given that the note below seems to cover the same content anyway?
Simplified in c819537
- fix block size rationale: UnixFS → Bitswap/ecosystem, add link to discuss.ipfs.tech thread on large blocks
- broaden HTTP/2 multiplexing examples to include CAR queries and optimistic HEAD/GET probes
- fix "recursive" terminology in content discovery paragraph
move rationale from main body to note, keeping only normative SHOULD statements in main body for scannability
clarify that P2P retrieval endpoints are network participants, not bridges, addressing review feedback about "gateway" semantics
…ty section

- clarify h2c as "HTTP/2 over cleartext TCP, without TLS"
- rename section headers to "HTTP Servers" and "HTTP Clients"
- add explicit MUST/SHOULD to each security requirement for testability
- transport security is MUST, block validation is MUST, rest are SHOULD
@aschmahmann I pushed some final editorial changes based on feedback; take a look.
If it looks fine, we can merge.
Clarify caveats around utilizing trustless gateways within p2p networks.
Closes #519
cc @lidel