Recursive preload in js-ipfs is fetching MBs of refs data #67

olizilla · 2020-08-14T11:27:36Z

The first data-set listed on the IPLD Explorer is the "Project Apollo Archives" which is ~57GB... when we fetch the directory listing, everything works as expected, but in the background js-ipfs hits the preload nodes to recursively fetch the entire dag.

That's not great, as ipld explorer is built around the idea that you can fetch arbitrarily huge datasets by only fetch thing subset that you browse... as it stands multiple preload nodes fetch the entire data set and just the refs response alone means ~13MB of json listing out all the refs in the dag is streamed to th browser. For larger datasets the problem would get worse.

It's a nice optimisation for js-ipfs to recursively preload the entire dag, but its expensive for larger datasets.

TODO

set recursive to false on the refs call while preloading. That way we'd let the preload nodes find the blocks for us without pre-emptively fetching a huge dataset.

olizilla · 2020-08-14T11:48:38Z

The refs response for the apollo archive is 10MB compressed, 15MB uncompressed...

olizilla · 2020-08-14T13:36:54Z

@achingbrain am I right in thinking there is not currently a way to non-recursively pre-load things? I'm looking at https://github.com/ipfs/js-ipfs/blob/afcf142a7da8559500e4db0eaf739e9bfbf3a8b8/packages/ipfs/src/core/preload.js#L49 which is hardcoded to always pass r=true (aside, that opt seems to be missing from the docs https://docs.ipfs.io/reference/http/api/#api-v0-refs)

The explorer app would like to be able to let ipld do a get on a cid without triggering a recursive fetch of a abitrarily large dateset on the preload nodes. Would it make sense to provide a config option to overide that flag, or another arguement option to the api methods that trigger preloading?

cc @lidel

olizilla · 2020-08-14T13:38:18Z

or should i just disable pre-loading altogether?

achingbrain · 2020-08-14T14:01:05Z

I think maybe just disable it entirely - preloading is supposed to increase the availability of newly added content, which isn't something ipld-explorer is for.

olizilla · 2020-08-14T14:19:19Z

Would we also lose some discovery power too? I assume preloading acts as a rough way to delegate dht look ups to the preload nodes too...

achingbrain · 2020-08-14T14:52:09Z

Not AFAIK, the delegate routers are there to make DHT queries on behalf of js-IPFS nodes, but preloading is separate from that.

lidel · 2021-05-13T23:12:11Z

Related: https://github.com/ipfs/in-web-browsers/issues/173
TLDR: we may save a lot of bandwidth if preload is aware of unixfs structure and only preloads dir + roots of its items, and not entire dag.

lidel · 2021-06-18T13:41:30Z

This depends on ipfs/js-ipfs#3510

lidel · 2024-09-20T19:46:22Z

we switched to helia + preload nodes are gone. this is no longer happening :)

olizilla added the need/triage Needs initial labeling and prioritization label Aug 14, 2020

olizilla self-assigned this Aug 14, 2020

lidel added status/blocked Unable to be worked further until needs are met and removed need/triage Needs initial labeling and prioritization labels Jun 18, 2021

lidel closed this as not planned Won't fix, can't repro, duplicate, stale Sep 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Recursive preload in js-ipfs is fetching MBs of refs data #67

Recursive preload in js-ipfs is fetching MBs of refs data #67

olizilla commented Aug 14, 2020 •

edited

Loading

olizilla commented Aug 14, 2020

olizilla commented Aug 14, 2020

olizilla commented Aug 14, 2020

achingbrain commented Aug 14, 2020

olizilla commented Aug 14, 2020

achingbrain commented Aug 14, 2020

lidel commented May 13, 2021

lidel commented Jun 18, 2021

lidel commented Sep 20, 2024

Recursive preload in js-ipfs is fetching MBs of refs data #67

Recursive preload in js-ipfs is fetching MBs of refs data #67

Comments

olizilla commented Aug 14, 2020 • edited Loading

olizilla commented Aug 14, 2020

olizilla commented Aug 14, 2020

olizilla commented Aug 14, 2020

achingbrain commented Aug 14, 2020

olizilla commented Aug 14, 2020

achingbrain commented Aug 14, 2020

lidel commented May 13, 2021

lidel commented Jun 18, 2021

lidel commented Sep 20, 2024

olizilla commented Aug 14, 2020 •

edited

Loading