IPFS Performance #5226
@davinci26 Hi, this looks like a cool test to run! What are the axes in your graph? |
@ajbouh Ooops, rookie mistake. Sorry about that: the X-axis is the number of IPFS nodes and the Y-axis is the average time required to download the file, measured in minutes. The error bars are the standard deviation of the time. I can also link you the script I am using to perform the calculations if you want to reproduce the graph. |
Do you mind providing an outline of the steps your benchmark takes? Is this essentially a graph of the time it takes for the Nth ipfs node to get the same file? Are the nodes all completely connected before the first fetch? Or is node connection time also part of the measured file fetch time? |
My simulation is as follows:
So each point in the graph is the average time required to distribute the file to all nodes, and the X-axis is the number of nodes that try to get the file. The nodes are brand new, so I assume that they are not connected beforehand. I am not sure about the answer to the second question; I am just timing the command. |
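For reference, a minimal sketch of what "timing the command" could look like in Go, assuming the daemon is already running and the file's CID is known; the `timeGet` helper, the output path, and the CID are placeholders invented for this example, not part of the original benchmark:

```go
package main

import (
	"fmt"
	"os/exec"
	"time"
)

// timeGet runs `ipfs get <cid>` against the local daemon and returns how long it took.
// It only measures the fetch command itself; daemon start-up and peer discovery are not included.
func timeGet(cid string) (time.Duration, error) {
	start := time.Now()
	cmd := exec.Command("ipfs", "get", cid, "-o", "/tmp/fetched")
	if err := cmd.Run(); err != nil {
		return 0, err
	}
	return time.Since(start), nil
}

func main() {
	// Hypothetical CID of the 10MB test file.
	elapsed, err := timeGet("QmExampleCid")
	if err != nil {
		panic(err)
	}
	fmt.Printf("download took %s\n", elapsed)
}
```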
It sounds like you might be measuring multiple things: the time until any file can be downloaded at all, and then the time to download the 10MB test file. In my experience, the ipfs daemon takes varying amounts of time to finish starting up. I wonder what would happen if you "warmed up" each daemon by downloading a tiny 1-byte file, and then downloaded the 10MB file. I'd be interested in knowing each of these numbers: the time to boot and download the tiniest file, and the time to download the 10MB file. I'd also be interested in other statistics for the measurements, like the median, 90th percentile, etc. |
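A rough sketch of how those summary statistics could be computed from repeated runs, assuming each daemon is first warmed up with a small untimed fetch and that the durations come from something like the timing sketch above; the function name and sample values are invented for illustration:

```go
package main

import (
	"fmt"
	"sort"
	"time"
)

// summarize reports the mean, median, and 90th percentile of measured download times.
// The warm-up fetch (e.g. a 1-byte file) should happen before these samples are collected,
// so that daemon start-up time is excluded from the 10MB measurements.
func summarize(durations []time.Duration) (mean, median, p90 time.Duration) {
	if len(durations) == 0 {
		return
	}
	sorted := append([]time.Duration(nil), durations...)
	sort.Slice(sorted, func(i, j int) bool { return sorted[i] < sorted[j] })

	var total time.Duration
	for _, d := range sorted {
		total += d
	}
	mean = total / time.Duration(len(sorted))
	median = sorted[len(sorted)/2]
	p90 = sorted[(len(sorted)*9)/10]
	return
}

func main() {
	samples := []time.Duration{65 * time.Second, 70 * time.Second, 80 * time.Second, 120 * time.Second}
	mean, median, p90 := summarize(samples)
	fmt.Println(mean, median, p90)
}
```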
I'll try to do the above, and I'll post my findings here!! |
Awesome, @davinci26. Looking forward to it! |
Also note that bitswap is currently very dumb when it comes to asking for chunks. It'll ask everyone, so more nodes having the file can actually slow things down. |
@Stebalien Thanks for the info. I changed my approach and forked ipfs-network-test, hoping that I would be able to get better results. Some observations that I have made so far (I will post again when I finalize my simulation):
My end goal is to compare the IPFS distribution model against the client-server model for distributing medium-sized files. From the results I have so far, I don't expect IPFS to outperform the client-server model, but I expect to push the system to its limit and possibly discover some useful findings, which I will post here. |
We're working on reducing the memory usage. This will involve:
|
@davinci26 great investigation! A few questions:
- What do you mean by "due to duplicate blocks"?
- Did you try a much smaller file?
- Did you try waiting until the daemon had fully booted and fully connected to other peers?
|
@ajbouh thanks!!

What do you mean by "due to duplicate blocks"?
As I understand it, bitswap currently works as follows (maybe someone can confirm this): Alice publishes a want list of blocks to the network. Other users participating in the exchange then start transmitting data (bytes) to Alice. When a transmission finishes, Alice calculates the hash of the data received and checks whether it matches any of the blocks in her want list. The problem is that as the number of users increases, multiple users send Alice the same block. As a result, Alice spends her bandwidth downloading the same block multiple times, which increases the average time required to download the file. I understand that this is not a trivial problem to fix.

Did you try a much smaller file?
I think the trend would be similar, although I haven't really simulated it rigorously. For a smaller file the number of blocks would be lower and, as a result, there would be fewer duplicate blocks. The issue would remain but would be less visible.

Did you try waiting until the daemon had fully booted and fully connected to other peers?
All nodes are fully loaded and connected, and the time it takes to connect and boot is not included in the graphs above. The simulation code uses the IPTB framework and can be found at https://github.com/davinci26/iptb. The function distCmd implements my simulation and all the calculations.
|
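To make the cost of the duplication described above concrete, here is a toy back-of-the-envelope model (not bitswap's actual logic, and the block size and file size are just example numbers): if every wanted block is answered by several peers, each extra responder sends bytes that are simply discarded.

```go
package main

import "fmt"

// wastedBytes estimates redundant traffic when each wanted block is sent by
// `responders` peers: only one copy is useful, the rest consume bandwidth for nothing.
// It assumes a fixed block size; real bitswap blocks vary in size.
func wastedBytes(fileSize, blockSize, responders int) int {
	blocks := (fileSize + blockSize - 1) / blockSize
	return blocks * blockSize * (responders - 1)
}

func main() {
	// Hypothetical numbers: 10 MB file, 256 KiB blocks.
	for _, peers := range []int{1, 2, 4, 8} {
		fmt.Printf("%d responders -> ~%d MB wasted\n",
			peers, wastedBytes(10_000_000, 256*1024, peers)/1_000_000)
	}
}
```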
Thanks for explaining all that. I am not familiar with the low-level details of bitswap.

Is the list of blocks that are sent randomized at all? It seems that if there are enough other peers advertising the same blocks, we can request that they send them in a random order.

After we get blocks from enough peers, we can prioritize the peers that seem to be fastest and ask them for the blocks we need most/next.

Perhaps @whyrusleeping or @Stebalien have thoughts here?

The fancy move here would be to use forward error correction codes so that we can collect *any* number of blocks from *all* peers and (with high probability) each one would help fill in the parts of the file we're still missing.
|
@davinci26 very nice graphs, could you share how you made them? I would love to be able to use them as a way to benchmark improvements.

On the duplicate blocks issue, we've actually laid the groundwork for fixing this with 'bitswap sessions', where we track which peers are likely to have a given piece of data based on information about who sends us other related pieces. The next step is to more precisely limit which peers we send those wants to, and to track our hit rate (higher hit rate == more certainty, can send one want per peer; lower hit rate == maybe we should still send wants to multiple peers), response times, bandwidth estimates, and so on. We can implement a pretty simple 'want scheduler' (or better-named thing) that will likely make a very noticeable impact. |
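As an illustration of the kind of 'want scheduler' described above, here is a minimal sketch of the idea, assuming a per-session hit-rate counter per peer; the type names, thresholds, and fan-out limit are invented for the example and are not go-ipfs code:

```go
package main

import "fmt"

// peerStats tracks how often a peer actually delivered a block we asked it for.
type peerStats struct {
	wantsSent  int
	blocksRecv int
}

func (p peerStats) hitRate() float64 {
	if p.wantsSent == 0 {
		return 0
	}
	return float64(p.blocksRecv) / float64(p.wantsSent)
}

// peersForWant decides how widely to broadcast a want: with a high hit rate we
// trust a single peer; with a low hit rate we hedge and ask several peers.
func peersForWant(peers map[string]peerStats) []string {
	var fallback []string
	for id, s := range peers {
		if s.hitRate() > 0.8 {
			return []string{id} // confident: one want, one peer
		}
		if len(fallback) < 3 {
			fallback = append(fallback, id) // uncertain: fan out to a few peers
		}
	}
	return fallback
}

func main() {
	peers := map[string]peerStats{
		"peerA": {wantsSent: 10, blocksRecv: 9},
		"peerB": {wantsSent: 10, blocksRecv: 2},
	}
	fmt.Println(peersForWant(peers))
}
```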
@whyrusleeping Just add one more step (a have/request message, as in BitTorrent) between the want and the block sending. Then there would be no need for so many workarounds. |
@whyrusleeping thanks for the bitswap explanation. I produced the data with my forked version of IPTB, which runs the file-distribution simulation. I can prepare a pull request to IPTB to add the graphs and the simulation to it. |
@ivan386 you could do that, but doing so adds an extra round trip to every request and results in non-trivial overhead for smaller blocks. There is room for some amount of 'have'-style messages though: we could definitely add them for other related blocks as part of the message, or, even more interestingly, return a bitfield along with each block that indicates which of that block's children we have (at some non-zero expense to the side sending the blocks). @davinci26 that's really cool, I'd love to see what changes are needed in IPTB to support that. We're actively working on improvements to it, and this is the sort of thing we want :) |
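A sketch of what such a response could carry, purely as an illustration of the bitfield idea and not the real bitswap wire format; the struct and helper names are hypothetical:

```go
package main

import "fmt"

// blockResponse is a hypothetical message: the block itself plus one bit per
// child link indicating whether the sender also has that child block.
type blockResponse struct {
	Data         []byte
	ChildrenHave uint64 // bit i set => sender has child i (assumes <= 64 links)
}

// senderHasChild reports whether the responding peer claims to hold child i.
func senderHasChild(r blockResponse, i int) bool {
	return r.ChildrenHave&(1<<uint(i)) != 0
}

func main() {
	r := blockResponse{Data: []byte("block bytes"), ChildrenHave: 0b1011}
	fmt.Println(senderHasChild(r, 0), senderHasChild(r, 2)) // true false
}
```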
Amazing work! But I didn't find the 'distCmd' function in the fork of IPTB here; would you please point out how to implement the simulation and get the data? |
Hope the above helps. |
@whyrusleeping |
The situation has significantly improved but there's still some work to be done. Sessions don't currently rank peers within the session to try to pick the fastest one. |
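A minimal sketch of the missing ranking step mentioned above, assuming the session keeps an exponentially weighted moving average of each peer's response latency; this is an invented illustration, not the actual sessions code:

```go
package main

import (
	"sort"
	"time"
)

// latencyTracker keeps a smoothed latency estimate per peer.
type latencyTracker struct {
	latency map[string]time.Duration
}

// observe folds a new response time into the peer's moving average (alpha = 0.2).
func (t *latencyTracker) observe(peer string, d time.Duration) {
	old, ok := t.latency[peer]
	if !ok {
		t.latency[peer] = d
		return
	}
	t.latency[peer] = time.Duration(0.8*float64(old) + 0.2*float64(d))
}

// fastest returns peers ordered from lowest to highest estimated latency,
// so the session could prefer the quickest ones for the next wants.
func (t *latencyTracker) fastest() []string {
	peers := make([]string, 0, len(t.latency))
	for p := range t.latency {
		peers = append(peers, p)
	}
	sort.Slice(peers, func(i, j int) bool { return t.latency[peers[i]] < t.latency[peers[j]] })
	return peers
}

func main() {
	t := &latencyTracker{latency: map[string]time.Duration{}}
	t.observe("peerA", 50*time.Millisecond)
	t.observe("peerB", 200*time.Millisecond)
	_ = t.fastest() // ["peerA", "peerB"]
}
```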
Could we get the tests that generated these graphs re-run? |
@whyrusleeping I have the code for the graphs in this fork of IPTB. This code is more than 6 months old. When I developed it, I tried to make a PR for IPTB to add it, and I agree with the IPTB folks that IPTB is not the best place for this code. With that said, I still believe there is value in these graphs/simulations for perf-monitoring purposes. Feel free to include me in any discussion if you want to add this to your codebase. I would be happy to help :) |
Hey folks - I believe more up-to-date metrics for this are here: #6782 (comment). We've made a lot of improvements to Bitswap over the past 6 months that are rolling out in our next release, addressing this exact "many seeders/leechers" performance challenge, so I'm closing this issue to redirect the conversation/evaluation there. =] |
Hey y'all,
I was testing IPFS performance using a script that implements the following logic:
I am using the go-ipfs daemon, and I use the Python IPFS API to do the GET.
============================= EDIT Updated Figure =============================
The X-axis is the number of IPFS nodes and the Y-axis is the average time required to download the file, measured in minutes. The error bars show the standard deviation of the time.
In terms of scalability, I was expecting the time to increase logarithmically as more people request the file, since the file also gains more seeders along with more leechers.
Is this the expected performance of the system? Or am I doing something wrong that causes the performance to deteriorate?
If these kinds of results are something that interests the community, please let me know and I will provide you with the information.