Luna performance case — writing memory #239

mwu-tow · 2018-08-08T22:59:59Z

Consider the following Luna code:

import Std.Base
import Std.Foreign.C.Value
import Std.Time

def main:
    count = 1000000
    a = Array CInt64 . alloc count
    def write n:
        a.uncheckedWriteAt n $ CInt64.fromInt n
        if n == 0 then None else write n-1

    t0 = Time.now
    write (count-1)
    t1 = Time.now
    print (t1.diff t0)

And the equivalent C++ code:

#include <chrono>
#include <iostream>
#include <string>
#include <vector>

void write(std::vector<int64_t> &array, int64_t n)
{
	array[n] = n;
	if(n)
		write(array, n-1);
}

template<typename F, typename ...Args>
static auto duration(F&& func, Args&&... args)
{
	const auto start = std::chrono::steady_clock::now();
	std::invoke(std::forward<F>(func), std::forward<Args>(args)...);
	return std::chrono::steady_clock::now() - start;
}

template<typename F, typename ...Args>
static auto measure(std::string text, F&& func, Args&&... args)
{
	const auto t = duration(std::forward<F>(func), std::forward<Args>(args)...);
	std::cout << text << " took " << std::chrono::duration_cast<std::chrono::microseconds>(t).count() / 1000.0 << " ms" << std::endl;
	return t;
}

int main()
{
	int count = 1'000'000;
	std::vector<int64_t> v;
	v.resize(count);
	measure("write " + std::to_string(count), write, v, count-1);
}

Basically the program allocates an array of one million 64-bit integers and then writes to each of them using recursive write function.

Luna output:

31630.7974ms

C++ output:

write 1000000 took 0.595 ms

Methodology: ran several times, took the best result
Luna: used shell luna from luna-core
C++: MSVC 15,7 optimized x64 build

Perhaps I am again unknowingly triggering some thunk-exploding lazy evaluation trap.
Otherwise, the results would suggest some serious performance issue, as such task shouldn't be more than 50 000 times slower than C++.

The text was updated successfully, but these errors were encountered:

kustosz · 2018-08-08T23:12:35Z

@mwu-tow can you please check how these times in Luna scale with the size of array? If they are linear, we have an awesome test case for @iamrecursion to look into :)

mwu-tow · 2018-08-08T23:24:52Z

They seem linear enough.

Count	Time [ms]
1	0.0
10	1.000900001
100	4.019999999
1000	37.9863
2500	110.9783
5000	220.929600001
10000	373.840300001
25000	946.707
50000	1815.5325
100000	3062.110000001
200000	6056.7706
300000	9237.2525
400000	12961.7436
500000	15301.819100001
600000	19188.2202
700000	21644.0454
800000	26398.623699999
900000	29034.9906
1000000	31063.763700001

kustosz · 2018-08-08T23:29:01Z

Perfect. This means we have a "it's not optimizing well" problem instead of a "it's exponential, semantics are unclear and everything is on fire" problem.

@iamrecursion please take a look at what happens here. I think you can also get away with benchmarking things like recursive adding numbers instead of memory writes, the results are pretty disappointing there too (which I think is a good thing again – if you optimize that super simple code, other things should speed up automatically).

iamrecursion · 2018-08-09T07:45:53Z

Brilliant catch! I'll add this to my 'to look at' list.

Please do add any performance-related issues to my epic.

piotrMocz · 2018-08-09T13:51:07Z

Are we doing tail-call-optimization in Luna by any chance?

iamrecursion · 2018-08-09T15:28:35Z

Also on my to-look-at-list.

mikusp · 2018-08-10T13:15:31Z

One observation: running luna with +RTS -N1 resulted in better performance than default. I see that GC time rises dramatically, maybe because of contention of 8 threads running garbage collection?

mwu-tow added the T - Enhancement label Aug 8, 2018

wdanilo assigned iamrecursion Aug 8, 2018

wdanilo added D - Core Contributor p-high Should be completed in the next sprint labels Aug 8, 2018

mwu-tow mentioned this issue Dec 6, 2018

FFI calls performance problem #347

Closed

turion pushed a commit to turion/luna that referenced this issue Jun 25, 2019

Add status bar interpreter controls. enso-org#239

09c4a1a

turion pushed a commit to turion/luna that referenced this issue Jun 25, 2019

Add Interpreter control events and handlers in gui. enso-org#239

86ff978

turion pushed a commit to turion/luna that referenced this issue Jun 25, 2019

Minor fix. enso-org#239

699a9a9

turion pushed a commit to turion/luna that referenced this issue Jun 25, 2019

Initial version of interpreter control enso-org#239

0ada1e2

turion pushed a commit to turion/luna that referenced this issue Jun 25, 2019

Send updates about interpreter status enso-org#239

e23a3fa

turion pushed a commit to turion/luna that referenced this issue Jun 25, 2019

Set proper message on start enso-org#239

1411d35

joenash closed this as completed Jun 23, 2020

MichaelMauderer mentioned this issue Jun 6, 2023

IDE logging not working when opening cloud project #6899

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Luna performance case — writing memory #239

Luna performance case — writing memory #239

mwu-tow commented Aug 8, 2018

kustosz commented Aug 8, 2018

mwu-tow commented Aug 8, 2018

kustosz commented Aug 8, 2018

iamrecursion commented Aug 9, 2018 •

edited

Loading

piotrMocz commented Aug 9, 2018

iamrecursion commented Aug 9, 2018

mikusp commented Aug 10, 2018

Luna performance case — writing memory #239

Luna performance case — writing memory #239

Comments

mwu-tow commented Aug 8, 2018

kustosz commented Aug 8, 2018

mwu-tow commented Aug 8, 2018

kustosz commented Aug 8, 2018

iamrecursion commented Aug 9, 2018 • edited Loading

piotrMocz commented Aug 9, 2018

iamrecursion commented Aug 9, 2018

mikusp commented Aug 10, 2018

iamrecursion commented Aug 9, 2018 •

edited

Loading