Support for polymorphic associations #64

vovimayhem · 2018-02-07T01:28:43Z

This PR addresses #37

It provides support for polymorphic associations. I expect some conversation and/or change requests to be made during review :)

Note that I also added performance benchmarks for polymorphic cases, and the speed_factor is not 25x, but rather 5 :( I don't know if 25x is a hard goal or not. The factor for the other cases remains unchanged.

This implementation iterates over the associated collection's objects, and maps out the ID and the object type whenever it's a has_many. I wonder if reading an already-extracted "dictionary" would increase performance, although I'm sure that would be transferring the performance problem to somewhere else.

christophersansone · 2018-02-07T13:59:38Z

lib/fast_jsonapi/serialization_core.rb

@@ -26,6 +26,21 @@ def ids_hash(ids, record_type)
        id_hash(ids, record_type) # ids variable is just a single id here
      end

+      def id_hash_from_record(record)
+        { id: record.id.to_s, type: record.class.name.underscore.to_sym }


@vovimayhem Thanks for tackling this! I'm wondering if record.class.name.underscore.to_sym is making assumptions about what the type should be: it must be determined by the class name, it must be underscored, etc. It seems like the canonical value for this record type would be on the serializer itself, e.g. PersonSerializer#record_type. If the serializer does not exist, it could potentially fall back to the underscored class name.

It looks like the primitives for accessing other serializers are built in... something like compute_serializer_key(record.class.name).constantize. It still makes the assumption that the serializer name can be inferred from the record class, but there needs to be some sort of convention for a polymorphic lookup...

Hmmm... What I can see is that the most brutal performance gains in this gem come from reducing the assumptions/choices to be made at serializing time, and moving them instead to serializer definition time... hence why we have options such as belongs_to :owner, record_type: :user (here we're assuming the object type is user, so we don't waste time figuring out the serializer to use at serializing time).

We can't do that with a non-homogeneous collection (i.e. polymorphic association) because there's no way to know exactly which object types are in a collection at definition time...

...but may I suggest something in the middle: We may know some (if not all) of the types we expect to see in a collection. That way, if the type of object in a collection is known, we won't need to figure the record_type/serializer out (that record.class.name.underscore.to_sym).

I'll give that a try

Awesome, thanks for determining the performance bottleneck.

I offered a suggestion in #49 that I think could apply nicely here as well. Can we define a class method for determining the record type, given the record instance? For example:

def self.polymorphic_record_type(record)
  ...
end

The default implementation can use the same methodology for determining the default record type (and potentially even cache the results as key/value pairs rather than calculating them every time), but it can be overridden for those (rare?) cases where you have a polymorphic association whose type is not the default. It seems like this solution would provide sensible defaults without sacrificing performance, and also have the ability to easily override when the situation requires it.
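A minimal sketch of the suggestion above, not the gem's actual API. Only the method name polymorphic_record_type comes from this thread; the TypeLookup module, the underscore helper (a simplified stand-in for ActiveSupport's String#underscore), and the example classes are hypothetical names used for illustration. The default caches one symbol per record class, and a serializer can override the method when the type is not the default.

```ruby
# Hypothetical mixin; only the method name polymorphic_record_type
# comes from the discussion.
module TypeLookup
  # Simplified plain-Ruby stand-in for ActiveSupport's String#underscore.
  def underscore(name)
    name.gsub("::", "/")
        .gsub(/([a-z\d])([A-Z])/, '\1_\2')
        .downcase
  end

  # Default: derive the type from the class name once per class,
  # then reuse the cached symbol on later calls.
  def polymorphic_record_type(record)
    @polymorphic_record_types ||= {}
    @polymorphic_record_types[record.class] ||= underscore(record.class.name).to_sym
  end
end

class Employee; end
class LegacyGroup; end

class DefaultSerializer
  extend TypeLookup
end

class CustomSerializer
  extend TypeLookup

  # Override for the case where the type differs from the class name;
  # everything else falls back to the cached default.
  def self.polymorphic_record_type(record)
    record.is_a?(LegacyGroup) ? :group : super
  end
end
```

With this shape, DefaultSerializer resolves a LegacyGroup record to :legacy_group, while CustomSerializer resolves the same record to :group and still uses the default for every other class.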

I'm gonna take a look at this.

I'm definitely going to take a look at this... thanks for the idea!

christophersansone · 2018-02-07T14:14:53Z

@vovimayhem Looks awesome in general, thanks a lot. I'm wondering why the speed factor changed so much. The implementation looks solid, and nothing jumps out as a performance problem, but that much of a performance differential would seem to indicate there is something that can be optimized. Have you tried to isolate where the performance change might be occurring? Is it perhaps record.class.name.underscore.to_sym?

vovimayhem · 2018-02-07T18:21:59Z

@christophersansone I felt like cheating with this one:

class GroupSerializer
  include FastJsonapi::ObjectSerializer
  set_type :group
  attributes :name
  has_many :groupees, polymorphic: { Person => :person, Group => :group }
end

Notice the polymorphic option now includes a dictionary of "record_type" for each possible class.

I'm not tackling the root problem with this, tho. But, the speed factor is back to 25 :)

vovimayhem · 2018-02-07T19:16:46Z

lib/fast_jsonapi/serialization_core.rb

-        { id: record.id.to_s, type: (record_type || record.class.name.underscore.to_sym) }
+      def id_hash_from_record(record, record_types)
+        # memoize the record type within the record_types dictionary, then assigning to record_type:
+        record_type = record_types[record.class] ||= record.class.name.underscore.to_sym


@christophersansone Not yet a method like you suggested (that's something I can refactor, tho), but caching like a boss...
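A self-contained sketch of how the memoization in the diff above behaves, under a few assumptions: the Struct classes stand in for real models, and the top-level underscore helper is a simplified replacement for ActiveSupport's String#underscore. Classes listed in the dictionary are looked up directly; an unexpected class is computed once and stored in the same hash.

```ruby
# Stand-in records; a real app would use ActiveRecord models.
Person = Struct.new(:id)
Group  = Struct.new(:id)
Team   = Struct.new(:id)

# Simplified stand-in for ActiveSupport's String#underscore.
def underscore(name)
  name.gsub(/([a-z\d])([A-Z])/, '\1_\2').downcase
end

# record_types maps Class => type symbol. Known classes are a plain hash
# lookup; unknown classes are derived once and written back into the hash.
def id_hash_from_record(record, record_types)
  record_type = record_types[record.class] ||= underscore(record.class.name).to_sym
  { id: record.id.to_s, type: record_type }
end

# The dictionary passed through the polymorphic option in the example above.
record_types = { Person => :person, Group => :group }

groupees = [Person.new(1), Group.new(2), Team.new(3), Team.new(4)]
hashes = groupees.map { |record| id_hash_from_record(record, record_types) }
```

After the first Team record is seen, record_types contains a Team entry, so the second Team record skips the string manipulation entirely.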

vovimayhem · 2018-02-07T19:28:17Z

@shishirmk back to 25x, now it looks good.

I added the polymorphic option to associations. Then, in an attempt to bring the performance back to 25x, I made the polymorphic option a dictionary of expected object classes & record_types (cheating).

Finally, I followed @christophersansone's tip and added memoization (caching) of the object classes & record types for unexpected record types.

I've got a question: Should we keep the polymorphic hash option, or can we change it back to a boolean for the sake of simplicity and ease of use? I feel the performance gain of having a pre-defined dictionary is negligible...

christophersansone · 2018-02-07T20:17:18Z

@vovimayhem Nice work! I think if we had the class method for determining the type, we could just make polymorphic: true and remove the hash option. If we did not have the method, then we should still have a way to override the default type, so the hash option would be a possible way to do so. My personal preference would be to have that method: it's clear, it's easy, and it's highly customizable.

vovimayhem · 2018-02-07T20:22:58Z

@christophersansone On second thought, having a Class => record_type dictionary in the polymorphic option allows us to define/override the record type of each class... so maybe it will stay.

However, I'll move the memoization logic to its own method...

...actually there is a lot of code that may be split in smaller parts throughout the gem!

shishirmk · 2018-02-08T03:08:28Z

This is awesome work @vovimayhem. @christophersansone thank you for helping and explaining more about the performance tests.

Do you mind sharing the benchmark numbers for this branch for 1000 records, with and without polymorphic relationships?

shishirmk · 2018-02-08T03:10:42Z

spec/lib/object_serializer_performance_spec.rb

+        our_json, ams_json = run_json_benchmark(message, group_count, our_serializer, ams_serializer)
+
+        message = "Serialize to Ruby Hash #{group_count} with polymorphic has_many"
+        run_hash_benchmark(message, group_count, our_serializer, ams_serializer)


Can you add the jsonapi_serializer to the benchmark? Refer to line 80 in https://github.com/Netflix/fast_jsonapi/blob/dev/spec/lib/object_serializer_performance_spec.rb

vovimayhem · 2018-02-08T18:00:30Z

@shishirmk Updated the performance specs to include the jsonapi-rb benchmarks...
...although the jsonapi-rb benchmarks look funny:

Without polymorphic relationships (original case):

Serialize to JSON string 1000 records

Serializer	Records	Time
AMS serializer	1000	344.28 ms
jsonapi-rb serializer	1000	31.94 ms
Fast serializer	1000	12.76 ms

Serialize to Ruby Hash 1000 records

Serializer	Records	Time
AMS serializer	1000	320.46 ms
jsonapi-rb serializer	1000	30.51 ms
Fast serializer	1000	10.25 ms

With polymorphic has_many (new case):

Serialize to JSON string 1000 with polymorphic has_many

Serializer	Records	Time
AMS serializer	1000	225.71 ms
jsonapi-rb serializer	1000	0.33 ms
Fast serializer	1000	6.99 ms

Serialize to Ruby Hash 1000 with polymorphic has_many

Serializer	Records	Time
AMS serializer	1000	199.81 ms
jsonapi-rb serializer	1000	0.18 ms
Fast serializer	1000	5.13 ms
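To relate these timings to the 25x target mentioned earlier, here is a small sketch that divides the AMS time by the Fast serializer time for each table. It assumes that this ratio is what the speed factor means; the hash labels are made up for illustration, and the timings are copied from the tables above.

```ruby
# Timings in milliseconds, copied from the four tables above
# (AMS serializer vs. Fast serializer, 1000 records each).
timings = {
  "JSON string, no polymorphic"       => { ams: 344.28, fast: 12.76 },
  "Ruby Hash, no polymorphic"         => { ams: 320.46, fast: 10.25 },
  "JSON string, polymorphic has_many" => { ams: 225.71, fast: 6.99 },
  "Ruby Hash, polymorphic has_many"   => { ams: 199.81, fast: 5.13 }
}

# Assumed definition: speed factor = AMS time / Fast serializer time.
speed_factors = timings.transform_values { |t| (t[:ams] / t[:fast]).round(1) }

speed_factors.each { |label, factor| puts "#{label}: #{factor}x" }
```

Under that assumption, all four cases in this run come out above 25.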

shishirmk · 2018-02-09T06:18:32Z

@vovimayhem Thank you for running the benchmarks.

shishirmk · 2018-02-09T06:28:55Z

@vovimayhem Forgot to mention: do you mind updating the README with a section about how to set up polymorphic associations in the serializer class?

vovimayhem · 2018-02-09T15:52:52Z

No prob

mrryanjohnston

One comment on how belongs_to works.

mrryanjohnston · 2018-02-23T03:59:11Z

lib/fast_jsonapi/serialization_core.rb

+        return ids_hash(
+          record.public_send(relationship[:id_method_name]),
+          relationship[:record_type]
+        ) unless polymorphic


First of all, thanks so much for this PR. I'd really like to use this feature in a project, but I think the implementation here should change a bit in the case of polymorphic belongs_to.

type_method_name should be inferred directly from the object_method_name in the case of a polymorphic association since this value should be saved directly on the record. Something along the lines of this might make that work (I have yet to test this):

type_name = polymorphic ? record.public_send("#{relationship[:object_method_name]}_type") : relationship[:record_type]
return ids_hash(
  record.public_send(relationship[:id_method_name]),
  type_name,
) if relationship[:relationship_type] == :belongs_to || !polymorphic

This is in opposition to the implementation here of always referring to the record.class and sending it to id_hash_from_record in the case of a belongs_to association. I believe this would save a database query. It would also save having to define a map when your belongs_to relationship is different than the underlying record's class name. Thoughts?

Ok, I tested this out locally and messed it up a little. I'm going to edit my original code sample above for something that worked for me.

type_name = polymorphic ? record.public_send("#{relationship[:object_method_name]}_type") : relationship[:record_type]

# could be re-written for readability as:
type_name = if polymorphic
  type_method_name = "#{relationship[:object_method_name]}_type"
  record.public_send(type_method_name)
else
  relationship[:record_type]
end

Side note: Instead of constructing the type_name here, you could also define it within the fetch_polymorphic_option method call.
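A runnable sketch of the belongs_to idea above, assuming a record that exposes owner_id and owner_type the way a Rails polymorphic belongs_to :owner would. The Pet struct, the type_for helper, and this simplified id_hash are hypothetical stand-ins and not the gem's code; the relationship keys mirror the ones used in the diff.

```ruby
# Stand-in for a model with a polymorphic belongs_to :owner, which keeps
# owner_id and owner_type on the record itself.
Pet = Struct.new(:owner_id, :owner_type)

# Simplified version of the helper that builds a resource identifier.
def id_hash(id, record_type)
  return nil if id.nil?
  { id: id.to_s, type: record_type }
end

# Hypothetical helper: read the type from the "<association>_type" method
# when polymorphic, otherwise use the type fixed at definition time.
def type_for(record, relationship, polymorphic)
  if polymorphic
    type_method_name = "#{relationship[:object_method_name]}_type"
    record.public_send(type_method_name)
  else
    relationship[:record_type]
  end
end

relationship = { object_method_name: :owner, id_method_name: :owner_id, record_type: :user }
pet = Pet.new(7, "Person")
owner_id = pet.public_send(relationship[:id_method_name])

polymorphic_hash = id_hash(owner_id, type_for(pet, relationship, true))
plain_hash       = id_hash(owner_id, type_for(pet, relationship, false))
```

Neither call touches the associated owner object, which is the point of the suggestion. Note that the polymorphic branch returns the stored class name ("Person") as a string, so a real implementation would probably still need to map it to the serializer's type.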

* add hash benchmarking to performance tests
* Add missing attribute in README example
* Disable GC before doing performance test
* Enable oj to AM for fair benchmark test
* Support for polymorphic associations
* Optional dictionary for polymorphic associations
* Added polymorphic record types memoization
* Updated performance tests for polymorphic examples to include jsonapi-rb

shishirmk and others added 6 commits February 1, 2018 20:21

add hash benchmarking to performance tests

74bb873

Add missing attribute in README example

631754c

Disable GC before doing performance test

823d0df

Enable oj to AM for fair benchmark test

4312d02

Merge pull request Netflix#34 from oboxodo/patch-1

bb7bc45

Add missing attribute in README example

Support for polymorphic associations

2e8afd4

vovimayhem force-pushed the feature/support-for-polymorphic-associations branch from d325bc9 to 2e8afd4 Compare February 7, 2018 01:30

christophersansone reviewed Feb 7, 2018

Optional dictionary for polymorphic associations

0c70d7c

vovimayhem force-pushed the feature/support-for-polymorphic-associations branch from 929533a to 0c70d7c Compare February 7, 2018 18:30

Added polymorphic record types memoization

ae00e70

vovimayhem commented Feb 7, 2018

vovimayhem mentioned this pull request Feb 7, 2018

After OJ is enabled for AMS, the speed test randomly failed #51

Open

shishirmk reviewed Feb 8, 2018

shishirmk changed the base branch from master to dev February 8, 2018 03:18

vovimayhem added 2 commits February 8, 2018 11:22

Merge branch 'dev' into feature/support-for-polymorphic-associations

e5f309c

Updated performance tests for polymorphic examples to include jsonapi-rb

c0cab21

shishirmk merged commit 6d516c2 into Netflix:dev Feb 9, 2018

vovimayhem deleted the feature/support-for-polymorphic-associations branch February 9, 2018 15:55

vovimayhem mentioned this pull request Feb 9, 2018

Polymorphic associations and Serializer DSL #72

Open

mrryanjohnston reviewed Feb 23, 2018

shuheiktgw mentioned this pull request Mar 11, 2018

Remove unused local variables from #relationships_hash #112

Merged
