incr.comp.: Span fingerprinting cannot afford inaccuracies when caching query results #46303

michaelwoerister · 2017-11-27T14:36:30Z

In the current implementation of the incremental compilation change tracking system, the compiler will sometimes explicitly ignore source-span-related information (e.g. when hashing type definitions). We do this in order to avoid false positives that spring from conflating span and HIR information. For example, when the compiler generates debuginfo for a data type, it will access the HIR of the type's definition in order to extract certain information. Thus, even though we explicitly do not include the source location of a type definition in the corresponding debuginfo, changing the source location would still cause the debuginfo to be considered as having changed and the whole object file would need to be compiled.

Explicitly ignoring some span information during change detection was never pretty but it was feasible as long as we only had to consider the consequences this had for cached machine code. Now, that we are starting to cache arbitrary query results I think it is a bad idea to try and keep following this strategy. It seems likely that having such manual exceptions would break every few weeks due to largely unrelated changes to some query.

Also, up until now the compiler makes the assumption that source location information is only relevant when generating debuginfo. This too was only true as long as we just cache machine code. This assumption actually already doesn't hold anymore since we started caching error messages a while ago. These contain source locations but are independent of whether debuginfo is generated or not.

I'm pretty convinced that we should always hash all span information. The question is how to best avoid the negative consequences this will have on change detection accuracy. I think that splitting span information out of HIR and into a side-table will be one of the things we'll have to do. Spans would then be accessed by the NodeId of the corresponding HIR node. This would allow for simply not storing span information in most query results (e.g. MIR) and for avoiding the conflation of source location and other HIR information.

cc @nikomatsakis

The text was updated successfully, but these errors were encountered:

michaelwoerister · 2018-01-22T13:29:33Z

We are hashing spans unconditionally since #46556.

michaelwoerister added the A-incr-comp Area: Incremental compilation label Nov 27, 2017

TimNN added the C-cleanup Category: PRs that clean code up or issues documenting cleanup. label Nov 28, 2017

michaelwoerister mentioned this issue Dec 4, 2017

EXPERIMENTAL: Hash spans unconditionally during incr. comp. #46490

Closed

michaelwoerister closed this as completed Jan 22, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

incr.comp.: Span fingerprinting cannot afford inaccuracies when caching query results #46303

incr.comp.: Span fingerprinting cannot afford inaccuracies when caching query results #46303

michaelwoerister commented Nov 27, 2017

michaelwoerister commented Jan 22, 2018

incr.comp.: Span fingerprinting cannot afford inaccuracies when caching query results #46303

incr.comp.: Span fingerprinting cannot afford inaccuracies when caching query results #46303

Comments

michaelwoerister commented Nov 27, 2017

michaelwoerister commented Jan 22, 2018