Optimizing synctex edit by using an array instead of a linked list for tag lookup #69

user202729 · 2024-02-09T22:02:44Z

This is a profiling result on a relatively large PDF file (1000 pages, with around 500 tags):

A synctex edit command takes 1.5 seconds on the file.

According to the benchmark result, most of the time is spent on searching for the input node corresponding to a tag, which is done by a linked list traversal.

https://github.com/jlaurens/synctex/blob/2020/synctex_parser.c#L6334-L6342

Because the output tags are numbered sequentially by the engines, this can be changed to use an array instead.

By a rough estimation, this improvement would speed up the relevant part by an order of at least 50, which results in approximately 25-30% overall reduction in runtime.

The text was updated successfully, but these errors were encountered:

jlaurens · 2024-02-09T23:01:12Z

I agree. The question is how to implement that in POC? Initially no smart C library was available in TeXLive, but nowadays we have at least lua.

user202729 · 2024-02-09T23:31:51Z

I don't think it would be too difficult to implement it (but it would to require quite a bit of work), basically all we need is a resizable array, which can be implemented with just malloc.

user202729 · 2024-02-25T21:57:07Z

I implemented a proof of concept: user202729/luatex@0831bb6

Caveat: I don't really understand why the source code and ownership system need to be that complicated (there's a signaling system to free the nodes?), so instead of removing the linked list entirely, I just put the array in addition to the linked list.

In theory, it should be possible to remove the double-indirection entirely and make a contiguous array of synctex_node_s, which should improve memory locality and performance.

For me, this indeed shows a ≈ 33% improvement in performance. (from 1.5s to 1s)

user202729 mentioned this issue Feb 25, 2024

Optimizing synctex_edit by hand-write string-to-integer conversion #70

Closed

user202729 linked a pull request Oct 17, 2024 that will close this issue

Use an array to store synctex_node_p nodes indexed by tag #90

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimizing synctex edit by using an array instead of a linked list for tag lookup #69

Optimizing synctex edit by using an array instead of a linked list for tag lookup #69

user202729 commented Feb 9, 2024

jlaurens commented Feb 9, 2024

user202729 commented Feb 9, 2024

user202729 commented Feb 25, 2024 •

edited

Loading

Optimizing synctex edit by using an array instead of a linked list for tag lookup #69

Optimizing synctex edit by using an array instead of a linked list for tag lookup #69

Comments

user202729 commented Feb 9, 2024

jlaurens commented Feb 9, 2024

user202729 commented Feb 9, 2024

user202729 commented Feb 25, 2024 • edited Loading

user202729 commented Feb 25, 2024 •

edited

Loading