
CI logs / artifacts analyzing tool #67

Closed
Totktonada opened this issue May 21, 2021 · 4 comments

Totktonada commented May 21, 2021

In short:

  • Implement it as an external crawler (see the reasoning below).
  • A service that holds the collected logs / artifacts and shows generated HTML reports.
  • Weekly HTML reports to track progress, with a statistics timeline and graphs.
  • Ability to regenerate old HTML reports when a new matcher is added (or an existing one is changed).
  • The service should allow downloading all collected logs and artifacts at once (for local experiments).
  • Support any scripting language for problem matchers.
  • Make it easy to experiment locally with a new matcher and propose it via a GitHub pull request.
  • Collect all jobs, not only those on release branches, to have a large amount of statistics.

Reasoning behind external crawler

Here we compare two approaches:

  1. Send completed (successful or failed) job id from the job itself.
  2. Gather past jobs information externally.

The first approach is simpler; that is its main advantage.

The second approach has the following advantages:

  • It can track failures of the GitHub Actions infrastructure and of the self-hosted runners themselves.
  • In case of an error in the crawler or an outage of the service, we can fix it and collect the past jobs retroactively (see the sketch after this list).
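
For illustration, here is a minimal sketch of the second approach, assuming plain `requests` against the public GitHub REST API. The repository name, the token environment variable, and the paging limits are assumptions for the example, not part of the proposal.

```python
# Minimal sketch of an external crawler: page through past workflow runs
# via the GitHub REST API. Repo name, token variable and paging limits
# are assumptions for illustration.
import os

import requests

API = "https://api.github.com/repos/tarantool/tarantool/actions/runs"
HEADERS = {
    "Accept": "application/vnd.github+json",
    "Authorization": "Bearer " + os.environ["GITHUB_TOKEN"],
}

def fetch_runs(per_page=100, max_pages=10):
    """Yield past workflow runs page by page, newest first."""
    for page in range(1, max_pages + 1):
        resp = requests.get(API, headers=HEADERS,
                            params={"per_page": per_page, "page": page})
        resp.raise_for_status()
        runs = resp.json().get("workflow_runs", [])
        if not runs:
            break
        yield from runs

for run in fetch_runs():
    print(run["id"], run["status"], run["conclusion"])
```

A real crawler would also persist the id of the last seen run, so that after a fix or an outage it can continue from where it stopped and backfill the gap.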

Matchers API thoughts

Input: log and artifact file paths in argv.

Output: simple key-value pairs, for example:

type: flaky test
issue: gh-xxxx
summary: blah blah
(and so on)
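
To make the contract concrete, here is a minimal sketch of such a matcher in Python. The fail-line pattern is an invented example (not an actual Tarantool log format), and gh-xxxx stays a placeholder as above.

```python
#!/usr/bin/env python3
# Minimal sketch of a problem matcher: log / artifact file paths come in
# argv, key-value pairs go to stdout. The fail-line pattern below is an
# assumed example, not an actual Tarantool log format.
import re
import sys

FAIL_RE = re.compile(r"\[ fail \]\s+(\S+)")

for path in sys.argv[1:]:
    with open(path, errors="replace") as log:
        for line in log:
            match = FAIL_RE.search(line)
            if match:
                print("type: flaky test")
                print("issue: gh-xxxx")  # placeholder, as in the example above
                print(f"summary: {match.group(1)} failed")
                break
```

Keeping the contract this thin (paths in argv, key-value pairs on stdout) is what makes supporting any scripting language cheap: the service only needs to spawn the matcher and parse its output.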

ligurio commented Sep 14, 2021

I believe ELK should match your requirements.

kyukhin added the teamX label Sep 17, 2021
Totktonada commented:

The initial implementation is here: https://github.com/tarantool/multivac

It has the following features at the moment:

  • Fetch and update job metainfo and logs.
  • Generate a "last seen test fails" report over the whole log collection (with optional filtering per branch). The report can be in CSV or HTML format.
  • (There is also a spent-time calculator, but it is a side feature, out of scope here.)

It is possible to add the script to cron, set up nginx, and generate daily / weekly reports about the overall situation with tests.
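
As a sketch only, one possible cron entry point could regenerate the HTML report into a directory served by nginx. The web root, the output file name, and the generator invocation below are all hypothetical; multivac's actual interface may differ.

```python
#!/usr/bin/env python3
# Hypothetical cron entry point: regenerate the HTML report and publish
# it under a directory served by nginx. All paths and the generator
# command are assumptions for illustration, not multivac's real CLI.
import subprocess
from pathlib import Path

WEB_ROOT = Path("/var/www/reports")  # assumed nginx root

# Assumed invocation of a report generator script.
html = subprocess.run(
    ["python3", "report.py", "--format", "html"],
    check=True, capture_output=True, text=True,
).stdout

WEB_ROOT.mkdir(parents=True, exist_ok=True)
(WEB_ROOT / "index.html").write_text(html)
```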

There are some known problems and ideas for new functionality, but I think that the problem is generally solved from the scripting side.

The next action is to set up a machine and provide HTTP access to regularly updated reports. @kyukhin volunteered for this.

Totktonada commented:

Now the service can be found here: https://shame.tarantool.dev/crawler/

NickVolynkin commented:

Pretty much all of this issue is already done in the multivac / InfluxDB / Grafana toolchain. I will move this ticket to tarantool/multivac and close it.

NickVolynkin transferred this issue from tarantool/tarantool-qa Oct 24, 2022