sql/logictest: TestLogic failed #73425

Closed
cockroach-teamcity opened this issue Dec 3, 2021 · 3 comments · Fixed by #74163
Labels
branch-master Failures and bugs on the master branch. C-test-failure Broken test (automatically or manually discovered). O-robot Originated from a bot. T-sql-queries SQL Queries Team

Comments

@cockroach-teamcity

sql/logictest.TestLogic failed with artifacts on master @ b8f13f26ed9647428ce3130c129adc913c60316c:

=== RUN   TestLogic
    test_log_scope.go:79: test logs captured to: /go/src/github.com/cockroachdb/cockroach/artifacts/logTestLogic112937844
    test_log_scope.go:80: use -show-logs to present logs inline
=== CONT  TestLogic
    logic.go:3555: -- test log scope end --
test logs left over in: /go/src/github.com/cockroachdb/cockroach/artifacts/logTestLogic112937844
--- FAIL: TestLogic (2438.72s)
=== RUN   TestLogic/5node-disk/distsql_union/Except
    logic.go:2690: 
        
        testdata/logic_test/distsql_union:332: (SELECT y FROM xyz ORDER BY y) EXCEPT ALL (SELECT y FROM xyz ORDER BY y)
        expected success, but found
        (58C01) inbox communication error: rpc error: code = Canceled desc = context canceled
        inbox.go:328: in Next()
    logic.go:2416: 
         pq: inbox communication error: rpc error: code = Canceled desc = context canceled
    logic.go:1962: 
         testdata/logic_test/distsql_union:336: too many errors encountered, skipping the rest of the input
--- done: testdata/logic_test/distsql_union with config 5node-disk: 35 tests, 3 failures
--- total progress: 156339 statements/queries
--- progress: testdata/logic_test/hash_join_dist: 16 statements/queries
--- done: testdata/logic_test/hash_join_dist with config 5node-disk: 18 tests, 0 failures
            --- FAIL: TestLogic/5node-disk/distsql_union/Except (0.02s)
=== RUN   TestLogic/5node-disk/distsql_union
=== PAUSE TestLogic/5node-disk/distsql_union
=== CONT  TestLogic/5node-disk/distsql_union
        --- FAIL: TestLogic/5node-disk/distsql_union (1.44s)
=== RUN   TestLogic/5node-disk
    --- FAIL: TestLogic/5node-disk (0.00s)
Help

See also: [How To Investigate a Go Test Failure \(internal\)](https://cockroachlabs.atlassian.net/l/c/HgfXfJgM)

Parameters in this failure:

  • GOFLAGS=-parallel=2

Same failure on other branches

/cc @cockroachdb/sql-queries


@cockroach-teamcity cockroach-teamcity added branch-master Failures and bugs on the master branch. C-test-failure Broken test (automatically or manually discovered). O-robot Originated from a bot. labels Dec 3, 2021
@blathers-crl blathers-crl bot added the T-sql-queries SQL Queries Team label Dec 3, 2021
@yuzefovich

I originally thought this issue was another instance of #72017. However, there we have a "no inbound stream" error, and here we have a cancellation error. For the "no inbound stream" error, I think it's possible that the recent change making max_running_flows dependent on the CPU count is to blame, so I want to be on the lookout for those. The context cancellation error in this issue seems benign to me, so I'll just close it and wait for more failures like this before investigating.

@yuzefovich

yuzefovich commented Dec 18, 2021

This can be reproduced on master, though rarely, even with the revert of #73887. I think it started happening because of 62ea0c6.

@yuzefovich

Never mind, I reproduced this with #72998 reverted, but that PR might be making the reproduction more likely. My current thinking is that vectorizedInboundStreamHandler.Timeout may somehow be broken, because it seems to be the code most affected by #72998, yet I don't see a problem with the implementation.

2 participants