Fix errcheck in service/history/shard #3755

MichaelSnowden · 2022-12-22T01:29:06Z

What changed?

Why?

How did you test it?

Potential risks

Is hotfix candidate?

yycptt · 2023-01-06T19:41:27Z

service/history/shard/context_impl.go

@@ -1505,18 +1503,24 @@ func (s *ContextImpl) createEngine() Engine {

 // start should only be called by the controller.
 func (s *ContextImpl) start() {
-	s.transition(contextRequestAcquire{})
+	if err := s.transition(contextRequestAcquire{}); err != nil {


s.transition() already emits logs if the transition is invalid I believe.

yycptt · 2023-01-06T19:41:29Z

service/history/shard/context_impl.go

@@ -1436,8 +1437,7 @@ func (s *ContextImpl) handleReadError(err error) error {
 	case *persistence.ShardOwnershipLostError:
 		// Shard is stolen, trigger shutdown of history engine.
 		// Handling of max read level doesn't matter here.
-		s.transition(contextRequestStop{})
-		return err
+		return multierr.Combine(err, s.transition(contextRequestStop{}))


I don't think we can/should combine the errors here (and two more places below).

Many places in our code path is still checking error type directly not via errors.As, so those places might break.

To me, the error from s.transition is internal to the shard context impl and upper layer should not know.

cc @dnr Would you mind also take a look?

Yeah, this PR doesn't make much sense. I think you should revert these changes.

The error returned from transition is really only meaningful for contextRequestAcquired, the others will always return nil and the result doesn't have to be checked. I know errcheck isn't smart enough to figure that out but we can just manually ignore them.

As Yichao said, transition already logs so callers should not.

And I agree the multierr stuff is not appropriate here.

dnr · 2023-01-10T00:13:02Z

service/history/shard/context_impl.go

@@ -1436,8 +1437,7 @@ func (s *ContextImpl) handleReadError(err error) error {
 	case *persistence.ShardOwnershipLostError:
 		// Shard is stolen, trigger shutdown of history engine.
 		// Handling of max read level doesn't matter here.
-		s.transition(contextRequestStop{})
-		return err
+		return multierr.Combine(err, s.transition(contextRequestStop{}))


Yeah, this PR doesn't make much sense. I think you should revert these changes.

The error returned from transition is really only meaningful for contextRequestAcquired, the others will always return nil and the result doesn't have to be checked. I know errcheck isn't smart enough to figure that out but we can just manually ignore them.

As Yichao said, transition already logs so callers should not.

And I agree the multierr stuff is not appropriate here.

dnr · 2023-01-10T00:16:37Z

service/history/shard/controller_test.go

@@ -790,7 +790,9 @@ func (s *controllerSuite) TestShardControllerFuzz() {
 			shardID := int32(rand.Intn(int(s.config.NumberOfShards))) + 1
 			switch rand.Intn(5) {
 			case 0:
-				s.shardController.GetShardByID(shardID)
+				if _, err := s.shardController.GetShardByID(shardID); err != nil {
+					return err


returning defeats the purpose of this code, which is to generate load on the shard controller. this shouldn't return an error but if it does the worker should continue to run.

MichaelSnowden · 2023-01-25T18:02:13Z

Thanks for spotting these issues. I have a PR to revert these changes here: #3842

Fix errcheck in service/history/shard

bf5d25c

MichaelSnowden requested a review from a team as a code owner December 22, 2022 01:29

yux0 approved these changes Dec 28, 2022

View reviewed changes

MichaelSnowden merged commit c912454 into master Dec 28, 2022

MichaelSnowden deleted the service/history/shard branch December 28, 2022 17:33

yycptt reviewed Jan 6, 2023

View reviewed changes

dnr reviewed Jan 10, 2023

View reviewed changes

MichaelSnowden added a commit that referenced this pull request Jan 25, 2023

Revert #3755

13338c6

MichaelSnowden added a commit that referenced this pull request Jan 26, 2023

Revert #3755 (#3842)

c52848e

MichaelSnowden added a commit that referenced this pull request Jan 31, 2023

Revert #3755 (#3842)

706617f

MichaelSnowden added a commit that referenced this pull request Feb 3, 2023

Revert #3755 (#3842)

7c0d471

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix errcheck in service/history/shard #3755

Fix errcheck in service/history/shard #3755

MichaelSnowden commented Dec 22, 2022

yycptt Jan 6, 2023

yycptt Jan 6, 2023

dnr Jan 10, 2023

dnr Jan 10, 2023

dnr Jan 10, 2023

MichaelSnowden commented Jan 25, 2023

Fix errcheck in service/history/shard #3755

Fix errcheck in service/history/shard #3755

Conversation

MichaelSnowden commented Dec 22, 2022

yycptt Jan 6, 2023

Choose a reason for hiding this comment

yycptt Jan 6, 2023

Choose a reason for hiding this comment

dnr Jan 10, 2023

Choose a reason for hiding this comment

dnr Jan 10, 2023

Choose a reason for hiding this comment

dnr Jan 10, 2023

Choose a reason for hiding this comment

MichaelSnowden commented Jan 25, 2023