RESTMapper doesn't update to reflect new CRDs #321

danwinship · 2019-02-11T16:41:53Z

Currently, if you use a controller-runtime Client to first create a CRD and then create an instance of that CRD, you get an error such as no matches for kind "NetNamespace" in version "network.openshift.io/v1". This happens because the client's RESTMapper is initialized at startup and never updated with information about newly available resource kinds, so it can only work with the kinds that existed before it was created.

Our workaround for now is a hacked-up wrapper RESTMapper that watches for that error and, when it sees it, reloads its cached data and tries again. If you assume that the error never occurs other than in this case, that seems like a plausible way to solve the problem. I could rework this into a PR for controller-runtime if you want. The only question is whether it should replace the existing default RESTMapper or be an alternative MapperProvider function, so that users pick whether they want static or dynamic behavior.

Or OTOH maybe the real fix should be at a lower level? I came across DeferredDiscoveryRESTMapper which is similar to my wrapper, but doesn't auto-invalidate on cache miss. But we could make it do that, and then change controller-runtime to use that?
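
A minimal sketch of the reload-on-miss wrapper idea described above. It uses a simplified stand-in interface instead of the real meta.RESTMapper so that it has no dependencies. The names Mapper, staticMapper, ReloadingMapper, and ErrNoKindMatch are hypothetical, the load function stands in for re-running discovery, and this is not the actual workaround code.

```go
package main

import (
	"errors"
	"sync"
)

// ErrNoKindMatch is a hypothetical stand-in for the "no matches for kind" error.
var ErrNoKindMatch = errors.New("no matches for kind")

// Mapper is a simplified stand-in for a RESTMapper: it maps a kind to a resource name.
type Mapper interface {
	ResourceFor(kind string) (string, error)
}

// staticMapper is a snapshot of discovery data taken when it was built.
type staticMapper map[string]string

func (m staticMapper) ResourceFor(kind string) (string, error) {
	if r, ok := m[kind]; ok {
		return r, nil
	}
	return "", ErrNoKindMatch
}

// ReloadingMapper wraps a snapshot and rebuilds it when a lookup misses.
type ReloadingMapper struct {
	mu      sync.Mutex
	load    func() (Mapper, error) // assumption: re-runs discovery and returns a fresh snapshot
	current Mapper
}

// NewReloadingMapper takes the initial snapshot eagerly, like the default mapper does.
func NewReloadingMapper(load func() (Mapper, error)) (*ReloadingMapper, error) {
	m, err := load()
	if err != nil {
		return nil, err
	}
	return &ReloadingMapper{load: load, current: m}, nil
}

// ResourceFor answers from the current snapshot; on a "no match" error it
// reloads once and retries, so kinds added after startup become visible.
func (r *ReloadingMapper) ResourceFor(kind string) (string, error) {
	r.mu.Lock()
	defer r.mu.Unlock()

	res, err := r.current.ResourceFor(kind)
	if !errors.Is(err, ErrNoKindMatch) {
		return res, err // a hit, or an unrelated error: no reload
	}
	fresh, loadErr := r.load()
	if loadErr != nil {
		return "", loadErr
	}
	r.current = fresh
	return r.current.ResourceFor(kind)
}
```

Note that every miss triggers a full reload in this sketch, so a lookup for a kind that genuinely does not exist repeats discovery each time.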

danwinship · 2019-02-11T16:42:27Z

oh, meant to link to our workaround, openshift/cluster-network-operator#95

DirectXMan12 · 2019-02-14T22:20:35Z

Yeah, I'd been meaning to port some of the code I wrote over to deal with resources added later. We'd probably want rate-limiting on the cache invalidation, but otherwise I'd be open to making that the default implementation. Watchable discovery would also be nice at some point :-/.

Thus far, the problem is generally mitigated by the fact your pod can just fail and restart, but there are other reasons that updating discovery information is nice.

Feel free to send a PR.

DirectXMan12 · 2019-02-14T22:20:50Z

/kind feature
/priority important-longterm

JoelSpeed · 2019-02-26T14:22:38Z

We came across this problem recently and for the moment have created a RESTMapper using the LazyRESTMapperLoader and a FirstHitRESTMapper, something like the below:

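// Eagerly build a mapper from the discovery data available at startup.
drm, err := apiutil.NewDiscoveryRESTMapper(config)
if err != nil {
	return nil, err
}
// Build a second discovery mapper lazily, on first use.
lrm := meta.NewLazyRESTMapperLoader(func() (meta.RESTMapper, error) {
	return apiutil.NewDiscoveryRESTMapper(config)
})
// Try drm first and fall back to lrm when drm has no match.
options.Mapper = meta.FirstHitRESTMapper{MultiRESTMapper: meta.MultiRESTMapper{drm, lrm}}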

The idea is that if any CRDs are loaded after the first discovery, the LazyRESTMapperLoader should pick those up, but for the most part we use the original DiscoveryRESTMapper. Would this be worth doing in controller-runtime, or is there a better way that invalidates the discovery cache somehow?
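
A small dependency-free model of the first-hit-plus-lazy combination above, to show what it can and cannot pick up. It assumes the lazy loader runs its function once, on first use, and keeps the result from then on. All names here (KindMapper, snapshotMapper, lazyMapper, firstHit, ErrKindNotFound) are hypothetical stand-ins, not the apimachinery types.

```go
package main

import (
	"errors"
	"sync"
)

// ErrKindNotFound is a hypothetical stand-in for the "no matches for kind" error.
var ErrKindNotFound = errors.New("no matches for kind")

// KindMapper is a simplified stand-in for a RESTMapper.
type KindMapper interface {
	ResourceFor(kind string) (string, error)
}

// snapshotMapper is discovery data frozen at the time it was built.
type snapshotMapper map[string]string

func (s snapshotMapper) ResourceFor(kind string) (string, error) {
	if r, ok := s[kind]; ok {
		return r, nil
	}
	return "", ErrKindNotFound
}

// lazyMapper defers loading until first use, then keeps that result.
// Assumption: the load happens at most once.
type lazyMapper struct {
	once  sync.Once
	load  func() (KindMapper, error)
	inner KindMapper
	err   error
}

func (l *lazyMapper) ResourceFor(kind string) (string, error) {
	l.once.Do(func() { l.inner, l.err = l.load() })
	if l.err != nil {
		return "", l.err
	}
	return l.inner.ResourceFor(kind)
}

// firstHit asks each mapper in order and returns the first successful answer.
type firstHit []KindMapper

func (f firstHit) ResourceFor(kind string) (string, error) {
	lastErr := ErrKindNotFound
	for _, m := range f {
		r, err := m.ResourceFor(kind)
		if err == nil {
			return r, nil
		}
		lastErr = err
	}
	return "", lastErr
}
```

Under that assumption, a kind installed before the first fallthrough to the lazy mapper is found, but a kind installed after the lazy mapper has loaded is still missed, because nothing ever refreshes it.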

DirectXMan12 · 2019-02-26T21:26:32Z

I was thinking something like a lazy discovery REST mapper, with a wrapper that invalidates on cache misses, but in a rate-limited way. You can use the DeferredDiscoveryRESTMapper with some custom wrapper around it to do the invalidation and rate limiting (since the DeferredDiscoveryRESTMapper won't actually invalidate on its own).
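
One way the rate-limited invalidation could look, as a dependency-free sketch rather than a wrapper around the real DeferredDiscoveryRESTMapper. The names Snapshot, RateLimitedMapper, ManualClock, and ErrNoMatch are hypothetical, the load function stands in for a discovery call, and the minimum interval is an arbitrary parameter. The clock is injected so the behavior can be exercised without sleeping.

```go
package main

import (
	"errors"
	"sync"
	"time"
)

// ErrNoMatch is a hypothetical stand-in for the "no matches for kind" error.
var ErrNoMatch = errors.New("no matches for kind")

// Snapshot is a simplified stand-in for cached discovery data (kind -> resource).
type Snapshot map[string]string

// RateLimitedMapper loads lazily and reloads on a miss, but no more often
// than once per minInterval.
type RateLimitedMapper struct {
	mu          sync.Mutex
	load        func() (Snapshot, error) // assumption: performs discovery
	minInterval time.Duration
	now         func() time.Time
	snapshot    Snapshot
	loaded      bool
	lastLoad    time.Time
}

func NewRateLimitedMapper(load func() (Snapshot, error), minInterval time.Duration, now func() time.Time) *RateLimitedMapper {
	return &RateLimitedMapper{load: load, minInterval: minInterval, now: now}
}

// reloadLocked replaces the snapshot; the caller must hold mu.
func (m *RateLimitedMapper) reloadLocked() error {
	snap, err := m.load()
	if err != nil {
		return err
	}
	m.snapshot, m.loaded, m.lastLoad = snap, true, m.now()
	return nil
}

func (m *RateLimitedMapper) ResourceFor(kind string) (string, error) {
	m.mu.Lock()
	defer m.mu.Unlock()

	if !m.loaded { // lazy: first discovery happens on first use
		if err := m.reloadLocked(); err != nil {
			return "", err
		}
	}
	if r, ok := m.snapshot[kind]; ok {
		return r, nil
	}
	// Cache miss: invalidate only if the last load is old enough.
	if m.now().Sub(m.lastLoad) < m.minInterval {
		return "", ErrNoMatch
	}
	if err := m.reloadLocked(); err != nil {
		return "", err
	}
	if r, ok := m.snapshot[kind]; ok {
		return r, nil
	}
	return "", ErrNoMatch
}

// ManualClock is a hand-advanced clock for exercising the rate limit.
type ManualClock struct{ t time.Time }

func (c *ManualClock) Now() time.Time          { return c.t }
func (c *ManualClock) Advance(d time.Duration) { c.t = c.t.Add(d) }
```

In real use the clock argument would be time.Now. A miss inside the interval returns the error immediately instead of waiting, which keeps lookups cheap at the cost of a short window in which a new kind is not yet visible.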

* use dynamic client
  Signed-off-by: Artiom Diomin <[email protected]>
* Re-initialize dynamic client to drop CRD cache
  see: kubernetes-sigs/controller-runtime#321
  Signed-off-by: Artiom Diomin <[email protected]>
* Small review tweaks
  Signed-off-by: Artiom Diomin <[email protected]>

fejta-bot · 2019-05-27T21:45:38Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

kron4eg · 2019-05-28T10:10:42Z

/remove-lifecycle stale

DirectXMan12 · 2019-05-31T22:00:37Z

/help-wanted

chrisplo · 2019-06-01T01:15:19Z

I ran into a similar issue when starting a controller with a "Kind" Source for a kind that I know was installed after the Manager client was created. I thought it was going to be a race condition, since it seems to be pulling from the cache (controller-runtime/pkg/source/source.go: Start):

	// Lookup the Informer from the Cache and add an EventHandler which populates the Queue
	i, err := ks.cache.GetInformer(ks.Type)
	if err != nil {
		if kindMatchErr, ok := err.(*meta.NoKindMatchError); ok {
			log.Error(err, "if kind is a CRD, it should be installed before calling Start",
				"kind", kindMatchErr.GroupKind)
		}
		return err
	}

~~But sounds like I need to work on the Manager Client?~~

Looks like @danwinship's workaround is working for me; it seems straightforward enough.

* Delete custom cache setup, no longer necessary
* Consolidate client usage and use dynamic discovery (see kubernetes-sigs/controller-runtime#321; hat tip to @danwinship for openshift/cluster-network-operator#95). Fixes [bz1711373](https://bugzilla.redhat.com/show_bug.cgi?id=1711373).
* Plumb the cache through for future use

…d. (#663) * Use DynamicRESTMapper to reload REST mappings when types are not found. We see some flakes because the default DiscoveryRESTMapper caches the REST mapping and never reloads it. This causes some races if types are not available when a client is initialized. Fixes #650 I copied this fix from the PR/issue here: kubernetes-sigs/controller-runtime#321

vincepri · 2019-08-22T00:06:39Z

@DirectXMan12 I've just hit this issue in Cluster API. I added a new cached (and rate-limited) RESTMapper here: kubernetes-sigs/cluster-api@ee96c31. If you feel like that's a reasonable implementation, I'm happy to PR against controller-runtime as well.

DirectXMan12 · 2019-08-22T22:50:16Z

We're just about ready to get #554 merged; that should solve your problem. Can you try that?

vincepri · 2019-08-22T22:56:33Z

@DirectXMan12 Thanks. After a quick glance it looks very similar, so it should work as well. I'm happy to switch once it's merged into controller-runtime.

* Upgrade controller-runtime to v0.3.0 * Upstream k8s libs are updated to kubernetes-1.15.4 release tag * Fixed many API breaking changes * Logic regarding checking version of CCM is removed Signed-off-by: Artiom Diomin <[email protected]> * Whitelist ICS license for dependencies Signed-off-by: Artiom Diomin <[email protected]> * Fix borken API in e2e tests Signed-off-by: Artiom Diomin <[email protected]> * Fix linter Signed-off-by: Artiom Diomin <[email protected]> * Removed HackIssue321InitDynamicClient hack * upstream issue kubernetes-sigs/controller-runtime#321 Signed-off-by: Artiom Diomin <[email protected]> * Return back accidentally removed initialization of dynamic client Signed-off-by: Artiom Diomin <[email protected]> * Revert "Removed HackIssue321InitDynamicClient hack" This reverts commit 73710af. Signed-off-by: Artiom Diomin <[email protected]> * Better comment reason for pki.go existance. Signed-off-by: Artiom Diomin <[email protected]>

k8s-ci-robot added the kind/feature and priority/important-longterm labels Feb 14, 2019

kron4eg added a commit to kubermatic/kubeone that referenced this issue Mar 18, 2019

Re-initialize dynamic client to drop CRD cache

35b8d46

see: kubernetes-sigs/controller-runtime#321

kron4eg added a commit to kubermatic/kubeone that referenced this issue Mar 18, 2019

Re-initialize dynamic client to drop CRD cache

981fd56

see: kubernetes-sigs/controller-runtime#321 Signed-off-by: Artiom Diomin <[email protected]>

kron4eg added a commit to kubermatic/kubeone that referenced this issue Mar 18, 2019

Re-initialize dynamic client to drop CRD cache

b16bb45

see: kubernetes-sigs/controller-runtime#321 Signed-off-by: Artiom Diomin <[email protected]>

kron4eg added a commit to kubermatic/kubeone that referenced this issue Mar 18, 2019

Re-initialize dynamic client to drop CRD cache

798b095

see: kubernetes-sigs/controller-runtime#321 Signed-off-by: Artiom Diomin <[email protected]>

jcrossley3 mentioned this issue Apr 15, 2019

Default RESTMapper returns stale resources operator-framework/operator-sdk#1328

Closed

mrIncompetent mentioned this issue May 2, 2019

Maintain a RESTMapper per cluster to avoid performing a discovery on each API call kubermatic/kubermatic#3405

Merged

k8s-ci-robot added the lifecycle/stale label May 27, 2019

k8s-ci-robot removed the lifecycle/stale label May 28, 2019

DirectXMan12 added the help wanted label May 31, 2019

ironcladlou mentioned this issue Jun 4, 2019

Bug 1717494: Refactor client and cache handling openshift/cluster-ingress-operator#244

Merged

ironcladlou mentioned this issue Jun 10, 2019

Bug 1711373: Refactor client and cache handling openshift/cluster-dns-operator#116

Merged

guymguym mentioned this issue Jul 24, 2019

client.New() takes more than 10 sec when far from apiserver (apiutil.NewDiscoveryRESTMapper) #537

Closed

This was referenced Jul 27, 2019

Fix race with WaitForCRDs not waiting for CRDs to be fully ready. kudobuilder/kudo#662

Closed

Use DynamicRESTMapper to reload REST mappings when types are not found. kudobuilder/kudo#663

Merged

trierra mentioned this issue Aug 3, 2019

implemented CRUD skeleton for ClusterOperation CRD libopenstorage/operator#30

Merged

joelanford mentioned this issue Aug 7, 2019

[WIP] pkg/restmapper: use exponential backoff with DynamicRESTMapper calls operator-framework/operator-sdk#1792

Closed

galderz mentioned this issue Sep 2, 2019

Post Infinispan 10 integration tasks infinispan/infinispan-operator#140

Merged

estroz mentioned this issue Sep 5, 2019

⚠️ DynamicRESTMapper that reloads on REST cache miss #554

Merged

lrgar mentioned this issue Sep 16, 2019

Handle scenario when Istio is installed after the Operator is started Dynatrace/dynatrace-oneagent-operator#137

Merged

k8s-ci-robot closed this as completed in #554 Oct 10, 2019

kron4eg added a commit to kubermatic/kubeone that referenced this issue Oct 23, 2019

Removed HackIssue321InitDynamicClient hack

8ab6736

* upstream issue kubernetes-sigs/controller-runtime#321 Signed-off-by: Artiom Diomin <[email protected]>

kron4eg added a commit to kubermatic/kubeone that referenced this issue Oct 23, 2019

Removed HackIssue321InitDynamicClient hack

73710af

* upstream issue kubernetes-sigs/controller-runtime#321 Signed-off-by: Artiom Diomin <[email protected]>

kron4eg added a commit to kubermatic/kubeone that referenced this issue Oct 23, 2019

Removed HackIssue321InitDynamicClient hack

63c7de9

* upstream issue kubernetes-sigs/controller-runtime#321 Signed-off-by: Artiom Diomin <[email protected]>

yanniszark mentioned this issue Dec 18, 2019

Client initialization doesn't use DynamicRESTMapper by default #734

Closed

morvencao mentioned this issue Jan 10, 2020

remove dynamic rest mapper from openshift. istio/operator#744

Merged

DirectXMan12 pushed a commit that referenced this issue Jan 31, 2020

Validate create api flags

aacef4a

Fixes #321

sophieliu15 mentioned this issue Mar 6, 2020

HNC: Avoid restart HNC after adding a new CRD kubernetes-retired/multi-tenancy#488

Closed

ibrokethecloud mentioned this issue May 22, 2020

Fleet agents dont pick up newly created CRDs rancher/fleet#44

Closed

danehans mentioned this issue Aug 5, 2020

Bug 1866568: Removes RestMapper since DynamicRestMapper is now the pkg default openshift/cluster-ingress-operator#437

Merged

danehans mentioned this issue Aug 13, 2020

Bug 1868816: Removes RestMapper in favor of pkg defaults openshift/cluster-dns-operator#189

Merged

howieyuen mentioned this issue Aug 22, 2022

[kusion apply] Support to apply CRD and CR at one time KusionStack/kusion#131

Closed
