
Leader election #411


Closed
shawkins opened this issue Apr 28, 2021 · 5 comments · Fixed by #1358
Assignees
Labels
feature kind/feature Categorizes issue or PR as related to a new feature.
Milestone

Comments

@shawkins
Collaborator

Related to #409: are there plans to add leader election functionality, similar to https://docs.openshift.com/container-platform/4.7/operators/operator_sdk/osdk-leader-election.html, to the Java Operator SDK?

@jmrodri jmrodri added kind/feature Categorizes issue or PR as related to a new feature. feature labels Sep 2, 2021
@csviri csviri self-assigned this Jan 10, 2022
@csviri
Collaborator

csviri commented Feb 2, 2022

@csviri csviri added this to the 3.3 milestone Jun 8, 2022
@csviri
Collaborator

csviri commented Jun 8, 2022

@shawkins I added this to the 3.3 milestone for now.

To summarize, to my understanding there are two cases where running multiple instances of an operator happens and/or is desirable. Leader election makes sure only one of them is actively reconciling, so the non-leader instances don't execute reconcilers:

  1. Minimize downtime in the following cases:
    • An updated version of the operator is being released, and the deployment first creates the new operator pod and then stops the old one. (For now, use the Recreate deployment strategy to handle this scenario.)
    • Minimize downtime after an operator crash by keeping multiple instances running at all times. However, there are multiple strategies here: if an instance is not the leader, should it still populate the caches and just not reconcile the events?
  2. Make sure fail-over operator instances are already provisioned on the cluster. With multiple instances provisioned up front, if the active operator's pod crashes it cannot happen that a replacement instance fails to start because cluster resources are unavailable.
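Both scenarios above rest on the same primitive: a lease that at most one instance holds at a time, and that a standby can take over once the holder stops renewing it. The following is a minimal, self-contained sketch of that mechanism; the class and method names are illustrative, not the java-operator-sdk or Kubernetes API (which uses a Lease object in the cluster rather than in-process state).

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical in-memory sketch of lease-based leader election.
public class LeaseElection {
    private String holder;                 // identity of the current leader, null if none
    private Instant expiry = Instant.MIN;  // when the current lease runs out
    private final Duration leaseDuration;

    public LeaseElection(Duration leaseDuration) {
        this.leaseDuration = leaseDuration;
    }

    /**
     * A candidate acquires (or renews) the lease iff it is free, expired,
     * or already held by that same candidate. Returns true if the candidate
     * is now the leader.
     */
    public synchronized boolean tryAcquire(String candidate, Instant now) {
        if (holder == null || now.isAfter(expiry) || holder.equals(candidate)) {
            holder = candidate;
            expiry = now.plus(leaseDuration);
            return true;
        }
        return false; // someone else holds a live lease; stay passive
    }

    public synchronized String holder() {
        return holder;
    }

    public static void main(String[] args) {
        LeaseElection lease = new LeaseElection(Duration.ofSeconds(15));
        Instant t0 = Instant.parse("2022-06-08T00:00:00Z");
        System.out.println(lease.tryAcquire("operator-a", t0));                 // true: a leads
        System.out.println(lease.tryAcquire("operator-b", t0.plusSeconds(5)));  // false: lease still live
        // operator-a crashes and stops renewing; after expiry, b takes over
        System.out.println(lease.tryAcquire("operator-b", t0.plusSeconds(20))); // true: fail-over
    }
}
```

This covers both cases: during a rolling update the new pod simply waits until the old pod's lease lapses, and a warm standby takes over the same way after a crash.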

In summary, there is one design question:

  • Should the non-leader operator instances activate event sources and just not trigger reconciliation until elected as leader? Or should an instance basically only start once it is elected leader? Both have pros and cons: activated event sources consume resources (possibly polling in some cases, caching resources in memory), but on the other hand they minimize downtime in case syncing the caches on startup takes a long time.
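The two options in the design question can be sketched as a startup flag on the instance. This is a hypothetical shape only (the mode names and methods are invented for illustration, not the java-operator-sdk configuration API):

```java
// Hypothetical sketch of the two non-leader startup strategies.
public class OperatorInstance {
    /** What a non-leader instance does while waiting to be elected. */
    public enum NonLeaderMode { WARM_CACHES, FULLY_PASSIVE }

    private final NonLeaderMode mode;
    private boolean eventSourcesStarted;
    private boolean reconciling;

    public OperatorInstance(NonLeaderMode mode) {
        this.mode = mode;
    }

    /** Called on startup, before this instance has won the election. */
    public void start() {
        // WARM_CACHES pre-populates caches for fast fail-over but never
        // reconciles; FULLY_PASSIVE defers everything until leadership.
        if (mode == NonLeaderMode.WARM_CACHES) {
            eventSourcesStarted = true;
        }
    }

    /** Called when this instance wins the leader election. */
    public void onStartLeading() {
        eventSourcesStarted = true; // no-op if caches were kept warm
        reconciling = true;
    }

    public boolean eventSourcesStarted() { return eventSourcesStarted; }
    public boolean isReconciling()       { return reconciling; }

    public static void main(String[] args) {
        OperatorInstance warm = new OperatorInstance(NonLeaderMode.WARM_CACHES);
        warm.start();
        System.out.println(warm.eventSourcesStarted()); // true: caches warm before leading
        System.out.println(warm.isReconciling());       // false: not leading yet
    }
}
```

The trade-off is exactly as stated in the comment: WARM_CACHES pays the memory/polling cost continuously in exchange for near-zero fail-over time, while FULLY_PASSIVE pays nothing until election but must then sync caches before reconciling.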

@metacosm
Collaborator

metacosm commented Jun 8, 2022

Maybe the strategy should be configurable, i.e., the framework would support event replication but let users activate or deactivate it depending on their needs?

@csviri
Collaborator

csviri commented Jun 9, 2022

> Maybe the strategy should be configurable i.e. the framework would support event replication but let users activate or deactivate it depending on their needs?

Yes, agreed, a feature flag would be nice for that.

@csviri
Collaborator

csviri commented Jun 9, 2022

Just one more note: in both cases, when an operator becomes the leader it will need to reconcile all the resources anyway, since there is no information about how long the previous leader was down.
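The note above can be sketched as a leadership callback that triggers a full resync rather than waiting for events. Again, the names here are hypothetical, not the framework's actual API:

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch: on acquiring leadership, reconcile EVERY known
// resource, because we cannot know how long the previous leader was down
// or which events were missed in the meantime.
public class LeadershipCallback {
    private final List<String> cachedResources; // stand-in for the primary-resource cache
    private final List<String> reconciled = new ArrayList<>();

    public LeadershipCallback(List<String> cachedResources) {
        this.cachedResources = cachedResources;
    }

    /** Called once when this instance wins the election. */
    public void onStartLeading() {
        // Full resync: all resources, not just those with pending events.
        for (String resource : cachedResources) {
            reconcile(resource);
        }
    }

    private void reconcile(String resource) {
        reconciled.add(resource); // real code would invoke the Reconciler here
    }

    public List<String> reconciled() {
        return reconciled;
    }

    public static void main(String[] args) {
        LeadershipCallback cb = new LeadershipCallback(List.of("cr-a", "cr-b"));
        cb.onStartLeading();
        System.out.println(cb.reconciled()); // [cr-a, cr-b]
    }
}
```

This is why the WARM_CACHES-style strategy only shortens cache sync time; it does not remove the need for the full reconciliation pass on takeover.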

@csviri csviri modified the milestones: 3.3, 3.2 Jun 20, 2022
@csviri csviri linked a pull request Jul 21, 2022 that will close this issue
metacosm added a commit that referenced this issue Aug 24, 2022
csviri added a commit that referenced this issue Aug 25, 2022
@csviri csviri closed this as completed Aug 26, 2022
csviri added a commit that referenced this issue Aug 30, 2022
csviri added a commit that referenced this issue Sep 5, 2022
Projects
None yet
Development


4 participants