stolonctl: implement postgres reload/restart #88

sgotti · 2015-10-30T11:46:22Z

No description provided.

sgotti · 2018-05-11T13:19:25Z

Closing since it's old and without any context.

prabhu43 · 2018-09-04T13:48:29Z

We need this feature of postgres restart to take effect of updated pgParameters like max_connections.

I went through this issue (#255) and got an idea of restarting the postgres instance of slave keepers first and then restarting the postgres instance of master. On restarting the postgres of master, sentinal will elect another healthy keeper as master

We thought of implementing this feature as follows:

Introduce a flag PgRestart in keeper status in cluster data
On executing stolonctl pgRestart, the following should happen (done by the CLI stolonctl)
a. Update Cluster Data: Set .Keepers[keeperId].Status.PgRestart: true for slave keepers
b. This would restart Postgres of all slaves with updated ClusterSpecification as in ClusterData.
c. CLI will wait for atleast one slave to be restarted and marked DBs.Healthy and Keeper.Healthy with a defined timeout
d. If atleast one slave is restarted and healthy, restart postgres of master keeper:
- Check if replication is in sync.
- ForceFail existing master Keeper. This would trigger a re-election.
- Update Cluster Data: Set .Keepers[keeperId].Status.PgRestart: true for master Keeper
  e. If none of the slave keepers are healthy (within defined timeout), exit with non-zero status.
The following should happen on Keepers:
If Status.PgRestart is true,
- Set Status.PgRestart is false,
- Restart postgres instance

This would avoid downtime as well. But there are few gotchas

There are few parameters in Postgres if changed on slave before on master, the slave wil not restart (limitations of hot_standby). For eg. decreasing max_connections. In this case, always 2.e will happen. For this, we can provide --no-wait option for the pgRestart command through master database will be restarted without waiting for slaves to be healthy.

Any thoughts on this?

sgotti · 2018-09-07T12:29:18Z

@prabhu43 I reopened the issue (it was closed and I was losing your comment).

Your proposal is basically an operator that handles a long running transaction. It requires a lot of logic to handle all the possible failures (it implements one of the possible workflows but there can be others). And as you said there're different gotchas that are difficult to know ahead of time since you should reimplement all the postgresql parameter checking logic.

For this reason I won't add a command like stolonctl pgrestart that will be a blocking command carrying all of this logic.
Perhaps this could be implemented (in go or as shell script or whatever) as a contrib script/tool outside stolonctl.

Some notes

ideally a keeper can automatically discover (and report) if an instance needs a restart querying the pg_parameters table. It's simply not yet implemented.

On restarting the postgres of master, sentinal will elect another healthy keeper as master

To be precise, restarting the master keeper doesn't imply that a new primary will be elected, if the restart is fast enough (usually) and doesn't fail due to wrong parameters the sentinel won't detect the master as failed and won't elect a new master.

aswinkarthik · 2018-09-07T16:08:52Z

if the restart is fast enough (usually) and doesn't fail due to wrong parameters the sentinel won't detect the master as failed and won't elect a new master.

@prabhu43 and myself tried exactly this in this PR #561

We just blindy restart all 3 postgres and tested it out. It restarted very fast with very minimal downtime (lesser than 1 second) but it was triggered from a CLI command stolonctl pgrestart. We could also make the changes such that stolon-keeper itself can decide if a restart is needed and it will restart pg if necessary. What do you think?

sgotti closed this as completed May 11, 2018

prabhu43 mentioned this issue Sep 7, 2018

[WIP] Add command to restart all underlying pg #561

Closed

sgotti reopened this Sep 7, 2018

prabhu43 mentioned this issue Sep 21, 2018

Restart postgres if required when updating pgParameters #568

Merged

sgotti closed this as completed in #568 Nov 7, 2018

sgotti mentioned this issue Oct 2, 2019

Concerns about stability of automatic restart of nodes feature #707

Open

Samusername mentioned this issue Feb 18, 2020

[ Question ] Rolling upgrade, how short downtime can be? How to implement? #757

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stolonctl: implement postgres reload/restart #88

stolonctl: implement postgres reload/restart #88

sgotti commented Oct 30, 2015

sgotti commented May 11, 2018

prabhu43 commented Sep 4, 2018 •

edited

Loading

sgotti commented Sep 7, 2018

aswinkarthik commented Sep 7, 2018 •

edited

Loading

stolonctl: implement postgres reload/restart #88

stolonctl: implement postgres reload/restart #88

Comments

sgotti commented Oct 30, 2015

sgotti commented May 11, 2018

prabhu43 commented Sep 4, 2018 • edited Loading

sgotti commented Sep 7, 2018

Some notes

aswinkarthik commented Sep 7, 2018 • edited Loading

prabhu43 commented Sep 4, 2018 •

edited

Loading

aswinkarthik commented Sep 7, 2018 •

edited

Loading