Apisix-dashboard: program panic when failed to initialize etcd store is unreasonable

Created on 28 Mar 2021 · 11Comments · Source: apache/apisix-dashboard

Issue description

Say my etcd cluster contains some corrupted data, which causes the manager api launching failed, the manager-api just crashes and the output is not so friendly.

Expected behavior

Output some messages to hint at this failure and exit.

Screenshots

Environment

apisix version (cmd: apisix version):
OS (cmd: uname -a):
OpenResty / Nginx version (cmd: nginx -V or openresty -V):
etcd version, if have (cmd: run curl http://127.0.0.1:9090/v1/server_info to get the info from server-info API): 3.5.0-alpha0
apisix-dashboard version, if have: 2.5
Browser version, if have:

Additional context

backend enhancement

Source

tokers

👍2

Most helpful comment

Hii!! everyone, can I work on this issue?
I have been able to reproduce the same error. After looking into it, I found that the error is thrown during store initialization & caching.

So what should be the ideal behaviour for manager-api if such a case happens?
Just put a descriptive log, ignore the current entry and continue processing the next one. How does it sound to you? Let me know what you think. Thanks :)

bisakhmondal on 1 Apr 2021

❤2

All 11 comments

that also happens when ectd is unable to connect

batman-ezio on 29 Mar 2021

that also happens when ectd is unable to connect

Yep, let it crash is not a good way here.

tokers on 29 Mar 2021

thanks for feedback

nic-chen on 30 Mar 2021

Hii!! everyone, can I work on this issue?
I have been able to reproduce the same error. After looking into it, I found that the error is thrown during store initialization & caching.

bisakhmondal on 1 Apr 2021

❤2

@bisakhmondal Sure, assigned to you.

tokers on 2 Apr 2021

❤1

hi @bisakhmondal
Here is the expected behavior mentioned in issue content:

Output some messages to hint at this failure and exit.

nic-chen on 2 Apr 2021

❤1

How is it now?

bisakhmondal on 2 Apr 2021

@bisakhmondal We don't have to panic the program, instead, we may report the error reason and exit with a non-zero code.

tokers on 3 Apr 2021

@bisakhmondal We don't have to panic the program, instead, we may report the error reason and exit with a non-zero code.

Okay, we can definitely go with it. I would like to mention an issue in this approach. We are keeping a slice of closers [ ref ] for all the allocated resources (including etcd connection), so in case of any error, for a graceful shutdown, the already allocated resource's closer method should be called.

os.Exit(1) will immediately abort the program. But that is not the case for panic. Even after panic the early evaluated defers will get executed in LIFO order. So I have put utils.CloseAll() into a defer statement before the scope of any panics.

IMHO, panic is fine here. Let me know what you think. Thanks :)

bisakhmondal on 3 Apr 2021

@bisakhmondal We don't have to panic the program, instead, we may report the error reason and exit with a non-zero code.

Okay, we can definitely go with it. I would like to mention an issue in this approach. We are keeping a slice of closers [ ref ] for all the allocated resources (including etcd connection), so in case of any error, for a graceful shutdown, the already allocated resource's closer method should be called.

os.Exit(1) will immediately abort the program. But that is not the case for panic. Even after panic the early evaluated defers will get executed in LIFO order. So I have put utils.CloseAll() into a defer statement before the scope of any panics.

IMHO, panic is fine here. Let me know what you think. Thanks :)

Yes, I agree that all finalizers or closers should be run even if exceptions occur, the reason why I think the spontaneous panic is not suitable is this is a clear and specific exception, not a programming fault.

tokers on 3 Apr 2021

👍1

Very true. Pushing the new changes then :)

bisakhmondal on 3 Apr 2021

Was this page helpful?

0 / 5 - 0 ratings