Bug: Access to https://cms-rucio-webui.cern.ch/ seems to fail. #827

Open
eachristgr opened this issue Jul 18, 2024 · 4 comments
@eachristgr
Contributor

Bug Description

Accessing https://cms-rucio-webui.cern.ch/ fails with a timeout error.

The logs of the corresponding pod point to an Apache issue:

httpd-error-log [Thu Jul 18 08:12:36.487491 2024] [mpm_event:error] [pid 7:tid 7] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.
httpd-error-log [Thu Jul 18 08:12:37.488555 2024] [mpm_event:error] [pid 7:tid 7] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.

On the other hand, https://cms-rucio-webui-int.cern.ch/ seems to work fine.

Reproduction Steps

No response

Expected Behavior

No response

Possible Solution

No response

Related Issues

No response

@eachristgr eachristgr added the bug label Jul 18, 2024
@dynamic-entropy dynamic-entropy self-assigned this Jul 18, 2024
@haozturk
Contributor

[haozturk@lxplus996 ~]$ k logs webui-rucio-ui-5544c44759-2tnpf  
Defaulted container "httpd-error-log" out of: httpd-error-log, rucio-ui
[Thu Jul 18 05:48:33.333140 2024] [mpm_event:error] [pid 7:tid 7] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.
[Thu Jul 18 05:48:34.334202 2024] [mpm_event:error] [pid 7:tid 7] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.

@dynamic-entropy
Contributor

Thanks, Christos, for reporting this.
A simple restart should fix this and reset the connections with busy clients.

Kindly let me know if it happens again, and we can update the server config to accommodate higher loads.
Could you please check whether it works for you now?


Just for my record:
Example use case for monitoring in #381
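
As a rough illustration of what such monitoring could look for, here is a minimal sketch that counts the `AH03490` lines in the text of an httpd error log. The function name and the sample text are assumptions for illustration only; this is not taken from #381 or from the deployed setup.

```python
import re

# Matches the Apache mpm_event error reported in this issue (message ID AH03490).
SCOREBOARD_FULL = re.compile(r"\[mpm_event:error\].*AH03490: scoreboard is full")


def count_scoreboard_errors(log_text: str) -> int:
    """Return how many log lines report a full scoreboard (hypothetical helper)."""
    return sum(1 for line in log_text.splitlines() if SCOREBOARD_FULL.search(line))


# Sample lines copied from the pod logs quoted in this issue.
sample = """\
[Thu Jul 18 05:48:33.333140 2024] [mpm_event:error] [pid 7:tid 7] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.
[Thu Jul 18 05:48:34.334202 2024] [mpm_event:error] [pid 7:tid 7] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.
"""

print(count_scoreboard_errors(sample))
```

A non-zero count over a recent window could be used as a signal to alert on or to restart the pod, though the threshold and the action are choices left open here.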

@eachristgr
Contributor Author

Hi @dynamic-entropy, thanks for taking this. The issue seems to be resolved; I can access https://cms-rucio-webui.cern.ch/ without any problem.

@haozturk
Contributor

It happened again:

$ k logs webui-rucio-ui-f79f5b6db-fj86l  
Defaulted container "httpd-error-log" out of: httpd-error-log, rucio-ui
[Mon Jul 29 01:15:41.144028 2024] [mpm_event:error] [pid 7:tid 7] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.
[Mon Jul 29 01:15:42.145535 2024] [mpm_event:error] [pid 7:tid 7] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.
[Mon Jul 29 01:15:43.145649 2024] [mpm_event:error] [pid 7:tid 7] AH03490: scoreboard is full, not at MaxRequestWorkers.Increase ServerLimit.

https://mattermost.web.cern.ch/cms-o-and-c/pl/d99nw33cwpbwmyodirit7sodsw

We need to revisit the server limits.
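
For reference when revisiting the limits, a small sketch of the arithmetic involved: with the event MPM, `ServerLimit` has to be at least `MaxRequestWorkers / ThreadsPerChild`, and any slots beyond that leave room for child processes that are still finishing gracefully. The numbers below are illustrative assumptions, not the values configured for this deployment, and the helper names are hypothetical.

```python
import math


def min_server_limit(max_request_workers: int, threads_per_child: int) -> int:
    """Smallest ServerLimit that can hold MaxRequestWorkers worker threads."""
    return math.ceil(max_request_workers / threads_per_child)


def spare_process_slots(server_limit: int, max_request_workers: int,
                        threads_per_child: int) -> int:
    """Process slots left over for children that are still shutting down gracefully."""
    return server_limit - min_server_limit(max_request_workers, threads_per_child)


# Illustrative values only (assumed, not read from the webui pod's httpd config).
print(spare_process_slots(server_limit=16, max_request_workers=400, threads_per_child=25))
print(spare_process_slots(server_limit=24, max_request_workers=400, threads_per_child=25))
```

With zero spare slots, lingering processes can fill the scoreboard before `MaxRequestWorkers` is reached, which is consistent with the error text above; whether that is what happens here would need to be checked against the actual config.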
