Explore using billing alerting infrastructure directly from the cloud provider #13

Open
choldgraf opened this issue Aug 31, 2022 · 0 comments

choldgraf commented Aug 31, 2022

Context

In a recent incident, some cloud infrastructure was running in the background that our Grafana dashboards did not track (because it was on an old cluster).

We have an issue to track using Grafana for cloud provider alerting (2i2c-org/infrastructure#1288). However, that approach would not have caught this problem, because the old cluster was outside of Grafana's scope.

Each cloud provider also tends to provide its own billing monitoring and alerting infrastructure. For example, you can trigger emails or warnings at certain spend levels, and you can even automatically trigger actions such as cluster shutdown.

For example:

We may also be able to automate this process. For example:

One of the biggest concerns that researchers have with the cloud is the "hidden and ballooning costs" problem, so we need to do whatever we can to reduce this uncertainty for others.

Proposal

For each of the clusters that we deploy, we should also use the cloud provider's cost management and alerting system to warn us when spending is unexpectedly high. We can define the specific rules in collaboration with Community Representatives, but they could be something like:

  • Define the expected monthly cost given the estimated number of users.
  • Set up an alert that fires when spending is 100% higher than this expected amount (that is, twice the expected cost).
  • (Optionally) define a "shut down" point that triggers if spending reaches a much higher threshold.
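
To make the rules above concrete, here is a minimal sketch of the threshold logic in Python. It is provider-agnostic and does not call any cloud billing API. All names (`BudgetRule`, `expected_cost`, `cost_per_user`, `base_cost`) and the linear cost model are assumptions for illustration only; the real rules would be configured in each provider's own budgeting tools and agreed with Community Representatives.

```python
from dataclasses import dataclass
from typing import Optional


def expected_cost(users: int, cost_per_user: float, base_cost: float = 0.0) -> float:
    """Estimate monthly cost from the expected number of users.

    Assumes a simple linear model (hypothetical): a fixed base cost
    plus a per-user cost.
    """
    return base_cost + users * cost_per_user


@dataclass(frozen=True)
class BudgetRule:
    """Spending thresholds for one cluster (hypothetical structure)."""

    expected_monthly_cost: float
    # 2.0 means "100% higher than the expected amount".
    alert_multiplier: float = 2.0
    # Optional "shut down" point; None disables it.
    shutdown_multiplier: Optional[float] = None

    def evaluate(self, month_to_date_spend: float) -> str:
        """Return "ok", "alert", or "shutdown" for the given spend."""
        if self.shutdown_multiplier is not None:
            shutdown_at = self.expected_monthly_cost * self.shutdown_multiplier
            if month_to_date_spend >= shutdown_at:
                return "shutdown"
        if month_to_date_spend >= self.expected_monthly_cost * self.alert_multiplier:
            return "alert"
        return "ok"


# Example: 100 users at 5.0 per user plus a 200.0 base cost (made-up numbers).
rule = BudgetRule(
    expected_monthly_cost=expected_cost(users=100, cost_per_user=5.0, base_cost=200.0),
    shutdown_multiplier=5.0,
)
```

With these made-up numbers the expected monthly cost is 700, so `rule.evaluate(1000)` is within budget, `rule.evaluate(1400)` reaches the alert threshold, and `rule.evaluate(3500)` reaches the optional shut-down point.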

Updates and actions

No response

Status: Needs Shaping / Refinement