Closed Bug 1309543 Opened 9 years ago Closed 9 years ago

Terminated clusters should not be visible from the UI

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P1)

defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: rvitillo, Assigned: jezdez)

References

Details

User Story

I don't think terminated clusters should be visible from the UI as they don't really give the user any useful information; I don't have a strong opinion about it though. 

If we really want to display the terminated clusters, then they should be shown after all currently running ones as the user will be more interested to see which clusters are running than which ones have been terminated.

Jannis, for how long are terminated clusters visible in the UI?

Attachments

(1 file)

No description provided.
Flags: needinfo?(jezdez)
Makes sense, this is somewhat related to bug 1309536. It doesn't make sense to show terminated clusters in the list other than for archive reasons. So I suggest to fix this by doing two things: 1) Order clusters by date and show active clusters first 2) Add a user facing feature to archive clusters that hides them from the users (but doesn't delete it) For 2) we'd need a new ticket if you agree
Flags: needinfo?(jezdez)
(In reply to Jannis Leidel [:jezdez] from comment #1) > Makes sense, this is somewhat related to bug 1309536. It doesn't make sense > to show terminated clusters in the list other than for archive reasons. So I > suggest to fix this by doing two things: > > 1) Order clusters by date and show active clusters first > 2) Add a user facing feature to archive clusters that hides them from the > users (but doesn't delete it) > > For 2) we'd need a new ticket if you agree What about auto-archiving terminated clusters and have a separate page with the history of all the spawned clusters?
(In reply to Roberto Agostino Vitillo (:rvitillo) from comment #2) > (In reply to Jannis Leidel [:jezdez] from comment #1) > > Makes sense, this is somewhat related to bug 1309536. It doesn't make sense > > to show terminated clusters in the list other than for archive reasons. So I > > suggest to fix this by doing two things: > > > > 1) Order clusters by date and show active clusters first > > 2) Add a user facing feature to archive clusters that hides them from the > > users (but doesn't delete it) > > > > For 2) we'd need a new ticket if you agree > > What about auto-archiving terminated clusters and have a separate page with > the history of all the spawned clusters? Sure sounds equally good. Would that be a per-user-archive? Or something only admins can see? The latter case can be handled by Django's admin in which we can add a list filter for terminated clusters.
(In reply to Jannis Leidel [:jezdez] from comment #3) > > What about auto-archiving terminated clusters and have a separate page with > > the history of all the spawned clusters? > > Sure sounds equally good. Would that be a per-user-archive? Or something > only admins can see? > > The latter case can be handled by Django's admin in which we can add a list > filter for terminated clusters. I think it makes sense to have a per-user-archive if in the cluster summary page there is a link to the logs of the cluster (Bug 1307528). That would come in handy if users want to report failures to us.
(In reply to Roberto Agostino Vitillo (:rvitillo) from comment #4) > (In reply to Jannis Leidel [:jezdez] from comment #3) > > > > What about auto-archiving terminated clusters and have a separate page with > > > the history of all the spawned clusters? > > > > Sure sounds equally good. Would that be a per-user-archive? Or something > > only admins can see? > > > > The latter case can be handled by Django's admin in which we can add a list > > filter for terminated clusters. > > I think it makes sense to have a per-user-archive if in the cluster summary > page there is a link to the logs of the cluster (Bug 1307528). That would > come in handy if users want to report failures to us. Excellent
Assignee: nobody → jezdez
Status: NEW → ASSIGNED
Well now that we've got terminated clusters being kept around, do we want to also have their logs available to see from the web ui?
Points: --- → 1
Priority: -- → P1
(In reply to Frank Bertsch [:frank] from comment #6) > Well now that we've got terminated clusters being kept around, do we want to > also have their logs available to see from the web ui? Ideally yes.
(In reply to Roberto Agostino Vitillo (:rvitillo) from comment #7) > (In reply to Frank Bertsch [:frank] from comment #6) > > Well now that we've got terminated clusters being kept around, do we want to > > also have their logs available to see from the web ui? > > Ideally yes. Should the logs automatically be fetch by the cluster detail page, or on request only? How big are cluster logs generally?
Blocks: 1310157
I've filed bug 1310157 to track the log showing feature
Blocks: 1309227
(In reply to Jannis Leidel [:jezdez] from comment #8) > (In reply to Roberto Agostino Vitillo (:rvitillo) from comment #7) > > (In reply to Frank Bertsch [:frank] from comment #6) > > > Well now that we've got terminated clusters being kept around, do we want to > > > also have their logs available to see from the web ui? > > > > Ideally yes. > > Should the logs automatically be fetch by the cluster detail page, or on > request only? How big are cluster logs generally? That really depends on the job. I don't think we necessarily have to retrieve the logs but we should to the very least have a pointer to them so that users can easily retrieve them if needed, as we did in atmo v1.
s/as we did in atmo v1/as we do with airflow [1] [1] https://github.com/mozilla/telemetry-airflow/blob/master/dags/operators/emr_spark_operator.py#L136 With this approach users without AWS credentials cannot access the logs. That said, pointers to logs for ad-hoc jobs aren't that useful for users as they can inspect logs directly on their clusters. Links to the logs are useful for the Data Platform team for post-mortem debugging sessions though.
You can find an example of all the logs generated for an EMR cluster here: s3://telemetry-airflow/logs/mdoglio@mozilla.com/Crash Aggregate View/j-15GLG06TV5GQ4
Closing this bug as fixed since we hide terminated clusters by default, let's use bug 1310157 for further discussion about the log files
Status: ASSIGNED → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
Product: Cloud Services → Cloud Services Graveyard
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: