Kibana: [Infra UI] Limit "group by" choices for Metrics Explorer UI based on metric selection

Created on 25 Jun 2019 · 18Comments · Source: elastic/kibana

Describe UX challenge:
In the Metrics Explorer, we auto-populate all available fields in the "group per" control. However, depending on the metric selection, only a small subset of fields will be relevant.

Describe recommended change in experience:
Only populate relevant selections for "group per". For Metricbeat, for instance, these will include fields that are captured for all beats and fields specific to that metricset. However, we likely will need to arrive at determining what fields are relevant dynamically because of how some modules work, and the fact that users have options to enable different processors when the use Metricbeat.

Options:

Perform an "exists" query within the given time range for the selected metric fields, grab a small number of documents from this set (5?), then de-duplicate the fields from these documents to create a list of suggested group-by fields
Hard-code a list of commonly grouped by ECS fields that we always show, along with any of the top-level module fieldsets included in the selected metrics (e.g. filtering on system.load.1 would result in all of the system.* fields being included. (This option could eventually get smarter and also include recently used fields per user, etc.)

Both options would have a "show all" escape hatch built in that would allow a user to get the experience that happens today.

Screenshots below show the current behavior we are trying to improve.

Screen Shot 2019-06-25 at 11 56 43 AM

Uploading Screen Shot 2019-06-25 at 11.56.49 AM.png…

Metrics UI logs-metrics-ui

Source

tbragin

Most helpful comment

That sounds correct @jasonrhodes, but once you have decided on a metric, the fields available for grouping should be pretty stable. Reason for this is that normally metrics always have the same set of labels, just different values on them.

Having the rest of fields available somehow could make sense, as you point out, as we may miss some fields with this trick. The only situation I can think of where this could happen is with docker/kubernetes labels. These are free form, so you can expect different set of labels depending on the service being monitored.

That said, if you check a good amount of documents in the last couple of minutes you should probably get most of the available fields.

exekias on 26 Jun 2019

👍2

All 18 comments

Pinging @elastic/infra-logs-ui

elasticmachine on 25 Jun 2019

Possibly related to more general "discuss PR" @simianhacker started, but limiting the discussion only to the "group per" selection: https://github.com/elastic/kibana/pull/36843

tbragin on 25 Jun 2019

@roncohen @exekias correct me if I'm wrong, but I think this work will have the same issues as the general metric field we want to limit, because even once we know which fields the user has selected in the metrics, we won't easily know the full set of possible "group by" fields that would correspond there, right?

I wonder if another first iteration approach is to do the best we can and move those fields to the top of the drop down, and then leave the rest below, in a sort of "Suggested Fields" / "All Fields" split…

jasonrhodes on 26 Jun 2019

That said, if you check a good amount of documents in the last couple of minutes you should probably get most of the available fields.

exekias on 26 Jun 2019

👍2

@exekias what's the status on getting the timeseries.instance into metricbeat documents by default? we could use that to make sure we get one of each

roncohen on 26 Jun 2019

It went in already as experimental, should be in 7.2, more details here: https://github.com/elastic/beats/pull/10293

It needs to be enabled with timeseries.enabled: true. Rationale for that: let's play with it and make sure it's what we need in the UI side before hard-coding it as a mandatory field.

exekias on 26 Jun 2019

Great, so my thought for an approach here is this:

Take the current list of selected metrics (e.g. system.load.15, system.load.1) and do an "exists" query on those fields, for documents in the currently selected time range (maybe with something likee "top_hits" so you get a few example docs?)
From the sample of docs, grab all of the fields
Use that list of fields to populate the "group_by" drop down

Does that sound like a sane first approach?

jasonrhodes on 26 Jun 2019

👍1

more or less, just making sure:

From the sample of docs, grab all of the fields

when the user (or we, on load) have selected a metric, then we need to do another query to get a sample of documents that have that specific field. We can't use the same sample of docs that we used to determine the "on-load" metric.

roncohen on 26 Jun 2019

SGTM

exekias on 26 Jun 2019

when the user (or we, on load) have selected a metric, then we need to do another query to get a sample of documents that have that specific field.

@roncohen yes, that's the first bullet point in the above approach, I think? If I understand correctly. :) The user visits the page and has a list of metrics to choose from (how to make that experience better is handled in another ticket). As they select metrics, we do an "exists" query for those fields to get a sample of docs that have those fields, then use the other fields in those docs to drive the group by dropdown.

jasonrhodes on 26 Jun 2019

👍1

Another solution suggested by @skh:

Have a list of fields that are _always_ available to group by (ECS fields we expect in all documents -- do these exist?), then add all other fields that start with the same top level key, e.g. for server.load.1 we show all of the system.**.* fields.

jasonrhodes on 26 Jun 2019

That could be another option, just take into account that ECS has a ton of fields in it, and most of them will have no data. So while the situation would improve a bit, we would still have many fields that make no sense there.

I would say the initial proposed solution would provide the right field set in most cases. Perhaps a Show all button in the drop-down would cover the other 1%

exekias on 27 Jun 2019

👍1

++ on first trying the approach described by Jason here, and having a "show all" button as a fallback. we can evaluate and see. if we notice we are missing too many fields, we can go to the approach Sonja suggested. How does that sound?

tbragin on 27 Jun 2019

OK. I suggest we make sure to add telemetry to track if the "show all" is ever used

roncohen on 27 Jun 2019

👍1

@simianhacker do you mind making a new investigation issue for looking into the suggestion from @weltenwort about scripted metrics? Then we can be free to move forward with the above agreed approach from this discussion here and if you find a better way, we can move forward with that.

jasonrhodes on 8 Jul 2019

This should be included in the meta issue that @simianhacker is looking into re: field availability

jasonrhodes on 23 Jul 2019

I just added a brief summary of the two options we've discussed to this ticket's description. We should choose one and move forward.

jasonrhodes on 5 Aug 2019

I would do a combination of the two options. If the metric field is prefixed with kubernetes, docker or prometheus I would include all the string fields from those modules along with the sample fields from the last 5 documents.