Closed Bug 1255543 Opened 10 years ago Closed 8 years ago

Create longitudinal dataset for Telemetry Experiments

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P5)


Tracking

(Not tracked)

RESOLVED INCOMPLETE

People

(Reporter: mreid, Unassigned)

References

Details

We should create a dataset that contains all submissions running one or more Telemetry Experiments. This would involve:
- Adding a Heka field that flags records that are running an experiment (a rough sketch of the flagging check is below)
- Sending these records to a separate location in S3
- A periodic job for reorganizing this data into a longitudinal structure using the same code as bug 1242039
- Making this data available in all the usual ways (Spark, Presto, etc.)
This might need to be broken into sub-bugs.
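As a rough illustration of the flagging check (a sketch only; the real implementation would live in the Heka pipeline, and the field path assumes the main-ping environment layout with addons.activeExperiment):

    def active_experiment_id(ping):
        """Return the active experiment id from a decoded main ping, or None."""
        addons = ping.get("environment", {}).get("addons", {})
        experiment = addons.get("activeExperiment") or {}
        return experiment.get("id")

    # A record would be flagged (and routed to the experiments S3
    # location) whenever this returns a non-None id.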
Points: --- → 3
Priority: -- → P3
jjensen, mreid, and rvitillo are proposing a dataset to support analysis of telemetry experiments. Let us know if there are any other requirements for search test experiments that are not covered in https://bugzilla.mozilla.org/show_bug.cgi?id=1258529. (If there are other search test experiments spec'd out, it would be good to know about them.) Un-prioritizing to retriage, as we need this for search.
Flags: needinfo?(jjensen)
Priority: P3 → --
I will punt this back a little bit, but from what I understand from this bug and bug 1258529, I think we're OK.

Simply put, we need longitudinal data for profiles in the experiment, where "in the experiment" also means "in the control group". By "longitudinal data", I'm typically referring to usage data:
- # sessions
- session duration
- # searches, by SAP
- # crashes, by type
- etc.

In short, many/most of the "non-Telemetry" pings in the UT payloads. If this approach provides this, then I think we are OK. To be safe I'm going to needinfo my talented colleague Sam Penrose about this as well.
Flags: needinfo?(jjensen) → needinfo?(spenrose)
(In reply to John Jensen from comment #3)
> I will punt this back a little bit, but from what I understand from this bug
> and bug 1258529, I think we're OK.
>
> Simply put, we need longitudinal data for profiles in the experiment, where
> "in the experiment" also means "in the control group". By "longitudinal
> data", I'm typically referring to usage data:
> - # sessions
> - session duration
> - # searches, by SAP
> - # crashes, by type
> - etc.
>
> In short, many/most of the "non-Telemetry" pings in the UT payloads. If this
> approach provides this, then I think we are OK. To be safe I'm going to
> needinfo my talented colleague Sam Penrose about this as well.

This description sounds like enough to get started on. It does not look precise enough to vet a solution, even allowing for shifts in which fields are included. What does the analysis process look like, for example? Are the people who will do the analysis trained on a toolchain that can answer their questions with acceptable latency? (Has that toolchain been chosen?) I have requested permission to access the problem statement document in the other bug, which may answer those questions.
Flags: needinfo?(spenrose)
> This description sounds like enough to get started on. It does not look
> precise enough to vet a solution, even allowing for shifts in which fields
> are included.

As suggested in Comment 1, the proposed solution is to use the same job that generates the longitudinal dataset to generate the per-experiment datasets. That will allow the datasets to be queried with SQL as well.

> What does the analysis process look like, for example?

See [1] for an example analysis of an A/B experiment.

> Are the people who will do the analysis trained on a toolchain that can
> answer their questions with acceptable latency? (Has that toolchain been
> chosen?)

I can't speak for everyone else, but I am fairly confident that our engineers can use, or learn how to use, our toolchain with acceptable latency.

[1] https://github.com/vitillo/e10s_analyses
Assignee: nobody → mreid
Priority: -- → P1
(In reply to Mark Reid [:mreid] from comment #0)
> - Adding a Heka field that flags records that are running an experiment

See https://github.com/mozilla-services/data-pipeline/pull/200

r? trink
Flags: needinfo?(mtrinkala)
(In reply to Mark Reid [:mreid] from comment #0)
> - Sending these records to a separate location in S3

See https://github.com/mozilla-services/puppet-config/pull/1917

r? whd
Flags: needinfo?(whd)
For the separate location in S3, I propose to store data using the following dimensions (illustrated below):
- submissionDate
- docType
- activeExperimentId
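For illustration, these dimensions could compose an S3 key prefix like this (a hypothetical helper; the dimension ordering and path format are assumptions, not the actual production layout):

    def experiment_prefix(submission_date, doc_type, experiment_id):
        # e.g. "20160601/main/some-experiment@mozilla.org/"
        return "{}/{}/{}/".format(submission_date, doc_type, experiment_id)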
PR 200 merged.
Flags: needinfo?(mtrinkala)
Puppet PR 1917 has also been merged. Thanks!
Flags: needinfo?(whd)
Remaining work:
- A periodic job for reorganizing this data into a longitudinal structure using the same code as bug 1242039 (a rough sketch of that structure is below)
- Making this data available in all the usual ways (Spark, Presto, etc.)

Roberto, does your team have capacity for this?
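Here "longitudinal structure" means roughly one row per client, with that client's submissions collected in date order. A minimal PySpark sketch, assuming a hypothetical records_df DataFrame of experiment submissions with clientId, submissionDate, and payload columns:

    from pyspark.sql import functions as F

    longitudinal = (records_df
        .groupBy("clientId")
        .agg(F.sort_array(F.collect_list(F.struct("submissionDate", "payload")))
              .alias("submissions")))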
Flags: needinfo?(rvitillo)
(In reply to Mark Reid [:mreid] from comment #11)
> Roberto, does your team have capacity for this?

This is probably not going to happen this quarter. The data in its current form should be accessible through Spark though (get_records), right?
Flags: needinfo?(rvitillo)
Yes, the data should be accessible via get_records in Spark. Just use "telemetry-experiments" as the name of the data source. The fields available for filtering the data are listed in Comment 8.
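For example, a minimal sketch (the import path and filter keyword spellings are assumptions; the filters mirror the dimensions from Comment 8):

    from moztelemetry import get_records  # import path may differ

    records = get_records(sc, "telemetry-experiments",
                          submissionDate="20160601",
                          docType="main",
                          activeExperimentId="some-experiment@mozilla.org")
    print(records.count())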
Assignee: mreid → nobody
Priority: P1 → --
Priority: -- → P3
Due to e10s not needing this, we'll accept a patch, but it is not being actively worked on.
Priority: P3 → P5
Closing abandoned bugs in this product per https://bugzilla.mozilla.org/show_bug.cgi?id=1337972
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → INCOMPLETE
Product: Cloud Services → Cloud Services Graveyard