Closed Bug 1357875 Opened 8 years ago Closed 8 years ago

Add `topline_dashboard` to python_etl

Categories

(Data Platform and Tools :: General, enhancement, P1)

enhancement
Points:
2

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: amiyaguchi, Assigned: amiyaguchi)

References

Details

Attachments

(2 files)

`topline_dashboard` is reformats the Topline Summary view to accommodate the topline/executive report. This does the following things: 1. Marginalize the dataset to a limited set of countries and Other/ROW 2. Compute `ALL` rows 3. Collect and upload the csv to the dashboard view
Assignee: nobody → amiyaguchi
Blocks: 1309574
Status: NEW → ASSIGNED
Points: --- → 1
Priority: -- → P1
Blocks: 1329844
No longer blocks: 1309574
This script will replace both run.sh and v4_reformat.py in the original reporting pipeline. The general approach will be to take the union of historical data and reformatted topline_summary data. For reference, run.sh appends this week/month's data to the end the csv file. I plan to put 'v4-monthly.csv' and 'v4-weekly.csv' in a new, read-only location, such as `net-mozaws-prod-us-west-2-pipeline-analysis/topline/historical`. Are there any restrictions on the raw data that would prevent it from being put here? Alternatively, this data could live next to the new data under a v0 tag if it were imported to telemetry-parquet.
Flags: needinfo?(mreid)
Component: Metrics: Pipeline → Datasets: General
Product: Cloud Services → Data Platform and Tools
There's nothing in the data that would prevent it from being put into the 'analysis' bucket you mentioned.
Flags: needinfo?(mreid)
Points: 1 → 2
This has been merged into python_mozetl.
Status: ASSIGNED → RESOLVED
Closed: 8 years ago
Resolution: --- → FIXED
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
The `file://` protocol doesn't work on EMR, due to permission issues. The workaround it to collect the dataframe and write directly to disk. For some unknown reason, this causes tests to fail, despite functionally being the same thing. In practice, the above patch sucessfully collects the dataframe and uploads it.
Status: REOPENED → RESOLVED
Closed: 8 years ago8 years ago
Resolution: --- → FIXED
Component: Datasets: General → General
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: