Closed
Bug 1321316
Opened 9 years ago
Closed 9 years ago
Backfill: Reprocess 1 month of main_summary to add engagement scalars
Categories
(Cloud Services Graveyard :: Metrics: Pipeline, defect, P1)
Cloud Services Graveyard
Metrics: Pipeline
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: mreid, Assigned: mreid)
References
Details
Target time period:
20161029 to 20161129
Assignee | ||
Updated•9 years ago
|
Comment 1•9 years ago
|
||
Can we make this 20161001 to 20161129 (since last 15 days is always wonky)
Comment 2•9 years ago
|
||
Can we make this 20161001 to 20161129 (since last 15 days is always wonky)
Assignee | ||
Comment 3•9 years ago
|
||
I'm just about done with 20161029 to 20161129 (part way though swapping in the updated files for 20161128). Please take a look at that period first, and if it looks like we'll need more data I will backfill further.
Note that if you are loading this data in Spark, you may need to set the 'mergeSchema' flag per:
https://spark.apache.org/docs/latest/api/python/pyspark.sql.html#pyspark.sql.DataFrameReader.parquet
I will update again when this period is completely finished, should be within the hour.
Assignee | ||
Comment 4•9 years ago
|
||
Ok, import of the original period is done.
Saptarshi, please take a look and let me know if you'll need more backfill.
Flags: needinfo?(sguha)
Assignee | ||
Comment 5•9 years ago
|
||
Calling this done for now. Please re-open if we need more backfill.
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
Updated•7 years ago
|
Product: Cloud Services → Cloud Services Graveyard
Assignee | ||
Updated•7 years ago
|
Flags: needinfo?(sguha)
You need to log in
before you can comment on or make changes to this bug.
Description
•