Closed Bug 1388116 Opened 8 years ago Closed 6 years ago

Prototype main_summary in spark-streaming

Categories

(Data Platform and Tools :: General, enhancement, P3)

Points:
3

Tracking

(Not tracked)

RESOLVED WONTFIX

People

(Reporter: frank, Unassigned)

References

Details

(Whiteboard: [DataPlatform] )

There has been a lot of talk about writing the main ping directly to Parquet. With main_summary having all histograms (eventually) and scalars, that may be unnecessary: we can use the existing main_summary code as-is to implement a real-time Parquet main_summary. Replacing the current batch job with this would let us run all the dependent jobs 4+ hours earlier and would further reduce latency for some other analyses.
Idea: test this out first with experiments_summary, using the telemetry-cohorts source.
Points: --- → 3
Priority: -- → P2
Assignee: nobody → amiyaguchi
See Also: → 1412798
Assignee: amiyaguchi → dthorn
Whiteboard: [DataPlatform]
Priority: P2 → P1
Priority: P1 → P2
Priority: P2 → P3
Assignee: dthorn → nobody
Status: NEW → RESOLVED
Closed: 6 years ago
Resolution: --- → WONTFIX
Component: Datasets: Main Summary → General