Closed
Bug 1306049
Opened 9 years ago
Closed 8 years ago
Sanitize arguments to "get_pings" before applying filtering
Categories
(Data Platform and Tools :: General, defect, P3)
Data Platform and Tools
General
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: mreid, Assigned: mreid)
References
Details
String arguments to "get_pings" should be sanitized before being used to filter S3 object names.
For example, filtering on docType = "saved-session" should actually match "saved_session".
Path components are sanitized[1] when raw data is stored on S3, so anything with a hyphen etc will never match.
This may not be needed in the Dataset API (though might save some surprises), but should at least be included in 'get_pings'.
[1] https://github.com/mozilla-services/data-pipeline/blob/master/heka/plugins/s3splitfile/s3splitfile_common.go#L167
Assignee | ||
Comment 1•9 years ago
|
||
Alternatively, we could modify Dataset to list prefixes available at each level in the tree to make it easy to discover what values can be used for filtering.
Updated•9 years ago
|
Points: --- → 1
Priority: -- → P3
Updated•8 years ago
|
Component: Metrics: Pipeline → Telemetry APIs for Analysis
Product: Cloud Services → Data Platform and Tools
Assignee | ||
Updated•8 years ago
|
Assignee: nobody → mreid
Assignee | ||
Comment 2•8 years ago
|
||
Status: NEW → RESOLVED
Closed: 8 years ago
Resolution: --- → FIXED
Updated•3 years ago
|
Component: Telemetry APIs for Analysis → General
You need to log in
before you can comment on or make changes to this bug.
Description
•