One way to manage your data ingest is to set up data dropping rules. With data dropping you can:
- Filter out unimportant low-value data
- Filter out potentially sensitive data
Overview
With data dropping rules, you can specify which types of data you don't want saved to your New Relic organization.
Dropped data does not count towards your data ingest and so is not billable. To learn more about what data counts as billable or not, see Data ingest.
Drop rules only apply to data that arrives from the moment you create the rule. They don't delete data that's already been ingested.
Besides creating drop-data rules, other ways to minimize unwanted data include:
- If you want to drop APM metric timeslice data, you can use metric normalization rules.
- If you're reporting logs, you can drop log data via the UI.
- If you're using Prometheus remote write, see Drop Prometheus remote write data.
Requirements
The ability to create and edit drop filter rules is linked to the NRQL drop rules capability.
The following data types can be targeted for data dropping:
- APM-reported events
- Browser-reported events
- Mobile-reported events
- Synthetics-reported events
- Custom events (like those generated by the APM agent APIs or the Event API)
- Log data (you can also use the UI to drop data)
- Distributed tracing spans
- Default infrastructure monitoring events and infrastructure integrations events. Some caveats:
  - When you drop this data, the raw data is dropped, but the aggregated SystemSample, ProcessSample, NetworkSample, and StorageSample events are still available (for more on this, see Data retention). Though still available, this data doesn't count towards ingest and is not billable.
  - Raw infrastructure data is used for alerting, so if you drop that data, you can't alert on it. Because the aggregated data is still available, you may still see that data in charts with time ranges above 59 minutes.
- Dimensional metrics (the Metric data type). Some caveats:
  - For organizations on our original pricing model: billing is based on product subscription, meaning dropped dimensional metrics remain billable.
  - For metrics generated by the events-to-metrics service: drop rules won't work, but these metrics can be stopped or attributes pruned by disabling or re-configuring the events-to-metrics rule.
Create a drop data rule
Caution
Use caution when deciding to drop data. The data you drop can't be recovered. For more details on potential issues, see Caution notes.
To drop data, create a NerdGraph-format drop rule that includes:
- A NRQL string that specifies what data types to drop
- An action type specifying how to apply the NRQL string
You can form and make the call in the NerdGraph API explorer: one.newrelic.com > Apps > NerdGraph API explorer.
The NRQL query in a drop rule is limited to 4096 characters. If the query exceeds that limit, NerdGraph returns an INVALID_NRQL_TOO_LONG error.
There are two ways to drop data:
- Drop entire data types or a data subset (with optional filter). This uses the DROP_DATA action type and uses NRQL of the form:
  SELECT * FROM DATA_TYPE_1, DATA_TYPE_2 (WHERE OPTIONAL_FILTER)
  For this type of drop rule, you cannot use anything other than * in the SELECT clause.
- Drop attributes from data types (with optional filter). This uses the DROP_ATTRIBUTES action type and uses NRQL of the form:
  SELECT dropAttr1, dropAttr2 FROM DATA_TYPE (WHERE OPTIONAL_FILTER)
  For this type of drop rule, you must pass in a non-empty list of raw attribute names.
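As an illustration of the DROP_DATA form, here is a minimal sketch of a create request, modeled on the nrqlDropRulesCreate mutation shown later in this document. The event type MyCustomEvent and the attributes appName and environment are hypothetical placeholders, not names from a real account; substitute your own.

```graphql
mutation {
  nrqlDropRulesCreate(
    accountId: YOUR_ACCOUNT_ID
    rules: [
      {
        action: DROP_DATA
        # Hypothetical event type and attributes, for illustration only
        nrql: "SELECT * FROM MyCustomEvent WHERE appName = 'LoadGeneratingApp' AND environment = 'development'"
        description: "Drops MyCustomEvent data reported by the development load generator."
      }
    ]
  ) {
    successes { id }
    failures {
      submitted { nrql }
      error { reason description }
    }
  }
}
```

Requesting both successes and failures in the response makes it easier to see whether the rule registered or was rejected, and why.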
NRQL restrictions
Not all NRQL clauses make sense for generating drop rules. You can provide a WHERE clause to select data with specific attributes. Other features such as LIMIT, TIMESERIES, COMPARE WITH, FACET, and other clauses cannot be used.
SINCE and UNTIL are not supported in drop rules. If you have time-specific rules (say, drop everything until a time in the future), use WHERE timestamp < (epoch milliseconds in the future). You also can't use SINCE to drop historical data: NRQL drop rules only apply to data reported after the drop rule was created. If you need to delete data that has already been reported, contact your New Relic representative.
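To make the timestamp workaround concrete, the following sketch registers a rule that stops matching once a fixed moment has passed. The event type MyCustomEvent is a hypothetical placeholder, and 1893456000000 is simply the epoch-millisecond value for 2030-01-01 00:00:00 UTC, chosen for illustration.

```graphql
mutation {
  nrqlDropRulesCreate(
    accountId: YOUR_ACCOUNT_ID
    rules: [
      {
        action: DROP_DATA
        # No SINCE/UNTIL here: the time bound is expressed as a WHERE filter.
        # 1893456000000 = 2030-01-01T00:00:00Z in epoch milliseconds (example value)
        nrql: "SELECT * FROM MyCustomEvent WHERE timestamp < 1893456000000"
        description: "Drops MyCustomEvent data timestamped before 2030-01-01 UTC."
      }
    ]
  ) {
    successes { id }
    failures {
      submitted { nrql }
      error { reason description }
    }
  }
}
```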
JOIN and subqueries are also not supported. Drop rules are applied to each data point independently, and other data cannot be queried to determine whether a drop rule should be applied.
The two action types have these restrictions:
- DROP_DATA can use only SELECT *.
- DROP_ATTRIBUTES requires use of SELECT with "raw" attributes (attributes with no aggregator function applied). This also means you cannot use SELECT *. Additionally, there are some attributes that are integral to their data type and cannot be dropped (such as timestamp on event data). If you include them, registration will fail.
Example drop rules
Here are some example drop rules:
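The sketch below registers two rules in one request, since the rules argument is a list. The first rule uses DROP_DATA across two data types; the second uses DROP_ATTRIBUTES. The app name internal-test-app, the event type MyCustomEvent, and the attributes userEmail and userName are hypothetical and only serve to show the shape of each rule.

```graphql
mutation {
  nrqlDropRulesCreate(
    accountId: YOUR_ACCOUNT_ID
    rules: [
      {
        action: DROP_DATA
        # DROP_DATA: SELECT * only, one or more data types, optional WHERE
        nrql: "SELECT * FROM PageView, PageAction WHERE appName = 'internal-test-app'"
        description: "Drops browser events reported by the internal test app."
      }
      {
        action: DROP_ATTRIBUTES
        # DROP_ATTRIBUTES: raw attribute names only, no SELECT * and no aggregator functions
        nrql: "SELECT userEmail, userName FROM MyCustomEvent"
        description: "Removes potentially sensitive attributes from MyCustomEvent."
      }
    ]
  ) {
    successes { id }
    failures {
      submitted { nrql }
      error { reason description }
    }
  }
}
```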
Verify your drop rule works
After you create a drop rule, verify that it is working as expected. The rule should take effect quickly after a successful registration, so try running a TIMESERIES version of the query you registered to see that the data drops off.
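Drop rule type | NRQL |
---|---|
DROP_DATA | Drop rule NRQL: SELECT * FROM DATA_TYPE WHERE OPTIONAL_FILTER; validation NRQL: SELECT count(*) FROM DATA_TYPE WHERE OPTIONAL_FILTER TIMESERIES. This should drop to 0. To verify that it did not affect anything else, invert the WHERE clause. |
DROP_ATTRIBUTES | Drop rule NRQL: SELECT dropAttr1, dropAttr2 FROM DATA_TYPE WHERE OPTIONAL_FILTER; validation NRQL: SELECT count(dropAttr1), count(dropAttr2) FROM DATA_TYPE WHERE OPTIONAL_FILTER TIMESERIES. Both lines should drop to 0. To verify that it did not affect events that contained these attributes and still should, invert the WHERE clause. |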
View rules
Here is an example NerdGraph call that returns the drop rules set on an account:
{
  actor {
    account(id: YOUR_ACCOUNT_ID) {
      nrqlDropRules {
        list {
          rules {
            id
            nrql
            accountId
            action
            createdBy
            createdAt
            description
          }
          error {
            reason
            description
          }
        }
      }
    }
  }
}
Delete drop rules
Here is an example NerdGraph call deleting two specific drop rules:
mutation {
  nrqlDropRulesDelete(accountId: YOUR_ACCOUNT_ID, ruleIds: ["48", "98"]) {
    successes {
      id
      nrql
      accountId
      action
      description
    }
    failures {
      error {
        reason
        description
      }
      submitted {
        ruleId
        accountId
      }
    }
  }
}
Audit drop rule history
To see who created and deleted drop rules, query your account audit logs. The list endpoint also includes the user ID of the person who created the rule.
Cautions when dropping data
When creating drop rules, you are responsible for ensuring that the rules accurately identify and discard the data that meets the conditions that you have established. You are also responsible for monitoring the rule, as well as the data you disclose to New Relic.
New Relic cannot guarantee that this functionality will completely resolve data disclosure concerns you may have. New Relic does not review or monitor how effective the rules you develop are.
Creating rules about sensitive data can leak information about what kinds of data you maintain, including the format of your data or systems (for example, through referencing email addresses or specific credit card numbers). Rules you create, including all information in those rules, can be viewed and edited by any user with the relevant role-based access control permissions.
Only new data will be dropped. Existing data cannot be edited or deleted.
Drop attributes on dimensional metric rollups
Dimensional metrics aggregate metrics into rollups for long term storage and as a way to optimize longer term queries. Metric cardinality limits are applied to this data.
You can use this feature to decide which attributes you don't need for long term storage and query, but would like to maintain for real time queries.
For example, adding containerId as an attribute can be useful for live troubleshooting or recent analysis, but may not be needed when querying over longer periods of time for larger trends. Because an attribute like containerId can have so many unique values, it can quickly drive you towards your metric cardinality limits. When a limit is hit, the synthesis of rollups stops for the remainder of that UTC day.
This feature also lets you keep high-cardinality attributes on the raw data and drop them from rollups, which gives you more control over how quickly you approach your cardinality limits.
Usage
Drop attributes from dimensional metrics rollups (with optional filter). This uses the DROP_ATTRIBUTES_FROM_METRIC_AGGREGATES action type and uses NRQL of the form:
SELECT dropAttr1, dropAttr2 FROM Metric (WHERE OPTIONAL_FILTER)
Here is an example NerdGraph request:
mutation {
  nrqlDropRulesCreate(
    accountId: YOUR_ACCOUNT_ID
    rules: [
      {
        action: DROP_ATTRIBUTES_FROM_METRIC_AGGREGATES
        nrql: "SELECT containerId FROM Metric WHERE metricName = 'some.metric'"
        description: "Removes the containerId from long term queries."
      }
    ]
  ) {
    successes {
      id
    }
    failures {
      submitted {
        nrql
      }
      error {
        reason
        description
      }
    }
  }
}
To verify it's working, wait 3 to 5 minutes for the rule to be picked up and for aggregate data to be generated. Then, assuming the example NRQL above is your drop rule, run the following queries:
SELECT count(containerId) FROM Metric WHERE metricName = 'some.metric' TIMESERIES SINCE 2 hours ago
SELECT count(containerId) FROM Metric WHERE metricName = 'some.metric' TIMESERIES SINCE 2 hours ago RAW
The first query retrieves metric rollups and should drop to 0 since containerId has been dropped per the new drop rule. The second query retrieves metric raws using the RAW keyword and should continue to hold steady since raw data is not impacted by the new drop rule. For more information on how to see the impact this will have on your cardinality, check out Understand and query high cardinality metrics.
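Both checks can also be sent in a single NerdGraph request. The sketch below assumes the nrql field on the account object and uses GraphQL aliases (rollups and raws, names chosen here for illustration) so the same field can be requested twice with different queries.

```graphql
{
  actor {
    account(id: YOUR_ACCOUNT_ID) {
      # Rollup data: expected to fall to 0 once the rule is active
      rollups: nrql(query: "SELECT count(containerId) FROM Metric WHERE metricName = 'some.metric' TIMESERIES SINCE 2 hours ago") {
        results
      }
      # Raw data: expected to hold steady, since raw data is not affected
      raws: nrql(query: "SELECT count(containerId) FROM Metric WHERE metricName = 'some.metric' TIMESERIES SINCE 2 hours ago RAW") {
        results
      }
    }
  }
}
```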
Restrictions
All restrictions that apply to DROP_ATTRIBUTES apply to DROP_ATTRIBUTES_FROM_METRIC_AGGREGATES, with the additional restriction that you can only target the Metric data type. These rules also do not work on Metric queries targeting data created by an events-to-metrics rule or on Metric queries targeting timeslice data.
Learn more
Recommendations for learning more:
- NerdGraph basics and terminology
- NRQL basics
- Browse the Support Forum for community discussions about NRQL drop rules.
- For a deep dive into managing data ingest for a complex organization, see Data ingest governance.