Adguardfilters: Consider using DuckDuckGo tracker radar data sets to include new trackers to AdGuard filters

Created on 1 Jul 2020  ·  8 Comments  ·  Source: AdguardTeam/AdguardFilters

Problem Description

Some trackers are not in the AdGuard lists, but they do appear in the DuckDuckGo Tracker Radar data sets.

Proposed Solution

Check the DuckDuckGo Tracker Radar data sets for entries that are not included in the AdGuard tracking protection filter.

Alternatives Considered

Additional Information

The DDG data sets ( https://github.com/duckduckgo/tracker-radar ) are crawled automatically once per month ( https://spreadprivacy.com/duckduckgo-tracker-radar/ ), and finding new trackers at that scale (especially fingerprinters, see https://github.com/duckduckgo/tracker-radar/blob/master/docs/DATA_MODEL.md ) is quite uncommon. I do not know whether AdGuard has that capability or relies on the community to find new trackers.

All 8 comments

This screenshot shows some of the fingerprinters that were not blocked by other lists (EasyList and AdGuard did not block some of them) but that appear in the DuckDuckGo Tracker Radar data sets. There may be more that could be added to more lists.

[Image: screenshot taken 2020-07-01 at 11:09:41]

Many domains included in the DuckDuckGo Tracker Radar data are not trackers. Discussion here: https://github.com/uBlockOrigin/uAssets/issues/7073

Given that each company develops its lists completely separately, a wholesale import of yet another list seems like overkill and would just bloat the list.

I'd just like to take a closer look at their methodology and maybe pick up something useful from it. Of course blindly importing a list that was automatically created by a bot does not seem a sane idea to me.

Of course, I agree with everything that has been said. What I had in mind was crawling the data set and importing perhaps only the entries that are undoubtedly trackers (for example, those that use lots of APIs or set many cookies and have high site prevalence). The data set lets you filter for what you want. CDNs and other elements that may cause site breakage would of course be excluded. I am not saying the DDG list is perfect, but maybe there is some useful info that can be pulled from it.
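To make the filtering idea concrete, here is a rough Python sketch. It is only an illustration: the field names (`prevalence`, `fingerprinting`, `cookies`, `categories`) are assumed to match the per-domain records described in Tracker Radar's DATA_MODEL.md and should be checked against the current data. The sample entries, the `likely_trackers` helper, and all thresholds are hypothetical.

```python
# Hypothetical sample entries. Field names are ASSUMED to follow the
# per-domain records described in Tracker Radar's DATA_MODEL.md.
SAMPLE_ENTRIES = [
    {"domain": "tracker.example", "prevalence": 0.12, "fingerprinting": 3,
     "cookies": 0.08, "categories": ["Advertising"]},
    {"domain": "cdn.example", "prevalence": 0.30, "fingerprinting": 2,
     "cookies": 0.0, "categories": ["CDN"]},
    {"domain": "rare.example", "prevalence": 0.0001, "fingerprinting": 3,
     "cookies": 0.0001, "categories": []},
    {"domain": "quiet.example", "prevalence": 0.05, "fingerprinting": 0,
     "cookies": 0.0, "categories": []},
]

# Categories skipped to reduce the risk of site breakage (assumed name).
EXCLUDED_CATEGORIES = {"CDN"}


def likely_trackers(entries, min_prevalence=0.01, min_fingerprinting=2,
                    min_cookies=0.01):
    """Return sorted domains that are prevalent AND either score high on
    fingerprinting (heavy API use) or set cookies on many sites.
    All thresholds are illustrative, not recommended values."""
    selected = []
    for entry in entries:
        # Skip CDNs and similar categories that may break sites if blocked.
        if EXCLUDED_CATEGORIES & set(entry.get("categories", [])):
            continue
        # Require high site prevalence.
        if entry.get("prevalence", 0.0) < min_prevalence:
            continue
        heavy_api_use = entry.get("fingerprinting", 0) >= min_fingerprinting
        many_cookies = entry.get("cookies", 0.0) >= min_cookies
        if heavy_api_use or many_cookies:
            selected.append(entry["domain"])
    return sorted(selected)


print(likely_trackers(SAMPLE_ENTRIES))
```

Anything that passes a filter like this would still need manual review before it goes into a list, since the thresholds alone cannot tell a tracker from a widely used but harmless script.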

There are a few instances where lists can block specific folders/directories, such as /akam/11/*, without being tied to a specific domain.

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
