Hosts: do you plan to include these hosts in future?

Created on 28 Nov 2017  路  11Comments  路  Source: StevenBlack/hosts

All the sub categories available under this :
hpHosts

Most helpful comment

And just to pose a possible solution to the above problems, after contacting Malwarebytes about the redistribution of their data for an open-source project and procuring their permission, I am sure a diff study over a reasonable period of time would be welcomed. A "reasonable period of time" being defined by Steven. After the said study is conducted and the results posted here, we can then have more to work with and make a definitive decision backed up by the facts. So if anyone is willing to take this on, I am sure it would be a welcomed project.

Entries are checked via a variety of means: randomly, automatically, via funilrys' funceable script (https://github.com/funilrys/funceble), combinations of checks, socially/crowd-sourcing, and more. Steven personally vets the sources and some of those threads are still here if you want to dig into the closed issues and see what the conversations have gone like for a source to get approved and how they have described their curation process to the community. I am sure when Steven becomes available he can provide a better explanation of his exact process from his viewpoint. From my experience, having talked with many of the sources personally, many of them own and operate large-scale networks and are constantly gathering live data from live traffic and this is a large part of how they curate their lists to keep them up to date with how their traffic is trending.

All 11 comments

Hello! Thank you for opening your first issue in this repo. It鈥檚 people like you who make these host files better!

We only include hosts which are actively curated to keep our list as streamlined and efficient as possible, keeping in mind this list has also seen use in some integrated systems. So if those hosts should happen upon our actively curated lists, we will include them. Otherwise, we don't import bulk lists that do not maintain a high degree of curation, as this is counter to the mission statement of the repository.

The classified files over there are refreshed far more often rather than the month to month hosts.txt files that they upload, you can incorporate those smaller files. Furthermore, indeed, you're right about excluding the bulk ones that are not frequently updated.

A simple test we could perform is diff the lists from month to month to get a better feel for the curation happening here. Most of the time they are simply adding and not checking or pruning older invalid entries, which is the key to our list.

There would also be a terms of use issue we would have to work out, as any redistribution of their data requires their expressed permission.

Besides, would you be able to reveal some insight into how would you check for invalid entries and more seasoned ones? Do you do it physically or utilize some kinda program or a product?

And just to pose a possible solution to the above problems, after contacting Malwarebytes about the redistribution of their data for an open-source project and procuring their permission, I am sure a diff study over a reasonable period of time would be welcomed. A "reasonable period of time" being defined by Steven. After the said study is conducted and the results posted here, we can then have more to work with and make a definitive decision backed up by the facts. So if anyone is willing to take this on, I am sure it would be a welcomed project.

Entries are checked via a variety of means: randomly, automatically, via funilrys' funceable script (https://github.com/funilrys/funceble), combinations of checks, socially/crowd-sourcing, and more. Steven personally vets the sources and some of those threads are still here if you want to dig into the closed issues and see what the conversations have gone like for a source to get approved and how they have described their curation process to the community. I am sure when Steven becomes available he can provide a better explanation of his exact process from his viewpoint. From my experience, having talked with many of the sources personally, many of them own and operate large-scale networks and are constantly gathering live data from live traffic and this is a large part of how they curate their lists to keep them up to date with how their traffic is trending.

All things considered, I investigated a few different issues, and Steven doesn't appear to be keen to incorporate hpHosts as it is tremendous and doesn't have a similar level of curation, yet I was simply hoping that was this specific classification (https://hosts-file.net/pup.txt) incorporated into the Unified hosts record? That is to say, particularly the Potentially Unwanted Programs Category.

Just as a note, you can also always add custom entries to the myhosts file in the root directory of the repository and they will be automatically added to your build of the list. If you're using my Unified Hosts AutoUpdate for Windows, you can just add them anywhere you want in your hosts file as long as they are outside of the "#### BEGIN UNIFIED HOSTS ####" and "#### END UNIFIED HOSTS ####" markings and they won't be touched during a hosts update. This might be helpful in the future for any situation in which you wish to include entries not supported by this project.

Hi @udit-001! The main problem with HP hosts is its outsized collection. Hundreds of thousands of hosts.

Some operating systems, especially the most popular one, degrade notably once the hosts file reaches a certain size. Mobile devices, with their limited processing power, also seem to have issues dealing with very large host files.

HP Hosts was once included here. Not for very long; it led to an avalanche of complaints 鉀勶笍

Oh! I get it has an extreme number of false calls possibly. Also, coincidentally, do you intend to incorporate domains that utilize our PC assets to mine cryptographic money coins?

Some lists like this most likely:
ZeroDot1 - CoinBlocker Lists
Adblock No-Coin List

@udit-001 I suspect that, in due course, those hosts will end up in one or more of our curated sources.

Was this page helpful?
0 / 5 - 0 ratings

Related issues

beerisgood picture beerisgood  路  3Comments

bigdargon picture bigdargon  路  3Comments

dhavalgoti24 picture dhavalgoti24  路  3Comments

RaydenX93 picture RaydenX93  路  3Comments

Laicure picture Laicure  路  3Comments