Do_not_track
46 points by RubyGuy 3 hours ago | 19 comments

PufPufPuf 9 minutes ago
This is set up for the same fate as DNT in browsers. Collecting all the "do not track" env vars into a single "do_not_track.env" file, however, may not be a bad idea...
reply
spudlyo 48 minutes ago
I was surprised how hard it was to stop the Python transformers library from phoning home to Hugging Face. I set HF_HUB_DISABLE_TELEMETRY=1, and when I called Wav2Vec2CTCTokenizer.from_pretrained I explicitly passed local_files_only=True, but still I got got a warning about not having a valid HF_TOKEN. It wasn't until I stumbled upon HF_HUB_OFFLINE=1 that I'm somewhat confident that I'm not making outgoing connections to HF every time I load a wav2vec2 model from disk.

I wouldn't have realized this was happening at all if it weren't for the obnoxious HF_TOKEN warning.

reply
ximm 44 minutes ago
Looks like a helpful honeypot! Any tool that will public announce support for this spec is a tool I know to avoid because it collects telemetry without explicit opt-in in the first place.
reply
XCSme 21 minutes ago
I thought it would be a sh script to automatically set the flags for all known do not track env vars.
reply
drayfield 40 minutes ago
Given the URL and list of different opt-outs I thought this was going to be a shell script to set all these for you. In fact, I've just had an idea...
reply
LeoPanthera 59 minutes ago
The most useful part of this page is the list of optout commands to stick in my shellrc.

Is anyone maintaining a more complete list of those?

reply
paddw 45 minutes ago
an LLM would do a fine job for most common things, doesn't really matter if a few of them get hallucinated
reply
smartmic 51 minutes ago
> Many CLI tools, SDKs, and frameworks collect telemetry data by default.

Any of those are using a dark pattern and before exploring new ways to opt out you should look for and spend your energy on an alternative which respects your freedoms upfront.

reply
Otek 45 minutes ago
Exactly, new “standard” won’t fix it
reply
batisteo 2 hours ago
It worked so well on the browser already
reply
huksley 35 minutes ago
Also this, we disable it when building or deploying apps in DollarDeploy

export SEMGREP_SEND_METRICS=off export COLLECT_LEARNINGS_OPT_OUT=true export STORYBOOK_DISABLE_TELEMETRY=1 export NEXT_TELEMETRY_DISABLED=1 export SLS_TELEMETRY_DISABLED=1 export SLS_NOTIFICATIONS_MODE=off export DISABLE_OPENCOLLECTIVE=true export NPM_CONFIG_UPDATE_NOTIFIER=false

reply
drnick1 54 minutes ago
It's probably easier to run your own DNS and blacklist the offending domains. There are good blacklists with millions of telemetry domains, e.g. https://github.com/hagezi/dns-blocklists.
reply
rvz 8 minutes ago
That is the correct way of handling this.

Everyone proclaiming a "standard" is just adding to the long list of (unofficial) alternatives.

reply
tosti 28 minutes ago
Better yet, don't allow such spyware crap on your computer.
reply
tonymet 27 minutes ago
He’s better off vibecoding an include.sh that sets all the known do not track env vars for you.
reply
stavros 18 minutes ago
Honest question, what's the problem with crash dumps that include no personal info? They just help make the software less buggy. I also don't see an issue with anonymized usage patterns (this feature was used X times this month, this one Y times, etc).

Can someone expound on what they see as a problem?

reply
circadian 10 minutes ago
I would suggest that the default to enrolling people in supplying such information is the issue. In a world driven by surveillance capitalism, even "anonymous" data can be used for much broader purposes (think, for example, of when and where people are using tools geographically and at what times: you can start to track the behaviour of people in this way).

Users should never be opted in through usage alone of free or paid-for tooling to supply information that isn't part of the function of the tool. Where that is required for a service or product, you should opt-in explicitly, not implicitly.

reply
stavros 8 minutes ago
That's fair, thanks.
reply
varispeed 27 minutes ago
Default opt-in tracking should be illegal and enforced with such fines and prison sentences, that companies wouldn't even dare to have anything remotely capable of tracking in the runtime.

Unfortunately big corporations can always find away to make regulators see no problem.

reply
iririririr 7 minutes ago
[dead]
reply