Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

wow+1. Truly creepy.

But this is possible because we let them use this data. Thankfully, in EU the GDRP makes this almost impossible.

On a side note, working on a platform that is similar to sentiance in data aquisition, but for totally other purpose and zero effect on privacy, due to a transparent anonymization.



Are you going to invade a foreign power when one of their sovereigns ignores your demand to forget?


No, just like China doesn't invade the United States because someone on Hacker News posted a nastygram about Jinping.

But it sure as shit won't let you run a Chinese message board where you allow that sort of thing.


Legally, I cannot collect any data and keep the source. So I do strip all the possible identification, until someone's effort to pin point to any personal data becomes economically unfeasible. I also tend to store as little as possible raw data and in some cases, I do data scrambling, where the data as in values is not important for the engine.

A demand to forget has to show that I have data on that person. Without this, claim is impossible. No invasion.


> I do strip all the possible identification

"Anonymizing" data is a myth; stripped or hashed columns can usually be recovered by correlating the remaining data with other data sets.

> until someone's effort to pin point to any personal data becomes economically unfeasible.

Which admits that the data probably is recoverable, just "economically infeasible". Do you have proof of that claim? Unfeasible for who? In general, the difficulty of re-correlating data goes down as the amount of data grows.

> I do data scrambling, where the data as in values is not important for the engine.

DJB once described[1] hashing as "magic crypto pixie dust" that "takes personally identifiable information and makes it incomprehensible to the marketing department".

> A demand to forget has to show that I have data on that person.

So you're trying to launder data to circumvent the letter of the law. This kind of scofflaw, antisocial attitude is how you attract reactionary, heavy handed regulations.

[1] https://projectbullrun.org/surveillance/2015/video-2015.html...


I suppose you talk a bit from outsite.

There is no other data set to correlate. Simply all the external context data is not recorded anywhere. Not even in logs. Unless some higher agency is going to hack some network driver to pick up the tcpip source, etc, I don't see how the data could be associated with datetime and location. This is why I was talking about "economically infeasible".

Data scrambling doesn't mean hashing. Sorry, here you are wrong. It's a on-the-fly frequency/timedomain scrambling, means someone has to physically, again, access a server and pick up from the memory the algorithm. And no marketing department, all research here!

The other stuff, I wound't reply, but let me asure you, there is no law circumventing. We are open and if someone can pinpoint of some personal data, there is no issue removing it.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: