Let’s name the crossover point

Over dinner in Amsterdam recently, George Dyson — who knows a thing or two about the history of computing — told me that a crossover of sorts has happened, or is happening now.

The crossover is between a time when we erased storage media to make room for fresh data and a time when we save nearly all of it. This is one reason there’s all this talk about Big Data. We need big ways (storage, analytics, software, services) to deal with the accumulations.

At the personal level we don’t yet have more than a few primitive means, relative to whatever it is that Google, Amazon, Facebook, the NSA and other big entities are doing. At their level, who knows? Let’s say Google wants to save all your deleted Gmails. The mails might be deleted for you, but are they deleted for Google? I have no idea. All I know is that storing and analyzing them is more and more doable for them.

I don’t have an axe to grind here (not yet, anyway). I’m just noting that this change is freighted with many possibilities and many meanings. And so, to make it easier to talk about, I suggest we name it, if it isn’t named already.

Hmm… since the sum of all stored data is Too Big to Know, maybe we should call it the Weinberger Threshold. One reason I like that (at least provisionally, besides liking David) is that there is what I consider a fallacious assumption, or presumption, behind much Big Data talk: that an analytical system can know us better than we know ourselves.

But that’s a whole ’nuther topic, and maybe we should avoid conflating one with the other. (Though I do think the two, Big Data and Too Big to Know, are related, and I am sure David has thought about this stuff far more than I.)

Anyway, just blogging out loud here.

Discuss.