• 2 DAYS LEFT: This Target Haunts Me Contest

    Tell us about the one that got away, the flier that ruined your group, the zero that drifted, the shot you still see when you close your eyes. Winner will receive a free scope!

    Join contest

NSA - the original contact tracer

E. Bryant

STABILITRAK ACTIVE
Supporter
Full Member
Minuteman
Supporter
  • Oct 25, 2010
    6,892
    10,004
    MI
    This is a hell of a read:


    Contact chaining on a scale as grand as a whole nation’s phone records was a prodigious computational task, even for Mainway. It called for mapping dots and clusters of calls as dense as a star field, each linked to others by webs of intricate lines. Mainway’s analytic engine traced hidden paths across the map, looking for relationships that human analysts could not detect. Mainway had to produce that map on demand, under pressure of time, whenever its operators asked for a new contact chain. No one could predict the name or telephone number of the next Tsarnaev. From a data scientist’s point of view, the logical remedy was clear. If anyone could become an intelligence target, Mainway should try to get a head start on everyone.

    “You have to establish all those relationships, tag them, so that when you do launch the query you can quickly get them,” Rick Ledgett, the former NSA deputy director, told me years later. “Otherwise you’re taking like a month to scan through a gazillion-line phone bill.” And that, right there, was where precomputation came in. Mainway chained through its database continuously—“operating on a 7x24 basis,” according to the classified project summary. You might compare its work, on the most basic level, to indexing a book—albeit a book with hundreds of millions of topics (phone numbers) and trillions of entries (phone calls). One flaw in this comparison is that it sounds like a job that will be finished eventually. Mainway’s job never ended. It was trying to index a book in progress, forever incomplete. The FBI brought the NSA more than a billion new records a day from the telephone companies. Mainway had to purge another billion a day to comply with the FISA Court’s five-year limit on retention. Every change cascaded through the social graph, redrawing the map and obliging Mainway to update ceaselessly.

    Mainway’s purpose, in other words, was neither storage nor preparation of a simple list. Constant, complex, and demanding operations fed another database called the Graph-in-Memory.

    When the Boston marathon bombs exploded in April 2013, the Graph-in-Memory was ready. Absent unlucky data gaps, it already held a summary map of the contacts revealed by the Tsarnaev brothers’ calls. The underlying details—dates, times, durations, busy signals, missed calls, and “call waiting events”—were easily retrieved on demand. Mainway had already processed them. With the first hop precomputed, the Graph-in-Memory could make much quicker work of the second and the third.

    To keep a Tsarnaev graph at the ready, Mainway also had to precompute a graph for everyone else. And if Mainway had your phone records, it also held a rough and ready diagram of your business and personal life.

    This is an interesting bit of history. Unfortunately, we're all still living in it.
     
    I might say that is an important read.

    Employers compete for data scientists by dangling both salaries and a list of the coolest tools in front of them. Recruiters roll up on campus with presentations that resemble slick action movie promos.

    Some students will make the choice for something like DARPA where they get to work at ridiculous scale and will forever have those five letters on their resume, but may take a massive salary hit vs. private industry. These folks know they probably won't be driving a 911 Turbo, and they won't get to brag about all the amazing coding they are doing because it could land them in jail. But... you do get the satisfaction of telling people that you can't tell them, and people will never cease getting pleasure from being told that the answers they seek are the most serious form of .gov verboten... Such Q&A may end with an exchange of knowing nods. For some, this serves as part of the compensation package.

    A few lucky data scientists will go to work for the likes of Goldman and find themselves with a high 6-figure, or maybe even 7-figure, salary before they hit 30 years of age. They will most definitely drive whatever car they want and they can, and will, give hints about what is going on at the office. You can code like a boss and between sips of $50 drinks you will chuckle at the grad students grovelling around you hoping for a job. These plebes worry about edge cases because inferior grey matter needs a machine to help identify them... hahaha. But not for you... before your fingers hit they keys you know exactly the cost of imperfection. As the obnoxious mechanical keys throw disturbances through the air a new convolutional neural network comes into being at conversational speed. Cha-ching, cha-ching... with all this COVID crap your 2020 bonus may only be 4x your base.
    -----

    For both of these populations most of the luster will wear thin, eventually. The high of professional satisfaction requires ever higher doses of whatever it was that caused you to take the job in the first place. And as that high wears off, for many people, it will reveal ethical choices that were glossed over because their uncomfortable presence was buried beneath gallons of dopamine that had gushed over your senses for too long a period of time. But also because a digital worlds puts distance between you and the rest of humanity. This is true for most all of us.

    The protection of personal data is becoming quaint in comparison to “free choice”, “truth” and “trust”, as discussed by Margo Boenig-Lipstin(1). An increasingly digitized world has helped cloud a sense of shared ethics. If ethics is a group of shared values, how can a digitized world not influence human activity if for no reason other than we have significantly altered how we observe human activity?

    I am fortunate to work with highly ethical people who have recently checked financial ambition with basic human decency. I find a lot of comfort in this.

    Edward Snowden is a hero. He sacrificed for the greater good. I am not sure the value of what he did, even if it is deemed criminal, is widely shared.


    (1) It’s Time for Data Ethics Conversations at Your Dinner Table
     
    Last edited:
    I might say that is an important read.

    Employers compete for data scientists by dangling both salaries and a list of the coolest tools in front of them. Recruiters roll up on campus with presentations that resemble slick action movie promos.

    Some students will make the choice for something like DARPA where they get to work at ridiculous scale and will forever have those five letters on their resume, but may take a massive salary hit vs. private industry. These folks know they probably won't be driving a 911 Turbo, and they won't get to brag about all the amazing coding they are doing because it could land them in jail. But... you do get the satisfaction of telling people that you can't tell them, and people will never cease getting pleasure from being told that the answers they seek are the most serious form of .gov verboten... Such Q&A may end with an exchange of knowing nods. For some, this serves as part of the compensation package.

    A few lucky data scientists will go to work for the likes of Goldman and find themselves with a high 6-figure, or maybe even 7-figure, salary before they hit 30 years of age. They will most definitely drive whatever car they want and they can, and will, give hints about what is going on at the office. You can code like a boss and between sips of $50 drinks you will chuckle at the grad students grovelling around you hoping for a job. These plebes worry about edge cases because inferior grey matter needs a machine to help identify them... hahaha. But not for you... before your fingers hit they keys you know exactly the cost of imperfection. As the obnoxious mechanical keys throw disturbances through the air a new convolutional neural network comes into being at conversational speed. Cha-ching, cha-ching... with all this COVID crap your 2020 bonus may only be 4x your base.
    -----

    For both of these populations most of the luster will wear thin, eventually. The high of professional satisfaction requires ever higher doses of whatever it was that caused you to take the job in the first place. And as that high wears off, for many people, it will reveal ethical choices that were glossed over because their uncomfortable presence was buried beneath gallons of dopamine that had gushed over your senses for too long a period of time. But also because a digital worlds puts distance between you and the rest of humanity. This is true for most all of us.

    The protection of personal data is becoming quaint in comparison to “free choice”, “truth” and “trust”, as discussed by Margo Boenig-Lipstin(1). An increasingly digitized world has helped cloud a sense of shared ethics. If ethics is a group of shared values, how can a digitized world not influence human activity if for no reason other than we have significantly altered how we observe human activity?

    I am fortunate to work with highly ethical people who have recently checked financial ambition with basic human decency. I find a lot of comfort in this.

    Edward Snowden is a hero. He sacrificed for the greater good. I am not sure the value of what he did, even if it is deemed criminal, is widely shared.


    (1) It’s Time for Data Ethics Conversations at Your Dinner Table
    There's a lot of truth in this prose...
     
    There's a lot of truth in this prose...

    Seconded. The value of it will be missed by those who are screeching about the latest distraction. Funny how there are so many with the time and energy to shitpost in threads on some random inconsequential police shooting, but when it comes to stuff like this that literally affects anyone with a phone, it's crickets and awkward side glances.