How to think statistically (about dieting)
There are lots of ways to get statistical thinking wrong, not so many ways to get it right. Here’s a series of examples from wrong to right:
- I did this, and it’s not a “diet,” it’s a lifestyle change, and it works for me!
- I know people who live or interact with the world in a certain way, and it seems to work for them! After all, French women are thin. We should all do what they do.
- There was a study of volunteers, and for the people who stayed in the study to the end, they lost weight doing such and such lifestyle change!
- There was a study of volunteers, and they tracked people down who tried to leave the study, and the average weight gain was still real, among the people they found!
- There was a study of doctors giving advice or enrolling people in programs to help overweight people lose weight, and 97% of people lost no weight and plenty of people gained weight, maybe even more than half.
What I’d love is for people to understand how much difference there is between a personal experience (1) and advice we’d have on public health (5).
Here’s the golden standard: if you can come up with something to tell Medicare about how to have a population of morbidly obese people become a population of regular weight people, then you win. Otherwise, if you’re tempted to tell me about a lifestyle change that worked for you, please don’t, because that’s not statistical.
Also, I’d like a word about the theory that with enough discipline and willpower, anyone can lose weight. I think it’s fair to say I have discipline and willpower. In fact, I’m a fucking poster child for them. I wrote a Ph.D. as one of few women in a male-dominated field. I wrote a book or two. I’ve had three kids and I’ve never struck one of them in anger. In fact I’m pretty nice to people most of the time, even though I’m relatively often filled with rage at the unfairness of the world. That’s hard. It takes willpower.
I even ran a sprint triathlon at 275 pounds, really fast, which took months of ridiculous training. Also, I know all about healthy habits, I don’t eat “emotionally,” just when I’m hungry, and I love brussel sprouts and other healthy foods. I just get really fucking hungry, often.
Readers, I’m the fucking center of the disciple in willpower universe over here.
Given all of that, if anything I’d argue my willpower is one reason I’m so heavy. When I was 22 or so, I went on a fat-free diet, on the advice of my doctor, that fucked me up; I lost 30 pounds but then gained something like 75. I think I messed up my insulin resistance. In fact I believe that also happened to me on my first starvation diet when I was 14.
I’m guessing I’d be thinner if I’d had less willpower, in other words. I wouldn’t be better off, though, because I kind of like my books and my Ph.D. and my kids who don’t fear their parents.
Anyway, from now on let’s talk statistically, shall we?
The nature of choice in diets
There’s a lot of statistical evidence that dieting doesn’t work. I’ll postpone the documentation of the highlights of that evidence for a later post, but you can google it for yourself (avoid, if you can, the links that are trying to get you to buy something).
And when I say “diets don’t work,” here’s what I mean. I mean that, statistically speaking, people who go on diets don’t successfully lose and keep off weight for more than about six months. So, after two years or so, the average weight is about the same or higher in a group of dieters.
Can we take that as a given for now? Thanks. We can argue about it later if you want.
Here’s the thing. That statement confounds lots of people, I think because it’s statistical in nature. They will always imagine that, because they are themselves examples of someone who has lost weight and kept it off for more than two years through dieting, dieting does in fact work, and we should all try what they’ve tried.
It’s annoying to be told this over and over again, especially when you’re someone who’s tried a million things. And believe me, almost every fat person I know has tried a million things. For that reason I’d appreciate no more such advice, although in a later post I will be asking for zany pseudo-scientific theories about why fat people stay fat (there are so many!).
So yeah, people don’t understand statistical facts. But I think there’s something more going on here. Namely, the illusory nature of choice when it comes to dieting.
Because diets do seem to work short term, people think they’ve gotten control over their eating, at least temporarily. And then, at some point, people drop off their diets. They sometimes do it with a “what the fuck” attitude, but my guess is most of them don’t even remember doing it. It’s a kind of momentary amnesia, and before they know what’s happened they’re eating something they shouldn’t have. That is certainly my experience.
From the outsider’s perspective, that’s a person who has chosen to go off their diets, and in a certain sense it’s obviously true, since for example anyone who was locked in a cell with no food would not have the ability to go off a diet, nor would someone who cannot feed themselves. Indeed, it requires the access to food and the action of eating to go off a diet. So in that sense it takes a certain amount of freedom.
But, there’s another sense in which, I’d argue, there’s no choice in the matter at all. After all, dieting requires a positive declaration of a desire to lose weight. Sometimes it even requires forking over cash, maybe a lot of cash. People are trying hard to lose weight, in other words, and yet they can’t, and even statistically speaking they cannot.
Said another way: if 1000 people went to a lot of trouble to do something, and they all tried but 990 of them failed to do it, would we decide they had made the choice not to do it?
I’m ready to say there’s something else at work here, something more basic than free will. It’s like our choice to breathe. We can’t decide not to do it. Or we can, but only for a bit.
Commenters, please stick to the question of the nature of choice in dieting. I will delete other stuff, thanks!
Updates: TED and bariatric surgery
Readers, I’ve got two announcements today.
First, I’ll be giving a TED talk in April in Vancouver. And yes, for those of you remember, I haven’t always been the biggest fan of such things. But I’ve changed my mind/ sold out/ decided that it might just be great.
As a friend of mine explained to me, sometimes things get so douchey they come out the other side and are super cool. Also, I’m giving a talk in the section called Our Robotic Overlords, so that’s a very good sign.
Second, I’ve decided to undergo bariatric surgery. I’m jumping through the many insurance-qualifying hoops for now but if all goes well it will happen later this year, possibly as soon as July.
And… I’m planning to chronicle my journey on mathbabe. If that kind of thing doesn’t interest you, feel free to never come back, but if that kind of thing does interest you, then buckle up!
I’m not planning to keep myself to the subject of the bariatric surgery; in fact that’s just an excuse to think about a lot more, specifically:
- the nature of scientific understanding and how it does or does not percolate throughout society as a whole,
- how money and shame corrupt our understanding of scientific evidence,
- how bad data and bad technologies and biased academic publishing prevent us from learning optimally,
- the nature of individual choice, willpower, and control,
- my historical self-image as a dieter, a fat person, a woman, a feminist, and a thinker,
- how I gathered evidence and made this decision, and of course
- the process itself.
So I’m thinking kind of big and I’m going to have fun with it. Please feel free to comment, I’d love your help!
How Data Can Make Immigrants Look Like Criminals
My newest Bloomberg View column:
How Data Can Make Immigrants Look Like Criminals
Bigger Data Isn’t Always Better Data
My newest piece on Bloomberg:
Bigger Data Isn’t Always Better Data
Insurance and Big Data Are Incompatible
My newest Bloomberg View piece about how that FitBit could be bad for your health:
That Free Health Tracker Could Cost You
New links!
- I wrote about how big data is undermining our understanding and faith in historical facts and in statistics in my newest Bloomberg column, Do You Trust Big Data? Try Googling the Holocaust
- Last week this Vice piece came out, which I contributed to along with lots of writers I really admire like Astra Taylor, on how technology can be made to work for us: Man Versus Machine
- My buddy Paul-Olivier Dehaye is on fire over at medium.com with his newest approach to disrupting the big data surveillance state. He now has devised a way to request your file from Cambridge Analytica, and I’m totally doing this: Quick guide to asking Cambridge Analytica for your data

Age of Algorithms: Data, Democracy and the News Event at NYU Journalism 2/15
Next Wednesday evening I’ll be talking data, democracy, and the news with the amazing Julia Angwin at the NYU Journalism School moderated by Robert Lee Hotz. More information here.
Please come! Or if you can’t come, you can watch the livestream.

Dear President Bannon…. #PostcardsToBannon
How do you get rid of the influence of Steve Bannon’s whispering in Trump’s ear? The best strategy I’ve heard is to make Trump jealous of the attention. And one way to do that is to refer to Bannon as the president.
The hashtag #PostcardsToBannon blew up on Twitter yesterday, with all sorts of people posting pics of their postcards:

From Justin Hendrix via Twitter
In fact, it got so much attention that it was featured overnight on USA Today.
It’s a small act but it might make you feel great to do it.
Donald Trump is the Singularity
I have a new fun piece over at Bloomberg this morning:
Becky Jaffe: Resources to #Resist
This is a guest post by Becky Jaffe.
Per your request, I drafted a quick list of progressive organizations that we will want to support now more than ever. This list of national organizations is by no means comprehensive, just a good place to start if you want to get plugged in to community organizations that build power for the most marginalized sectors of our society. Each of these is a clickable link that will take you directly to the organization’s website so you can learn more about their mission. Please add to this list and circulate widely. I will be creating a Bay Area-specific list soon for people who want to support local community organizations and I encourage you to make a similar list for your region.
Let’s get busy supporting each other, people! We have our work cut out for us and much joyful organizing ahead.
Immigrant/Refugee rights:
- National Network for Immigrant and Refugee Rights
- National Immigration Project of the National Lawyers’ Guild
- National Immigration Law Center
- Catholic Charities
- the New American Leaders Project
- Presente
- Define American
Civil Rights, social justice and legal defense organizations:
- CAIR, the Council on American-Islamic Relations
- SURJ, Showing Up for Racial Justice
- NAACP, National Association for the Advancement of Colored People
- Black Lives Matter
- the Anti-Defamation League
- Race Forward
- Fred T. Korematsu Institute for Civil Rights and Education
- Bend the Arc: a Jewish partnership for Justice
- Center for Constitutional Rights
- Human Rights Watch:United States
- ACLU, the American Civil Liberties Union
- NLG, the National Lawyer’s Guild
- Legal Aid Society
- SPLC the Southern Poverty Law Center
- The Innocence Project
- Schools Not Prisons
- Anti-Eviction Mapping Project
- SEIU, Service Employees International Union
- Planned Parenthood
- National Organization for Women
LGBTQ rights:
- GLAAD: Gay And Lesbian Alliance Against Defamation
- National Center for Lesbian Rights
- Human Rights Campaign
- Lambda Legal Defense and Education Fund
- Transgender Law Center
Disability rights:
Building democracy:
- Women’s March on Washington: 10 Actions for the first 100 Days
- the Equal Justice Society
- The Highlander Research and Education Center
Fight for the Future - Indivisible: Former congressional staffers reveal best practices for making Congress listen
- Common Cause
- FAIR: Fairness and Accuracy in Reporting
- Center for Digital Democracy
- Brennan Center for Justice
- Public Citizen
- Inequality Media
Environmental organizations:
Cambridge Analytica
My newest Bloomberg post is out, in response to this article about Cambridge Analytica:
Get a New York ID Card #Resist
This is a guest post by Elizabeth Hutchinson, an Associate Professor of Art History at Barnard College/Columbia University who supports social justice initiatives at work and in her community. She is also a yarn whisperer who likes nothing better than knitting with Mathbabe.
If you are a regular reader of Mathbabe, you may already be putting your time, money and intellectual labor to work in support of organizations that defend the rights of vulnerable groups and our vulnerable environment (#BlackLivesMatter, Make the Road New York, Planned Parenthood, SURJ, 350.org, NYCStandswithStandingRock, and many others).
But if you are a New York City resident, here’s another practical thing you can do: apply for an ID NYC card.

ID NYC is a program established by the de Blasio administration in 2014 that allows city residents to obtain a photo identification without requiring the same government-generated documents required for a drivers license or passport. These residents then have a municipal ID that can help them open bank accounts, apply for library cards and gain access to other services as well as free membership to a range of NYC cultural institutions like the Museum of Modern Art.

In lieu of a Social Security card or equivalent document, applicants for the ID NYC could use non-U.S. government-generated forms of identification, including, among other things, a combination of a utility bill verifying a local address and a foreign passport or consular identification.
Even if you have a photo ID and a library card, here’s why you should get an ID NYC: this program is widely used by the undocumented immigrants in our midst, and the records of their applications are vulnerable to seizure by federal government authorities charged with expanding the pursuit of both undocumented and documented immigrants.
How is this so, you might ask, knowing that New York is a sanctuary city? Well, it is true that New York is committed to not aiding Immigration and Customs Enforcement (ICE) in a number of ways. For example, it has pledged not to use its city precincts or jails to house immigrants detained by Immigration and Customs Enforcement (though it does cooperate when ICE requests individuals already in NYC custody who were convicted of a serious felony) and to not share city agency information with federal immigration authorities.

Sanctuary Cities according to this site. For a more complete list click here.
The ID NYC program was set up to be in line with this stance: the law establishing the program ordered that the copies of documents used in applying for the ID be destroyed at the end of the first two years, or in December 2016, in the meantime only sharing them with law enforcement only through judicial subpoena (something that happened only a handful of times). However, a case brought by Republican members of the State Assembly from Staten Island in December resulted in a ruling that all records be retained indefinitely.
After Trump’s election, Mayor de Blasio pledged to change the record keeping system and stop retaining copies of the applicants’ documents beginning in 2017. However, the city will continue to retain significant information about applicants, including their name, gender, address, birthdate, and the photo taken when the id was made.
The ID NYC program DOES NOT ask applicants about their immigration status. Nevertheless, because this program is well used by members of New York’s immigrant communities (according to the Gothamist, over a third of NYC residents are foreign-born), these applications could be used for fishing expeditions looking for our undocumented neighbors.
Yes, the Mayor has pledged to fight to keep this paperwork private. But we can’t be sure how the courts will act when push comes to shove.
The solution? Gum up the works.

Blast the program with lots and lots of applications from NYC residents so that any authority that does manage to subpoena applications has an immense archive to wade through. Estimates suggest that about 1 million people have applied for ID NYC to date. That leaves about 6.8 million New Yorkers who still can. (Yes, kids can apply, too, as long as they are 14.)
Applying is easy, though it will take you a little time. You start by making an appointment at one of the 25 enrollment centers. There’s a form to fill out (applications are available in more than 25 languages), that you can do ahead of time and print out or fill out when you get there. Bring along your documents. Once you check in, you wait for an agent to go over the application and take your picture and then you can arrange to receive the id in the mail or pick it up. I got mine at the Mid-Manhattan Library. I made the appointment about a month ahead of time, though there were appointments sooner, and waited less than an hour to see the agent. It was about as much hassle as mailing a package at the post office.

Maybe this isn’t the most effective form of resistance, but it is an easy one that may do some good.
I look forward to seeing you in the streets. And the public library. And MoMA.

To report incidents of discrimination or hate
- The Governor’s Office – 1-888-392-3644
- The Mayor’s Office of Immigrant Affairs 311 or 212-788-7654. Translation is available. You can also go to www1.nyc.gov for many other resources for NYC immigrants.
Additional Resources
- ImmigrationLawHelp.org – Helps low-income immigrants find legal help.
- National Immigration Law Center: Explains your rights, no matter who is president.
- New York Immigrant Coalition and
- Make the Road: Provide policy updates and resources to support immigrants in NYC
- New York Communities for Change
- Causa Justa/Just Cause
Immigrant protests #JFKTerminal4 and 2pm at Battery Park today
I was excited to join the protest at JFK Airport last night. Here’s some footage:
And here’s two nice pictures:


One of the cool things about the protest is how messages were sent and spread through the chants. In particular I learned about another planned protest today at Battery Park at 2pm, which I believe is being organized by immigrant rights group Make the Road.

More information available here.
By the way, in case you’ve heard that a judge put a stay on the Executive Order about immigrants, there are plenty of reasons to question that. It’s also possible that border patrol agents are not obeying those orders.
Bloomberg post: When Algorithms Come for Our Children
Hey all, my second column came out today on Bloomberg:
When Algorithms Come for Our Children
Also, I reviewed a book called Data for the People by Andreas Weigend for Science Magazine. My review has a long name:
Bloomberg View!
Great news! I’m now a Bloomberg View columnist. My first column came out this morning, and it’s called If Fake News Fools You, It Can Fool Robots, Too. Please take a look and tell me what you think!
Pussyhats and the activist knitter
I finally got around to knitting my first pussyhat yesterday, during the inauguration. It took less than two hours because I was using super bulky yarn and because I had lots of anxious energy to tap into.

I got the yarn last Saturday, when I went to a Black Lives Matter march in the morning (you can see my butt multiple times in the embedded video) and then afterwards to Vogue Knitting Live in the Times Square Marriott Marquis.
And here’s the thing, I thought I was going to enjoy the juxtaposition of activist-to-insane hobbiest, but I was wrong – knitters were activists too! Here’s what I saw:

Pink yarn everywhere.

Pussyhats everywhere

Not only women of course! Alex looks dashing with his ombre pussyhat.

Karida Collins doesn’t have a pussyhat on but she’s still killing it.
Since last weekend, I’ve been seeing pussyhats everywhere. You go into a yarn store and here’s what you see.

Or you happen upon an airplane full of women heading to D.C. and here’s what you see.

I’m pretty sure half those women have knitting needles in their laps.
My favorite way to measure this phenomenon is directly, at the source. I am of course referring to Ravelry, the online social media website for knitters and crafters. The pussyhat project has spawned all sorts of creative ideas, of course.

The original pattern has thousands of associated projects

Lots of variations have been invented of course

Here’s a great example

Not particularly cat-like but I like it
Now that it’s happened, it’s obvious that knitters are a perfect community for activism. We’re friendly, community-oriented, and desperate for an opportunity to make something and give it away. Because it gives us an excuse to buy more yarn.
Anyhoo, I’m going to the Women’s March NYC today with mine, and I’m going to try to knit at least one more before I leave at 11am. See you there!

Two out of three “fairness” criteria can be satisfied
This is a continuation of a discussion I’ve been having with myself about the various definition of fairness in scoring systems. Yesterday I mentioned a recent paper entitled Inherent Trade-Offs in the Fair Determination of Risk Scores that has a proof of the following statement:
You cannot simultaneously ask for a model to be well-calibrated, to have equal false positive rates for blacks and whites, and to have equal false negative rates unless you are in the presence of equal “base rates” or a perfect predictor.
The good news is that you can ask for two out of three of these. Here’s a picture of a specific example of this, where I’ve simplified the situation so there are two groups of people being scores, B and W, and they each can be scored as either empty or full, and then the reality is that could either be empty or full. They have different “base rates,” which is to say that in reality, a different proportion of the B group is empty (70%) than the W group (50%). We insist, moreover, that the labeling scheme is “well-calibrated”, so the right proportion of them are labeled empty or full. I’ve drawn 10 “perfect representatives” from each group here:

In my picture, I’ve assumed there was some mislabeling – there’s a full in the empty bin and there are empties in the full bin. Because we are assuming the model is well-calibrated, every time we have one kind of mistake we have to make up for that mistake with exactly one of the other type. In the picture there’s exactly one of each mistake for both the W group and the B group, so that’s fine.
Quick calculation: in the picture above, the “false full rate”, which we can think of as the “false positive rate,” for B is 1/3 = 33% but the “false positive rate” for W is 1/5 = 20%, even though they each have only one mislabeled representative each.
Now it’s obvious that, theoretically, the scoring system could adjust the false positive rate for B to match that of W, which would mean having 3/5 of a representative be mislabeled. But again, that’d mean we would need only 3/5 of a representative be mislabeled in the empty bin as well.
That’s a false negative rate for B of 3/35 = 8.6% (note it used to be 1/7 = 14.3%). By contrast the false negative rate for A stays fixed at 1/5 = 20%.
If you think about it, what we’ve done is sacrificed some false negative rate balance for a perfect match on the false positive rate, while keeping the model well-calibrated.
Applying this to recidivism scores, we can ask for the high scores to reflect base rates for the populations, and we can ask for similar false positive rates for populations, but we cannot also ask for false negative rates to be equal. That might be better overall, though, because the harm that comes from unequal false positive rate – sending someone to jail for longer – is arguably more toxic than an unequal false negative rate, which means certain groups are let off the hook more often than the others.
By the way, I want to be clear that I don’t think recidivism risk algorithms should actually be the goal, summed up in this conversation I had with Tom Slee. I’m not even sure why their use is constitutional, to tell the truth. But given that they are in use, I think it makes sense to try to make them as good as possible, and to investigate what “good” means in this context.



