Archive
Quants for the rest of us
A few days ago I wrote about the idea of having a quant group working for consumers rather than against them. Today I wanted to spell out a few things I think that group could do.
The way I see it, there’s this whole revolution of data and technology and modeling going on right now, but only people with enough dough to pay for the quants are actually actively benefiting from the revolution. So people in finance, obviously, but also internet advertising companies.
The problem with this, besides the lopsidedness of it all, is that the actual models being used are for the most part predatory rather than helpful to the average person.
In other words, most models are answering question of the form:
- how can we get you to spend money you don’t necessarily want to spend? or
- how good a credit risk are you? or
- how likely are you to come back and spend more money? or
- how can we anticipate the market responding to end-of-month accounting shenanigans? I just threw this one in to give people a sense of how finance models work.
It all makes sense: these are businesses that are essentially bloodthirsty, making their money off your clicks and purchases. They are not going away.
Then there are some models that are already out there trying to make the user experience more enjoyable. They answer questions of the form:
- If you like that, what else may you like? (Pandora, Netflix, Google)
- If you bought that last week, maybe you’d like to buy it again this week?
- If you bought that, maybe you’d like to buy this now?
The problem, as you can see, is that these second, helpfulish kinds of things quickly devolve into the first, predatory kinds of things. In other words, you’re being bombarded by ads and suggestions for spending more money than you actually want to spend.
What about stuff that you actually don’t want to do but that is probably not directly profitable to anyone? I’d love to see technology used to tackle some of these chores:
- Figuring out what the best deal is on loans (credit card, student loans or mortgages), without becoming a lawyer. Here I’m not just saying they should all be clear about their terms- that’s a job for the CFPB. I mean there should be a website that asks me a few questions about what I need a loan for and points me to the best deal available.
- Finding the best price for something. I discussed this briefly here.
- Warnings about weird conditions and agreements, like mandatory arbitration.
- Help finding a good doctor or a good plumber etc.
- Help knowing when to go to the DMV or other public services building so the lines are bearable, or even better a way to do stuff on your computer or phone and avoid the trip altogether.
- Getting your kids’ medical records to their schools and camps safely and efficiently.
Some of these things already exist (like the doctor thing) but aren’t well-known or well-publicized, so wouldn’t it be cool if this quant group also provided an app that would aggregate all these services into one place?
And for the things like #4, where you want to be warned before you buy, it’s too much work to go back to a webpage to check everything before buying. Instead, the app could follow you around the web when you’re shopping and overlay a “bullshit warning” icon on the potential purchase in situations where people have complained about unreasonable terms and conditions.
Let’s use technology in our favor. Instead of the companies collecting information about us, let’s collect information about them. Come to think of it, the CFPB should start this quant group today.
Versus what?
I’m going to specialize in short, curmudgeony blog posts this week.
Today’s topic: you always need something to compare a new thing with. It’s this versus what?
If it’s a model, compare it to noise. That is, go ahead and test a model by scrambling the “y”s and see how well your model predicts randomness. It’s a really good and inexpensive way of seeing whether your model is better than noise, so go ahead and do it. There’s even a name for this but I forget what it is (update from reader: permutation testing).
If it’s a plan for a system or the world, compare it to the status quo. I’m so sick of people discarding good plans because they’re not perfect. If they’re better than what’s currently going on, then let’s go with that. Which brings me to my last example.
If it’s someone’s proposal (person A), compare it to other proposals (person B). I don’t think it’s fair for people (person C) to nix an idea unless they come up with a better one. If person C is consistently doing that, it’s a good bet that they have something to protect in the status quo situation, which brings us to the previous example.
The Value Added Teacher Model Sucks
Today I want you to read this post (hat tip Jordan Ellenberg) written by Gary Rubinstein, which is the post I would have written if I’d had time and had known that they released the actual Value-added Model scores to the public in machine readable format here.
If you’re a total lazy-ass and can’t get yourself to click on that link, here’s a sound bite takeaway: a scatter plot of scores for the same teacher, in the same year, teaching the same subject to kids in different grades. So, for example, a teacher might teach math to 6th graders and to 7th graders and get two different scores; how different are those scores? Here’s how different:
Yeah, so basically random. In fact a correlation of 24%. This is an embarrassment, people, and we cannot let this be how we decide whether a teacher gets tenure or how shamed a person gets in a newspaper article.
Just imagine if you got publicly humiliated by a model with that kind of noise which was purportedly evaluating your work, which you had no view into and thus you couldn’t argue against.
I’d love to get a meeting with Bloomberg and show him this scatter plot. I might also ask him why, if his administration is indeed so excited about “transparency,” do they release the scores but not the model itself, and why they refuse to release police reports at all.
Why experts?
I recently read a Bloomberg article (via Naked Capitalism) about wine critics and how they have better powers of taste than we normal people do.
One of the things I love about this article is how it’s both completely dead obvious and at the same time totally outrageous when you think about it.
Obvious because when we see wine experts going on and on about what they can discern in their tiny sip, we know they either have magical powers or they’re lying, and since some of them can do this shit blindfolded we will assume they aren’t lying.
Outrageous because if you think about it, that means we follow the advice of people whose taste is provably different from ours. In other words, the word of experts is fundamentally irrelevant to us, and yet we care about it anyway.
My question is, why do we care?
Just to go over a couple of ground rules. First, yes, let’s assume that the wine experts really do have powers of discernment that are incredible and unusual, even though we have to trust an expert on that, which may seem contradictory. The truth is, this isn’t the first study that’s shown that, and I for one have hung out with these guys and they really do taste that minerally soil in the wine. I’m not even jealous.
Second, I’m not saying you care about wine experts’ opinions, but lets face it, lots of people do. And you could say it’s because of the performance that the experts give when they smell the wine and describe it, and I’ll agree that some of them can be poets, and that’s nice to see. But the truth is they also rate the wines with a number, and these numbers are printed in books, and lots of people carry these books around to wine shops and devote themselves to only buying wines with sufficiently high numbers, even though those people probably don’t themselves have the mouth smarts to tell the difference.
Now that we’ve framed the question, we can go ahead and make guesses as to why. Here are mine:
- People are hoping that they themselves are also supertasters. This kind of seems like the most obvious one, but I can’t help think that true supertasters would not need other people’s opinions at all.
- People think experts’ opinion of “good”, even if not completely the same as theirs, will be highly correlated, and so is better than nothing.
- People want to be seen drinking wine that supertasters would drink, as a sort of cachet thing.
I think #3 above is pretty much the definition of snob, and I think it exists but is not the major reason people do things (but I could be wrong). I’m guessing it’s more typically #2, but even so it doesn’t explain the really expensive high-end wine market’s huge appeal, unless there are way more supertasters than I thought out there.
I think this question, of why people listen to expert advice even when it’s mostly irrelevant, is an important one, because it happens so much in our culture, and clearly not just about wine.
I for one am attracted to the idea of going one step further and ignoring expert advice. I see a natural progression: first, people are ignorant, second, they learn what experts think, and third, they ignore experts and go with their gut.
But even sexier is the idea of never listening to experts at all: skip step two. Am I the only person who thinks that’s sexy? I mean, I guess it mostly means you’re wasting your time, but it also comes out in the end with less herd mentality.
I think this desire I have of skipping the expert advice is very tied into why I despise the echo chamber of the web and how we are profiled online and how our environment is constantly updated and tailored to our profile. It’s in some sense an expert opinion on what we’d like, given our behavior, and I hate the finiteness of that concept, possibly in part because I’ve designed models like that and I know how dumb they are.
Anyway, I’d love to hear your thoughts, and if I’ve missed any reasons why people like to hear partially relevant expert opinion.
Do not track vs. don’t track
There’s been some buzz about a new “do not track” button that will be installed in coming versions of browsers like google chrome. The idea is to allow people their privacy online, if they want it.
The only problem is, it doesn’t give people privacy. It only blocks some cookies (called third-party cookies) but allows others to stick.
Don’t get me wrong- without third-party cookies, the job I do and every other data scientist working in the internet space will get harder. But please don’t think you’re not being tracked simply by clicking on that.
And as I understand it, it isn’t even clear that third-party cookies won’t be added: I think it’s just an honor system thing, so third-party cookie pasters will be politely asked not to add their cookies.
But don’t believe me, visualize your own cookies as you travel the web. The guy (Atul Varma) who wrote this also open-sourced the code, which is cool. See also the interesting conversation in comments on his blog Toolness.
Let me suggest another option, which we can call “don’t track”. It’s when nothing about what you do is saved. There’s a good explanation of it here, and I suggest you take a look if you aren’t an expert on tracking. They make a great argument for this: if you’re googling “Hepatitis C treatments” you probably don’t want that information saved, packaged, and sold to all of your future employers.
They also have a search engine called “DuckDuckGo” which seems to work well and doesn’t track at all, doesn’t send info to other people, and doesn’t save searches.
I’m glad to see pushback on these privacy issues. As of now we have countless data science teams working feverishly in small companies to act as predators against consumers, profiling them, forecasting them, and manipulating their behavior. I’m composing a post about what a data science team working for consumers would have on their priority list. Suggestions welcome.
Charity auctions and hate crimes
I read an absolutely incredible story last night on Bloomberg.
This Morgan Stanley executive William Jennings (co-head of North American fixed-income capital markets) is being charged with a hate crime. Let me piece it together a bit.
On December 22nd Jennings hosted a charity auction at Morgan Stanley until 6pm, then went to Ink48, a hotel in midtown on the west side. After partying on the rooftops for some time, and drinking, his car service didn’t show up fast enough for him so he hailed a cab to take him to Connecticut, where he lives with his wife and three kids in a $3.4 million house.
When he got to Connecticut, he got into a fight with the cab driver and ended up refusing to pay, stabbing the guy in his hand with a knife (which required 60 stitches) while using ethnic slurs. Then he went away to Florida for two weeks on the DL. My favorite line from the article:
Jennings fell asleep during the trip, the driver said. Once at the destination, though, Jennings said “he did not feel like paying” because he was already home.
Up for debate and the trial: did he really refuse to pay or was he just arguing his fare? Was it really 60 stitches or is that an exaggeration? Did he really use ethnic slurs? I’m throwing in these questions because I want to be correct and because the overall point of my post won’t depend on these details anyway.
Not up for debate: he stabbed the cabbie, it was definitely an argument over money, and he was worried enough to go to Florida for two weeks.
Okay, now that I’ve summed this up I’m gonna connect it to charity auctions. Yes I am.
I’ve been to charity auctions myself. I want to devote an entire post to describing what such an event consists of; for now take it from me that they are orgies of self-congratulatory arrogance. And ironically, they are not at all charitable in the sense of being generous and tolerant.
They are in fact celebrations of self-centeredness, displays of careless overabundance. Yes, I’ll pay $120k to go to Australia for a week to golf, and I’ll do it for the poor children, and by the way also because I can afford to throw away such money and especially by the way because everyone in this room now knows that.
So I think it’s extra deliciously ironic that this guy went from that atmosphere to arguing with an Egyptian cabbie over a $200 fare (or maybe $300, if we want to be generous to Jennings and believe his “extortionist cabbie” sob story).
But my point is that, although the cab ride was a different atmosphere from the charity auction, his was not a different attitude at all: both parts of his evening centered on assumptions of entitlement and selfishness and the idea that he is somehow outside the regular rules and cannot be held accountable like normal people. From the article:
He then went on vacation to Florida, police said. Jennings told officers he subsequently called his lawyer after a friend told him police were looking for a suspect in the stabbing incident, according to the report.
“Jennings said he didn’t know what to do — he just wanted the whole thing to go away,” Darien Police Detective Chester Perkowski said in a court document filed with the report.
The part about the car service not showing up is absolutely key: these guys use car services a lot, and when you do that, you get used to not paying for such trivial little things as rides, or for that matter food or drinks. All such things are handed to you for free when you are this important (read: rich). Paying, writing a check or what have you, is reserved for ostentatious displays of wealth. I know hedge fund guys that don’t even carry money in their wallet because they never use cash. Actually I don’t know them personally but I know that this is true because they brag about it in the elevators.
I’m not trying to generalize this story – most Morgan Stanley execs haven’t been charged with knifing down working class cabbies. But it’s impossible for me not to see the consistency in the two events.
This is water
I just started reading Infinite Jest and it’s blowing my mind.
I’m a nerd so I had never heard of David Foster Wallace before reading his book, but now I’ve officially joined his cult. If I’m too late to this party I will start my own, one-woman cult.
As far as I’m concerned he’s the Elliott Smith of literature.
If you haven’t already, please read this, Wallace’s 2005 commencement speech to Kenyon College. I read it at work yesterday and bawled into my keyboard for about 20 minutes.
Open Models (part 2)
In my first post about open models, I argued that something needs to be done but I didn’t really say what.
This morning I want to outline how I see an open model platform working, although I won’t be able to resist mentioning a few more reasons we urgently need this kind of thing to happen.
The idea is for the platform to have easy interfaces both for modelers and for users. I’ll tackle these one at a time.
Modeler
Say I’m a modeler. I just wrote a paper on something that used a model, and I want to open source my model so that people can see how it works. I go to this open source platform and I click on “new model”. It asks for source code, as well as which version of which open source language (and exactly which packages) it’s written in. I feed it the code.
It then asks for the data and I either upload the data or I give it a url which tells the platform the location of the data. I also need to explain to the platform exactly how to transform the data, if at all, to prepare it for feeding to the model. This may require code as well.
Next, I specify the extent to which the data needs to stay anonymous (hopefully not at all, but sometimes in the case of medical data or something, I need to place security around the data). These anonymity limits will translate into the kinds of visualizations and results that can be requested by users but not the overall model’s aggregated results.
Finally, I specify which parameters in my model were obvious “choices” (like tuning parameters, or prior strengths, or thresholds I chose for cleaning data). This is helpful but not necessary, since other people will be able to come along later and add things. Specifically, they might try out new things like how many signals to use, which ones to use, and how to normalize various signals.
That’s it, I’m done, and just to be sure I “play” the model and make sure that the results jive with my published paper. There’s a suite of visualization tools and metrics of success built into the model platform for me to choose from which emphasize the good news for my model. I’ve created an instance of my model which is available for anyone to take a look at. This alone would be major progress, and the technology already exists for some languages.
User
Now say I’m a user. First of all, I want to be able to retrain the model and confirm the results, or see a record that this has already been done.
Next, I want to be able to see how the model predicts a given set of input data (that I supply). Specifically, if I’m a teacher and this is the open-sourced value added teacher model, I’d like to see how my score would have varied if I’d had 3 fewer students or they had had free school lunches or if I’d been teaching in a different district. If there were a bunch of different models, I could see what scores my data would have produced in different cities or different years in my city. This is a good start for a robustness test for such models.
If I’m also a modeler, I’d like to be able to play with the model itself. For example, I’d like to tweak the choices that have been made by the original modeler and retrain the model, seeing how different the results are. I’d like to be able to provide new data, or a new url for data, along with instructions for using the data, to see how this model would fare on new training data. Or I’d like to think of this new data as updating the model.
This way I get to confirm the results of the model, but also see how robust the model is under various conditions. If the overall result holds only when you exclude certain outliers and have a specific prior strength, that’s not good news.
I can also change the model more fundamentally. I can make a copy of the model, and add another predictor from the data or from new data, and retrain the model and see how this new model performs. I can change the way the data is normalized. I can visualize the results in an entirely different way. Or whatever.
Depending on the anonymity constraints of the original data, there are things I may not be able to ask as a user. However, most aggregated results should be allowed. Specifically, the final model with its coefficients.
Records
As a user, when I play with a model, there is an anonymous record kept of what I’ve done, which I can choose to put my name on. On the one hand this is useful for users because if I’m a teacher, I can fiddle with my data and see how my score changes under various conditions, and if it changes radically, I have a way of referencing this when I write my op-ed in the New York Times. If I’m a scientist trying to make a specific point about some published result, there’s a way for me to reference my work.
On the other hand this is useful for the original modelers, because if someone comes along and improves my model, then I have a way of seeing how they did it. This is a way to crowdsource modeling.
Note that this is possible even if the data itself is anonymous, because everyone in sight could just be playing with the model itself and only have metadata information.
More on why we need this
First, I really think we need a better credit rating system, and so do some guys in Europe. From the New York Times article (emphasis mine):
Last November, the European Commission proposed laws to regulate the ratings agencies, outlining measures to increase transparency, to reduce the bloc’s dependence on ratings and to tackle conflicts of interest in the sector.
But it’s not just finance that needs this. The entirety of science publishing is in need of more transparent models. From the nature article’s abstract:
Scientific communication relies on evidence that cannot be entirely included in publications, but the rise of computational science has added a new layer of inaccessibility. Although it is now accepted that data should be made available on request, the current regulations regarding the availability of software are inconsistent. We argue that, with some exceptions, anything less than the release of source programs is intolerable for results that depend on computation. The vagaries of hardware, software and natural language will always ensure that exact reproducibility remains uncertain, but withholding code increases the chances that efforts to reproduce results will fail.
Finally, the field of education is going through a revolution, and it’s not all good. Teachers are being humiliated and shamed by weak models, which very few people actually understand. Here’s what the teacher’s union has just put out to prove this point:
Economists don’t understand the financial system
Cross posted from Naked Capitalism.
A bit more than a week ago I went to a panel discussion at the Met about the global financial crisis. The panel consisted of Paul Krugman, Edmund Phelps, Jeffrey Sachs, and George Soros. They were each given 15 minutes to talk about what they thought about the Eurocrisis, especially Greece, the U.S., and whatever else they felt like.
It was well worth the $25 admission fee, but maybe not for the reason I would have thought when I went. I ended up deciding something I’ve suspected before. Namely, economists don’t understand the financial system, and moreover they don’t get that they don’t get it. Let me explain my reasoning.
The panelists all are pretty left-leaning guys, and each of them basically talked about how the U.S. government should stimulate the economy in one way or another. Krugman kept saying that hey, this isn’t too hard, we’ve seen financial crises before, and this is no different: we should immediately pass a massive stimulus package, that’s the one and only thing that we should be discussing. Sachs was very consistently saying we should do something else: namely, start planning long-term for the future. He focused on the percent of tax dollars going into infrastructure and basic education and research. Phelps also wanted stimulus, but he consistently referred to his own economic models in how exactly it should work. I didn’t completely follow his train of thought.
Soros was the most interesting of the four, in my opinion. He started by saying that we should all acknowledge that, as nice as it would be to think we can model the economy and feel control over the situation, this is a pipe dream and we should get used to not really knowing what will happen when we do one thing versus another. He suggested that we should instead work together to develop a theory, or perhaps even an philosophy, that assumes uncertainty itself. He ended by saying that, even with the three colleagues on the panel with him, who are essentially all united in thinking we need to be proactive, his ideas are essentially being ignored.
The rest of the evening essentially consisted of everyone ignoring Soros and arguing about how Keynesian they all were and how exactly different kinds of stimulus would work and which way they should use 2% of GDP to jumpstart the world’s economy. So basically exactly what Soros said would happen.
It got me more and more riled up. Here are these expert economists, two of whom have Nobel Prizes and the third who runs the Earth Institute at Columbia and is considered a huge swinging dick in his own right, and they don’t seem to acknowledge how much power they actually have over the situation (specifically, not much). For that matter, they clearly don’t know the nitty gritty of the financial system. To listen to them, all you need to do is spread a thick paste of money on the system and it would revive whole cloth. Soros is the exception, probably for the reason that he actually traded and made money inside the system.
At the end I asked a question, since they allowed a few questions, and as you know I’m not shy. I asked how we are going to make the system simple enough to actually make it possible to regulate it. Krugman basically said that Dodd-Frank is going to do it. My conclusion from that is that Krugman must really have only an outline in his head of how this stuff works- the devil, as we know, is really in the detail, and I’m too acquainted with the Volcker Rule’s list of exemptions to have a lot of hope on this score. To be fair, Phelps mentioned Amar Bhide’s book A Call for Judgment, which I’m reading and seems pretty good and at least addresses this exact issue head-on.
Overall, the evening brought me back to the credit crisis, and working at D.E. Shaw, when Larry Summers was consistently quoted at the firm as saying that the “magical liquidity fairy” needed to come and “spread some magical liquidity dust” in the markets to make everything better. No, I’m not kidding.
What I felt then and what I still feel is that these super influential economists are so high on their clean, simple economic models of the world (about the only variables of which are GDP, stimulus, and tax rates) that they focus on the model to the exclusion of the secondary issues. Sometimes you get important results this way: simplifying models can be really useful. But sometimes it’s really truly misleading to do so, and I believe this is one of those cases.
I’m left thinking that they (the economists) are so entranced with their simplified world view that still don’t understand what actually fucked up the world in 2007 and 2008, namely the CDO market’s implosion. Message to Krugman: this is not exactly like other financial crises, because it’s partly caused by complexity, and nobody seems to have the balls to fix it. The problem is that the financial system has been allowed to get so complicated and so rigged in favor of the people with information, that normal people, including homeowners, credit card users, politicians, and regulators have been left in the dark, and many of the little guys are still stuck in ludicrous contracts left over from the outrageous securitizations that took place in the CDO market.
What is especially enraging is how these same economists are still the experts that people turn to to help figure out how to get out of this mess, when they don’t actually understand the mess itself. Why else would a large audience be willing to pay $25 a piece to hear them talk about this? Why else would Obama be considering Larry Summers to lead the World Bank?
As an aside: please, Mr. President, do not let Summers lead the world bank. He does not understand the system well enough to lead it. And he is too arrogant to admit what he doesn’t know. I can introduce you to a bunch of people that may be less imposing but are more informed, more ethical, and wiser. Give me a call any time and we can chat and form a short list of candidates.
By the way, I’m not saying we shouldn’t have a major stimulus, or that we shouldn’t do longer term planning and invest more in infrastructure. I think we should do both. But I also think those efforts will be futile unless we enforce a basic system that is simple enough to be regulated. Otherwise we will be reliving this entire ordeal in another 15 years.
Vikram Pandit: let’s talk
Here’s the coverage from Business Insider.
Here’s the letter (also posted on Naked Capitalism):
Dear Mr. Pandit,
Last October, in an interview with Fortune Magazine, you extended an invitation to Occupy Wall Street for a face-to-face meeting. The Alternative Banking Group, an official working group of Occupy Wall Street, hereby accepts.
As CEO of Citigroup, you recently announced “a new Citi.” You said that you are now “working hard to create a culture of responsible finance.” Our mission as the Alternative Banking Group is exactly the same. We look forward to a fruitful dialogue.
Since this conversation is of importance to the general public, we will have a small camera crew with us to document it. The video will be shared on the websiteoccupy.com, an emerging media platform for the Occupy movement.
Please respond to this email at your earliest convenience to schedule a time and place.
Sincerely,
Cathy O’Neil
Facilitator
The Alternative Banking Group
Occupy Wall Street
Please comment with questions we can ask Vikram if he accepts our offer.
Math-Startup Collaborative at Columbia tomorrow
The Columbia Chapter of SIAM (Society for Industrial and Applied Mathematics) invites YOU to:
Spring 2012 Math-Startup Collaborative Wednesday, February 29 | 6:00 – 8:00 PM | Davis Auditorium, Schapiro Center
Meet Columbia alumni from NYC startups, including Bitly, Foursquare, and Codecademy, and learn about the role of math and engineering in their companies. Students interested in startup careers and internships, or those simply curious to see how math is applied in the wild, are especially welcome. The event, which consists of a series of presentations from our startup members, will be followed by a reception (with free food!). This is sure to be the largest applied math event of the semester!
Startup presenters include:
- Bitly –> http://blog.bitly.com/
- Foursquare –> https://foursquare.com/about/
- Codecademy –> http://blog.codecademy.com/
- BuzzFeed –> http://www.buzzfeed.com/about
- Sailthru –> http://blog.sailthru.com/
Space is limited! Contact Ilana Lefkovitz (itl2103@columbia.edu) for more information.
Please RSVP at bit.ly/Math–Startup-RSVP-2012
Co-sponsored by the Application Development Initiative (ADI). Reception sponsored by AOL Ventures.
Math teaching needs overhaul
My friend Tara sent me a message:
The President’s Council of Advisors on Science and Technology submitted a report on the challenge to producing more college graduates with STEM degrees. In particular, they point out mathematics as a bottleneck, and recommend (on p. 29) that “teaching and curricula [be] developed and taught by faculty from mathematics-intensive disciplines other than mathematics, including physics, engineering, and computer science.” Of course, there are physicists, engineers, and computer scientists on the Council, whereas there is no mathematician.
On some level, they do have a point. They seem to say that we (as a nation) are not doing a good job of teaching K12 mathematics. I strongly disagree with their conclusion that we should therefore take the college-level teaching of mathematics away from the experts in mathematics.
Hmm. I don’t know. I’ve been sounding a warning for a while now that math departments are way too complacent about the way they teach undergrads. I try to make people think of a math department as, to some extent, a brand, and that we should be trying to attract good majors and we should be trying to get more people psyched about math. To that end I am constantly trying to get people to care about the calculus curriculum, which is always at risk of being taken over by the physics, engineering, and economics departments, and I’ve consistently introduced “introduction to higher math” courses which explicitly teach proof techniques.
But there’s a major problem, at least in the very top research departments. Namely, the professors actually think math should be a hard and elite major, and that gives them an excuse to not care about the quality of the undergrad classes. That’s not how they say it, of course, but my experience is that’s how it works.
The other reason I think it makes sense to be a bit concerned about the brand is that if we mathematicians don’t start doing it, then someone else will start doing it for us. This President’s Council of Advisors report is exactly saying that. On the one hand it could be the kick in the ass that math departments need, but on the other hand considering how much reporting they are asking for, it could mean a tremendous amount of paperwork as well as a loss of independence of the math community.
I say mathematicians respond to this by admitting there’s a problem and coming up with a good plan that they organize and control. Otherwise I do think something else will and should be done.
Interestingly, there also seems to be a call in this report for more good math tutors. It reminds me of a commenter from yesterday who wants to start something called “Tutor for America”, which I think is an excellent idea.
Teaching scores released
Anyone who reads this blog regularly knows how detestable I think it is that the teacher value-added model scores are being released but the underlying model is not.
We are being shown scores of teachers and we are even told the scores have a wide margin of error: someone who gets a 30 out of 100 could next year get a 70 out of 100 and nobody would be surprised (see this article).
Just to be clear, the underlying test doesn’t actually use a definition of a good teacher beyond what the score is. In other words, this model isn’t being trained by looking at examples of what is a “good teacher”. Instead, it derived from another model which predicts students’ test scores taking into account various factors. At the very most you can say the teacher model measures the ability teachers have to get their kids to score better or worse than expected on some standardized tests. Call it a “teaching to the test model”. Nothing about learning outside the test. Nothing about inspiring their students or being a role model or teaching how to think or preparing for college.
A “wide margin of error” on this value-added model then means they have trouble actually deciding if you are good at teaching to the test or not. It’s an incredibly noisy number and is affected by things like whether this year’s standardized tests were similar to last year’s.
Moreover, for an individual teacher with an actual score, being told there’s a wide margin of error is not helpful at all. On the other hand, if the model were open source (and hopefully the individual scores not public), then a given teacher could actually see their margin of error directly: it could even be spun as a way of seeing how to “improve”. Otherwise said, we’d actually be giving teachers tools to work with such a model, rather than simply making them targets.
update: Here’s an important comment from a friend of mine who works directly with New York City math teachers:
Thanks for commenting on this. I work with lots of public school math teachers around New York City, and have a sense of which of them are incredible teachers who inspire their students to learn, and which are effective at teaching to the test and managing their behavior.
Curiosity drove me to it, but I checked out their ratings. The results are disappointing and discouraging. The ones who are sending off intellectually engaged children to high schools were generally rated average or below, while the ones who are great classroom managers and prepare their lessons with priority to the tests were mostly rated as effective or above.
Besides the huge margin of uncertainty in this model, it’s clear that it misses many dimensions of great teaching. Worse, this model, now published, is an incentive for teachers to develop their style even more towards the tests.
If you don’t believe me or Japheth, listen to Bill Gates, who is against publicly shaming teachers (but loves the models). From his New York Times op-ed from last week:
Many districts and states are trying to move toward better personnel systems for evaluation and improvement. Unfortunately, some education advocates in New York, Los Angeles and other cities are claiming that a good personnel system can be based on ranking teachers according to their “value-added rating” — a measurement of their impact on students’ test scores — and publicizing the names and rankings online and in the media. But shaming poorly performing teachers doesn’t fix the problem because it doesn’t give them specific feedback.
If nothing else, the Bloomberg administration should also look into statistics regarding whether it’s become a more attractive or less attractive profession since he started publicly shaming teachers. Has introducing the models and publicly displaying the results had the intended effect of keeping good teachers and getting rid of bad ones, Mayor Bloomberg?
I am the most boring person in the world
A few nights ago I went to a CFPB Town Hall Meeting after work. The discussion in the kitchen that morning went something like this:
me: “I’m going to be late tonight, guys, because I’m going to a CFPB Town Hall meeting… I’m really excited about it!”
my husband: “What the hell is CFPB?”
me: “Oh, it stands for the Consumer Financial Protection Bureau. You know, the thing that Elizabeth Warren started but then didn’t get to be in charge of? I need to go see if this guy Cordray is going to be pushy enough to lead an effective government agency. Today the issues at the meeting are things like checking accounts and debit cards and overdraft policies. I totally need to go, can you guys eat leftovers?”
my husband: “You are the most boring person in the world”
my three sons, simultaneously: “Yeah mom, he’s right. You are the most boring person in the world.”
Whatever. I guess they’re right, but I went anyway. After lots of incredibly congratulatory introductions, including a 5 minutes speech from New York Attorney General Eric Schneiderman, there were a bunch of questions from the audience.
There were lots of community groups represented, as well as individuals. Two themes emerged through the questions that seem like particularly egregious consumer issues affecting poor people.
First was the issue of pre-paid debit cards and the corresponding fees. This guy stood up at the microphone and described his friend who get a debit card for child support, court-ordered. But this debit card extracts enormous fees every time she takes money out, including things like $5 just to check the balance. The guy was saying, you know my friend needs that money for her children, and it’s not fair that so much of it goes to fees- it’s abusive. I was totally crying. I mean, I’m an easy cry, but still. That wasn’t the only story about such debit cards where there was no choice in the matter but the fees were extortionist.
Second the issue of Walmart issuing its pay to people in debit card form came up time after time as well. So it seems that Walmart is not only a retailer, but also a financial institution of the crappiest kind now. It issues debit cards as payment to people who don’t accept direct deposit or don’t have checking accounts, and again it seems that the money on the cards is somehow deeply tied to a fee structure. I need to look into this more (as does the CFPB) but I’m wondering off the top of my head whether people can just demand to be paid in cash instead. It’s like these people are being paid really badly, with very few benefits, and even when they get paid they’re being nickeled and dimed every step of the way. It’s like it’s not really their money even then.
So in other words, debit cards are the new check cashers, but maybe worse since their fee structure doesn’t seem to be as transparent.
Of course, I took the opportunity to ask a question too, since I am not shy. And I was told not to ask a question but rather to tell a story, but whatever, I decided to phrase is as “making three suggestions.” After introducing myself as coming from the Alternative Banking group, I mentioned the following:
- The CFPB should use its powers to bring together mortgage investors and homeowners to the same table, in order to align their interests and bypass the banks as servicers, since the banks are only endlessly delaying the process in order to extract fees.
- I mentioned that our group is working on a “find a credit union app” but that the CFPB should really be doing that with us, to help underbanked people find alternatives to crappy banking solutions (like debit cards).
- I mentioned that we had submitted a public comment letter demanding that the credit score models be open sourced, since there was no legitimate reason for such models, which directly affect consumers in their daily lives, to be kept proprietary.
Akshat was there too, from Occupy the SEC, and he asked about the Volcker Rule.
Condray took notes. I mean, what’s he going to say.
Well actually sometimes he did say stuff, like to Akshat, and for the most part it was something along the lines of, “that is not in our jurisdiction”, although there was one exception when he talked about how Walmart, being a retailer, is not in his jurisdiction but since it’s acting as a banking institution it actually is.
Overall I’m a bit disappointed. Although I did certainly like the fact that he held a town hall meeting at all, I am worried that he’s just too nice, and that he’s going to try to please everyone and be kind of wishy-washy. I would have loved to see him manage to sustain disgust at the abuses he was hearing about, but instead he sounded more concerned than angry, and I would put my money on angry any day. I want the CFPB to be led by a son-of-a-bitch that pisses people off and constantly tried to enlarge his jurisdiction rather than keeping well inside the lines. Time will tell.
It’s all mom’s fault
Maybe it’s because I grew up with an unapologetic working mother, but I am confused and enraged by all the cultural norms concerning mothers and how everything is their fault.
When I grew up in the 1970’s I had all sorts of role models of mothering. I was lucky to live next door to Sally, I met MA (Mary Ann) in puberty, and of course there was my own mom. All of these women were fiercely devoted to their choices: Sally and MA stayed home with their young kids but as their kids grew up, devoted more and more time to other things. My mom was a computer science professor my entire life. It goes without saying (but just for the record I’ll say it here) that I support people doing what they want to and need to for their own private reasons, no questions asked.
Sure, there were differences in interactions between my mom and these other surrogate moms. My mom didn’t have a lot of extra time to shop or cook, for example. But on the other hand she was a great role model for me in showing me how to be happy with what you do and have kids at the same time. And some things she didn’t have time for I was lucky enough to get from other things and people.
Here it is, thirty years later, and lots things have changed for working mothers. Some things have gotten easier: there’s online shopping, so I can provide my three sons with clothes and food without leaving home, which was a major struggle for my mom. Some things have gotten harder: school and daycare has gotten more expensive (more on that below). Other things haven’t changed so much, which itself is strange.
Here’s an article that got me pissed off enough to write this post. It’s a New York Times piece about an Olympic swimmer who, after taking time off and having two children, has returned to swimming and is actually competitive at the age of 40. I am so completely impressed by her, but for some reason the Times sees it as appropriate to deliver the following lines:
Evans said she had been criticized on social networking sites for training when she should be home with her children. But she has set up her schedule so her main swimming workout takes place in the morning, from 5:30 to 7:30, so she can make it home in time for breakfast. Her crazy hours are not lost on her daughter, who recently asked, “Why do you swim in the dark, Mommy?”
…
Willson’s job in technology sales allows him to work from home. He can chip in with the children when needed and behold the force of nature that is his wife.
First of all, how is it appropriate to mention idiots on Facebook? It is so entirely defensive and out of place. If I’m training for the Olympics, probably for the very last time in my life, my kids will be psyched for me to do my best, even if it means missing breakfast sometimes. And why is there always a mention of the martyred husband? Just imagine this was a male swimmer coming back to the Olympics after not swimming for 15 years, do we hear about his wife? No we don’t. Ridiculous, and the New York Times should do better. If they mention idiots on Facebook, they should also mention how they are idiots.
Here’s another story that got me incredibly pissed (if you were looking for a happy post this morning, I apologize). It’s about a public ad campaign in Georgia with billboard pictures of fat kids looking unhappy. This is insane and insulting on so many levels I don’t really know where to start, but let me start with the intended target: the mom. Yes, it’s mom’s fault that there are fat kids, and these billboards are telling mom not to let their kids get fat.
As an aside, it’s also now officially okay to blame mom for making her kids fat, as it’s also officially okay to blame the kids themselves. It’s government-sponsored bullying. Never mind the fact that they’ve shown nutrition education and exercise doesn’t actually cause people to lose weight (i.e. understanding where calories are hidden in food doesn’t magically make them leave cheeseburgers). Never mind that nobody has come up with a viable plan for how to address this issue. Let’s blame moms anyway, because then we are taking this issue seriously.
It makes you wonder why women want to become moms at all considering all the things we are signing up for. Oh and wait, actually lots of women aren’t having kids, but interestingly a recent paper came out showing women who are highly educated are having more kids. Here’s the abstract for that paper:
Conventional wisdom suggests that in developed countries income and fertility are negatively correlated. We present new evidence that between 2001 and 2009 the cross-sectional relationship between fertility and women’s education in the U.S. is U-shaped. At the same time, average hours worked increase monotonically with women’s education. This pattern is true for all women and mothers to newborns regardless of marital status. In this paper, we advance the marketization hypothesis for explaining the positive correlation between fertility and female labor supply along the educational gradient. In our model, raising children and home-making require parents’ time, which could be substituted by services bought in the market such as baby-sitting and housekeeping. Highly educated women substitute a significant part of their own time for market services to raise children and run their households, which enables them to have more children and work longer hours. Finally, we use our model to shed light on differences between the U.S. and Western Europe in fertility and women’s time allocated to labor supply and home production. We argue that higher inequality in the U.S. lowers the cost of baby-sitting and housekeeping services and enables U.S. women to have more children, spend less time on home production and work more than their European counterparts.
Also interesting is this interview, where they describe the results of another paper which tracked women vs. men in various fields of science, including math. It looks like evidence for my post about meritocracy and horizon bias, i.e. the idea that women self-select out of certain fields because they are just not very appealing. From the interview:
The women who come in to academic science careers tend to be so highly motivated that they stay. They limit the number of children they have. Other studies have shown that female academics have fewer children than other professional women, such as lawyers. Female graduates see women scientists working very hard in what they feel are less fair conditions, and it puts them off. Societal factors also make it harder for women to have such demanding careers–women tend to manage family problems, for example.
By the way, I am not insufferably sad about mothers and their fates. I make fun of mothers too, and this article about passive parents is one I could have written. From the article:
But seriously, what is the deal with asking our children to behave? “Maybe you should get down?” What the hell is wrong with you lady? She’s four. There’s no room for negotiating here. I’m all for giving my kids choices to make them feel like they’re in control of something, blah, blah, blah, but this is not the time. “Maybe” should be reserved for times like: “Do you want to wear a dress today or MAYBE a skirt?”
I could go on and on about the passivity of modern yuppie parents, and I’d be right (hey I live in the Upper West Side so you know I’d be right). But if you think about it for a minute, this is just another manifestation of the same thing: it’s all mom’s fault. These women are performing a mother role instead mothering from the stomach, and it’s because they are made insecure by all the incredible bullshit out there about how to be a good mom and what other people are going to think if they scream at their kid in public or if their kid starts to scream. We have taught our mothers to be insecure, and to feel at fault, and oh yes, to be the target of bullying ad campaigns as well.
People, let’s get it together and solve problems instead of pointing fingers. I’m looking at you, Santorum.
Model Thinking (part 2)
I recently posted about the new, free online course Model Thinking. I’ve watched more videos now and I am prepared to make more comments.
For the record, the lecturer, Scott Page, is by all accounts a great guy, and indeed he seems super nice on video. I’d love to have him over for dinner with my family someday (Professor Page, please come to my house for dinner when you’re in town).
In spite of liking him, though, pretty much every example he gives as pro-modeling is, for me, an anti-modeling example. Maybe I should make a complementary series of YouTube comment videos. It’s not totally true, of course- I just probably don’t notice the things we agree on. But I do notice the topics on which we disagree:
- He talks a lot about how models make us clearer thinkers. But what he really seems to mean is that they make us stupider thinkers. His example is that, in order to decide who to vote for for president, we can model this decision as depending on two things: the likability of the person in question (presumably he assumes we want our president to be likable), and the extent to which that person is “as left” or “as right” as we are. I don’t know about you, but I actually care about specific issues and where people stand on them, and which issues I consider likely to come up for consideration in the next election cycle. Like, if I like someone for his “stick it to the banks” approach but he’s anti-abortion, then I think about whether abortion is likely to actually become illegal. And by the way, I don’t particularly care if my president is likable, I’d rather have him or her effective.
- He bizarrely chooses “financial interconnectedness” as a way of seeing how cool models are, and he shows a graph where the nodes are the financial institutions (Goldman Sachs, JP Morgan, etc.) and the edges are labeled with an interconnectedness score, bigger meaning more interconnected. He shows that, according to this graph, back in 2008 it shows we knew to bail out AIG but that it was definitely okay to let Lehman fail. I’m wondering if he really meant that this was an example of how your model could totally fail because your “interconnectedness scoring” sucked, but he didn’t seem to be tongue in cheek.
- He then talked about measuring the segregation of a neighborhood, either by race or by income, and he used New York and Chicago as examples. I won’t go into lots of details, but he gave a score to each block, like the census maps do with coloring, and he used those scores to develop a new score which was supposed to measure the segregation of each block. The problem I have with this segregation score is that it depends very heavily on the definition of the overall area you are considering. If you enlarge your definition of the New York City to include the suburbs, then the segregation score of New York City may (probably would) be completely different. This seems to be a really terrible characteristic of such a metric.
- My second problem with his segregation score is that, at the end, he had overall segregation numbers for Philly and Detroit, and then showed the maps and mentioned that, looking at the maps, you wouldn’t really notice that one is more segregated than the other (Philly more than Detroit), but knowing the scores you do know that. Umm.. I’d like to rather say, if you are getting scores that are not fundamentally obvious from looking at these pictures, then maybe it’s because your score sucks. What does having a “good segregation score” mean if not that it captures something you can see through a picture?
- One thing I liked was a demonstration of Schelling’s Segregation Model, which shows that, if you have a group of people who are not all that individually racist, you can still end up with a neighborhood which is very segregated.
I’m looking forward to watching more videos with my skeptical eye. After all, the guy is really a sweetheart, and I do really care about the idea of teaching people about modeling.
#OWS Alternative Banking update
Crossposted from the Alternative Banking Blog.
I wanted to mention a few things that have been going on with the Alternative Banking group lately.
- The Occupy the SEC group submitted their public comments last week on the Volcker Rule and got AMAZING press. See here for a partial list of articles that have been written about these incredible folks.
- Hey, did you notice something about that last link? Yeah, Alt Banking now has a blog! Woohoo! One of our members Nathan has been updating it and he’s doing a fine job. I love how he mentions Jeremy Lin when discussing derivatives.
- Alt Banking also has a separate suggested reading list page on the new blog. Please add to it!
- We just submitted a short letter as a public comment to the new Consumer Financial Protection Bureau regulation which gives them oversight powers on debt collectors and credit score bureaus. We basically told them to make credit score models open source (and I wasn’t even in the initial conversation about what we should say to these guys! Open source rules!!):
Creepy model watch
I really feel like I can’t keep up with all of the creepy models coming out and the news articles about them, so I think I’ll just start making a list. I would appreciate readers adding to my list in the comment section. I think I’ll move this to a separate page on my blog if it comes out nice.
- I recently blogged about a model that predicts student success in for-profit institutions, which I claim is really mostly about student debt and default,
- but here’s a model which actually goes ahead and predicts default directly, it’s a new payday-like loan model. Oh good, because the old payday models didn’t make enough money or something.
- Of course there’s the teacher value-added model which I’ve blogged about multiple times, most recently here. And here’s a paper I’d like everyone to read before they listen to anyone argue one way or the other about the model (h/t Joshua Batson). The abstract is stunning: Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I examine whether value-added estimates from three separate reading achievement tests provide similar answers about teacher performance. I find moderate-sized rank correlations, ranging from 0.15 to 0.58, between the estimates derived from different tests. Although the tests vary to some degree in content, scaling, and sample of students, these factors do not explain the differences in teacher effects. Instead, test timing and measurement error contribute substantially to the instability of value-added estimates across tests. Just in case that didn’t come through, they are saying that the results of the teacher value-added test scores are very very noisy.
- That reminds me, credit scoring models are old but very very creepy, wouldn’t you agree? What’s in them that they want to conceal them?
- Did you read about how Target predicts pregnancy? Extremely creepy.
- I’m actually divided about whether it’s the creepiest though, because I think the sheer enormity of information that Facebook collects about us is the most depressing thing of all.
Before I became a modeler, I wasn’t personally offended by the idea that people could use my information. I thought, I’ve got nothing to hide, and in fact maybe it will make my life easier and more efficient for the machine to know me and my habits.
But here’s how I think now that I’m a modeler and I see how this stuff gets made and I see how it gets applied. That we are each giving up our data, and it’s so easy to do we don’t think about it, and it’s being used to funnel people into success or failure in a feedback loop. And the modelers, the people responsible for creating these things and implementing them, are always already the successes, they are educated and are given good terms on their credit cards and mortgages because they have a nifty high tech job. So the makers get to think of how much easier and more convenient their lives are now that the models see how dependable they are as consumers.
But when there are funnels, there’s always someone who gets funneled down.
Think about how it works with insurance. The idea of insurance is to pool people so that when one person gets sick, the medical costs for that person are paid from the common fund. Everyone pays a bit so it doesn’t break the bank.
But if we have really good information, we begin to see how likely people are to get sick. So we can stratify the pool. Since I almost never get sick, and when I do it’s just strep throat, I get put into a very nice pool with other people who never get sick, and we pay very very little and it works out great for us. But other people have worse luck of the DNA draw and they get put into the “pretty sick” pool and their premium gets bigger as their pool gets sicker until they are really sick and the premium is actually unaffordable. We are left with a system where the people who need insurance the most can’t be part of the system anymore. Too much information ruins the whole idea of insurance and pooled risk.
I think modern modeling is analogous. When people offer deals, they can first check to see if the people they are offering deals are guaranteed to pay back everything. In other words, the businesses (understandably) want to make very certain they are going to profit from each and every customer, and they are getting more and more able to do this. That’s great for customers with perfect credit scores, and it makes it easier for people with perfect credit scores to keep their perfect credit scores, because they are getting the best deals.
But for people with bad credit scores, they get the rottenest deals, which makes a larger and larger percentage of their takehome pay (if they even get a job considering their credit scores) go towards fees and high interest rates. This of course creates an environment in which it’s difficult to improve their credit score- so they default and their credit score gets worse instead of better.
So there you have it, a negative feedback loop and a death spiral of modeling.
What data science _should_ be doing
I recently read this New York Times article about a company that figures out how to get the best deal when you rent a car. The company is called AutoSlash and the idea is you book with them and they keep looking for good deals, coupons, or free offers every day until you actually need the car.
Wait a minute, a data science model that actually directly improves the lives of its customers? Why can’t we have more of these? Obviously the car companies absolutely hate this idea. But what are they going to do, stop offering online shopping?
Why don’t we see this in every category of shopping? It seems to me that you could do something like this and start a meta-marketplace, where you buy something and then, depending on how long you’re willing to wait until delivery, the model looks for a better online deal, in exchange for a small commission. Then you’d have to make sure that on average the commission is paying for itself with better deals, but my guess is it would work if you allowed it a few days to search per purchase. Or if you really are a doubter, fix a minimum wait time and let the company take some (larger) percentage of the difference between the initial price and the eventual best price.
Another way of saying this, is that when you go online to buy something, depending on the scale (say it’s on the expensive side) you probably shop around for a few days or weeks. Why do that in person? Just have a computer do it for you and tell you at the end what deal it gave you. Don’t get bombarded by ads, let the computer get bombarded by ads for you.
Why I love nerds
What is it that grad students do all day? Well if you’re Zachary Abel in the M.I.T. math department, then the answer may be that you fiddle with paperclips and make awesome nerdy and beautiful sculpture (I found his page through the God Plays Dice blog). Here’s my favorite sculpture from his site:
Be sure to read the explanations he gives of the things he’s made, they are very cool and sometimes comes with animation.







