The NCAA Division I Men's Basketball Championship is a single-elimination
tournament that holds a unique place in the American sporting landscape.
Sports have long been intertwined with gambling — ask the 1919
Chicago White Sox — and football in particular has attracted a
fan base that increasingly seems to care less about who won or lost than
about who covered the point spread. But only when the Big Dance rolls
around every March do we see pretty much everyone who's watching the games,
and a lot of people who aren't, attempt to predict an entire tournament.
For instance, take this guy:
This fellow is "filling out a bracket," that is, forecasting the results
not only of the 32 scheduled games but also of the 31 hypothetical games
that would follow in the subsequent five rounds should all his picks turn
out to be correct. As noted, this ritual has spread far beyond the ranks
of college basketball fans; it's very common for a workplace to have an
"office pool" in which everyone, including the stereotypical "secretary
who makes picks based on which mascot sounds tougher," kicks in a couple
of bucks and submits a bracket. There are a number of factors that make
bracketology appealing even to civilians. One is that even people who
don't much care about sports do tend to hold opinions about the universities
in the tournament, or at the very least, about the parts of the country they
represent. (Elizabeth's interest in basketball is roughly equivalent to
my interest in shoes, yet we were able to spin a few minutes of conversation
out of the all-important question of whether northern Iowa is better than
Las Vegas.) Another is that completing a bracket is a sort of world-building
exercise — a very limited one, yes, but still, by the time you
get to the later rounds you've spun out this whole hypothetical future.
Entering your bracket in an office pool is not entirely dissimilar to
showing off your Quizilla results on Livejournal. Except that instead of
having, like, six possible outcomes, you have over nine quintillion.
But I think the main thing is that, in filling out a bracket, you're not
just wagering on a game — you're playing one. There's strategy
involved. One of the fun things about the tournament is that the vast
majority of the games are not even-strength: there's a clear favorite
and a clear underdog, and each team receives a seeding that indicates where
it falls in its region's pecking order. (This is yet another thing that makes
it friendly to outsiders, since they can always just play the odds.) So you
look at that matchup between (4) Vanderbilt and (13) Murray State
and you think, hmm, yes, Vanderbilt's probably going to win, but Murray State
probably has a better shot than the seedings would indicate... should I go for
the more likely outcome, or pick the upset and hope to score some bonus points?
And the answer to that question depends on the rule set of the pool you're in.
Which brings me to my topic for today: what's the best way to score an NCAA
tournament pool?
Obviously, it depends on what you want to reward. For instance, is it more
important to pick a lot of early games correctly, or to nail the really
important games at the end of the tournament? Many pools double the number
of points awarded per game with each successive round: 1-2-4-8-16-32. This
is how it works on espn.com, and there are a couple of rationales for it.
One is that in any given first-round matchup there are only two possible
winners, while in the championship game there are
possible candidates; since your task is 32 times harder, you should get 32
times as many points. Another is simply that the whole point of the
tournament is to anoint a champion, so if you don't even pick the champion
correctly there's no way you should win a pool. However, not everyone
agrees with this. The first tournament pool I ever saw set the number of
points awarded for each game equal to the round it was in: 1-2-3-4-5-6.
To see the effect of this, compare the following two brackets. The first,
marked "A.H.," is that of a Chicago-area teenager who
after the first two rounds, but whose subsequent picks were terrible: he
failed to correctly forecast a single Final Four team. The second, marked
"P.B.," is also an actual bracket, this one belonging to someone whose first
two rounds were fairly average (21 out of 32 correct picks in the first round,
9 out of 16 in the second) but who nailed three of the Final Four teams and
the eventual winner. Here's how each would have fared under each system:
Round: |
1st | 2nd | 3rd |
4th | 5th | 6th | TOTAL |
A.H., 1-2-3-4-5-6 |
32 | 32 |
15 | 0 | 0 | 0 |
79 |
P.B., 1-2-3-4-5-6 |
21 | 18 |
12 | 12 |
5 | 6 |
74 |
|
A.H., 1-2-4-8-16-32 |
32 | 32 |
20 | 0 | 0 | 0 |
84 |
P.B., 1-2-4-8-16-32 |
21 | 18 |
16 | 24 |
16 | 32 |
127 |
Since I think P.B. should win in this scenario, I prefer the 1-2-4-8-16-32
system. But I don't actually like either of the above rule sets, because
neither one accounts for differences in team strength. Which brings me to
the second consideration: how much to reward those who correctly pick
surprising outcomes. According to the odds I was able to find online,
if you were to have bet $100 on overwhelming underdog (14) Ohio to upset
(3) Georgetown, you would have won $1100. Bet the same amount on
(1) Duke, the eventual champion,
to defeat (16) Arkansas-Pine Bluff, which started the season 0-11, and
you would have won a grand total of $1.43. And yet standard practice in
pools is to award these picks the exact same number of points, or at best,
to award double points to an upset pick. Preposterous! Upsets are the
soul of the tournament. Anyone can pick
;
the laurels should go to those who identify which teams will outperform
their seeds. And that's why every year I enter a
that multiplies the points awarded per round by the seed of the winning
team. Pick (1) Kansas to beat (16) Lehigh in the first round and
you get a point; pick (11) Old Dominion to topple (6) Notre Dame
and you get 11. Of course, this rule runs the risk of rewarding an anti-chalk
strategy: why not predict that every first-round game will be an upset, and
clean up? This is where the 1-2-4-8-16-32 multiplier becomes especially
important. Even in an upset-heavy year like 2010, choosing all underdogs in
the first round would have left you with no points in rounds three through six.
And while those 12 points for picking (12) UTEP over (5) Butler look
pretty sweet, they pale in comparison to the
points you get for picking Butler to win five games. Especially since those
155 points actually would have been awarded and the 12 would not have been.
Let's run some more numbers. In the table below, "chalk" indicates a strategy
of always picking favorites; "anti-chalk" indicates a strategy of picking all
underdogs in the first round, then playing favorites from there (since 9-12
seeds make it to the third round way more often than 13-16 seeds do). The
espn.com system has no weighting by seed, while the dfan.org system uses the
scheme described above. Finally, "G.M." is a real bracket from 2009 that
got two of the eventual Final Four teams, (1) North Carolina and
(3) Villanova. A.H. and P.B. you already know, though I should
probably specify that the three Final Four teams P.B. correctly picked
were (1) Duke, (2) West Virginia, and, yes, (5) Butler. Here
we go:
Round: |
1st | 2nd | 3rd |
4th | 5th | 6th | TOTAL |
espn.com chalk 2009 |
23 | 28 |
24 | 16 |
16 | 32 |
139 |
espn.com anti-chalk 2009 |
9 | 2 |
0 | 0 | 0 | 0 |
11 |
espn.com G.M. 2009 |
21 | 24 |
20 | 16 |
16 | 32 |
129 |
|
dfan.org chalk 2009 |
82 | 64 |
32 | 16 |
16 | 32 |
242 |
dfan.org anti-chalk 2009 |
108 | 24 |
0 | 0 | 0 | 0 |
132 |
dfan.org G.M. 2009 |
89 | 54 |
28 | 32 |
16 | 32 |
251 |
|
espn.com chalk 2010 |
22 | 16 |
16 | 8 |
0 | 0 |
62 |
espn.com anti-chalk 2010 |
10 | 6 |
0 | 0 | 0 | 0 |
16 |
espn.com A.H. 2010 |
32 | 32 |
20 | 0 | 0 | 0 |
84 |
espn.com P.B. 2010 |
21 | 18 |
16 | 24 |
16 | 32 |
127 |
|
dfan.org chalk 2010 |
83 | 32 |
24 | 8 |
0 | 0 |
147 |
dfan.org anti-chalk 2010 |
109 | 84 |
0 | 0 | 0 | 0 |
193 |
dfan.org A.H. 2010 |
192 | 160 |
56 | 0 | 0 | 0 |
408 |
dfan.org P.B. 2010 |
83 | 54 |
36 | 64 |
16 | 32 |
285 |
As you can see, the espn.com system favors chalk over anti-chalk even
in a topsy-turvy year. Troublingly, in a more straightforward year,
it favors chalk over someone who pegs a relatively big surprise (Villanova
winning four rounds instead of two). While the dfan.org system gives
A.H. too much credit for picking early upsets and not enough of a
penalty for missing the Butler story, it does a nice job of rewarding
a balanced bracket over both chalk and anti-chalk.
In fact, I'd always considered the dfan.org system pretty much perfect, but
this year's tournament raised an interesting objection. Consider the NFL
playoffs, which use what's called a "floating bracket." In each conference,
six teams advance to the playoffs. In the first round, #3 plays #6, and #4
plays #5. The next weekend, #1 plays the weakest team that remains,
whether that's #4, #5, or #6. But NCAA brackets are fixed. If a 1-seed
makes it past the 16-seed, as has happened in every 1-16 pairing ever played,
it goes on to play the 8-9 winner, even if weaker teams remain in the field.
The flip side of this is that the winners of big first-round upsets play
weaker teams in the second round than do the winners of small first-round
upsets. So after (12) Cornell beat (5) Temple, it only had to
defeat (4) Wisconsin to make it to the third round. But after
(9) Northern Iowa defeated (8) UNLV, it had to play mighty
(1) Kansas, a much more daunting task. When Cornell won its
second-round game, it was surprising, but not overwhelmingly so: of the
4.8 million people who entered brackets in the espn.com contest,
11.3% had Cornell advancing to the Sweet Sixteen. But when Northern Iowa
won its second-round game, dismissing the overall favorite, it was
shocking. Only 0.9% of entrants had tabbed the Panthers to knock
out the Jayhawks. So why did Cornell's victory merit 24 points and
Northern Iowa's only 18?
This got me thinking: what if each game were worth a set number of points,
using the 1-2-4-8-16-32 system, but then those points were divided up among
all entrants who predicted a given winner? So right now I'm looking at a
pool on dfan.org — not the one I play in — with 29
entrants. All 29 picked Syracuse to win its second-round game; for this
each would receive 2/29 or 0.069 points. Five people said Washington would
do the same; they would receive 2/5 or 0.400 points. And the single brave
soul who had St. Mary's winning a second-round game would get a full 2/1 or
2.000 points for that prediction. This seemed kind of cool, but I didn't
like the way the decimals got really small. Then it occurred to me that you
could achieve the same effect by taking the number of points corresponding to
a given round and dividing by the percentage of people who had chosen a
particular outcome. In this pool that would mean 2 for Syracuse, 11.6
for Washington, and a big 58 for St. Mary's.
Now, I imagine that such a system would encourage people to take more oddball
choices in hopes of being one of the select few to benefit from them, which
could really distort the percentages in small pools. In large ones, though,
I think the effect would be positive. Michael Lewis has been promoting his
new book The Big Short all over the place, and while I haven't read
it, I've heard quite a few of his interviews. He says that one of the key
concepts in the book is that people tend to drastically underestimate
the likelihood of unlikely events. "Lehman Brothers collapse? That'll
never happen!" Well, no — it might be unlikely to happen,
but it could, and that means you probably shouldn't stake your entire
financial system on the firm's survival. To return to basketball, take the
1993 NBA draft lottery. The system in place at the time took the eleven
teams that didn't make the playoffs and assigned each one a number of
ping-pong balls: eleven for the worst team, ten for the second-worst, and
so on up to the best non-qualifying team, which got only one. In 1993,
that team was the 41-41 Orlando Magic — and it won. Onlookers
gasped, Bob Costas declared the result "unbelievable!", and Magic general
manager Pat Williams called it "an absolute miracle." A one-in-66 chance
is a longshot, certainly, but a miracle? A miracle is your pants spontaneously
turning to gold, not "hey, your birthday is the same week as mine." Another
example: the 2009-10 NBA season is currently drawing to a close, and I
thought I'd compare the near-final standings to the predictions of ESPN's
team of analysts. First up: the Milwaukee Bucks, currently ensconced in
6th place in the East after spending a fair amount of time in 5th. What
did the ten prognosticators selected by the Worldwide Leader foresee? 13th,
14th, 14th, 14th, 14th, 14th, 15th, 15th, 15th, 15th. Not one was
willing to take a flyer on the Bucks? Now let's look at the team sitting in
14th as I type this, the Washington Wizards. What said the prophets? 4th,
5th, 6th, 6th, 6th, 6th, 6th, 7th, 8th, 9th. A bit more of a range, but
still closely clustered around 6th, and all completely wrong. How about
out west, where the Oklahoma City Zombie Sonics hold down the 6th spot?
ESPN's finest basketball minds came up with 9th, 10th, 11th, 11th, 11th, 11th,
11th, 12th, 12th, and 13th.
There was a funny bit in a Dollhouse episode
in which Topher, needing to be in two places at once, uploads his mind into
Victor's body and thereby creates a mental clone of himself. After their
caper is over and they reconvene, Victor asks to keep Topher's brain patterns
so he can help with the project and offer a second expert opinion. Topher
replies: "It wouldn't be a second opinion! It'd be the same opinion twice!"
What's the point of bringing together a huge panel if it's just going to be
an echo chamber? You can sit there like a Bush Administration official and
say, "But no one could have predicted that Andrew Bogut would start playing
like a #1 overall pick and that Brandon Jennings would be a contender for
rookie of the year!" and "No one could have predicted that Gilbert Arenas
would start waving guns around and get suspended for the year!" But there
are always a couple of teams that do much better than expected and
there are always a couple of teams that suffer an unexpected collapse. The
whole point of a prediction panel should be to guess which teams those will
be! And the same is true for the NCAA tournament. Was it more likely that
Kansas would make the final than that Butler would? Sure. But it was not,
as the espn.com entrants seem to have thought, 198 times more likely.
So I'm all for a system that discourages groupthink.
There is of course no way of telling what the percentages would look like
for a contest such as the one I'm talking about. But using the espn.com
numbers, here are some of the point totals that would have been awarded in
2009:
Round: |
1st | 2nd | 3rd |
4th | 5th | 6th |
(1) North Carolina |
1.01 | 2.11 |
4.77 | 13.2 |
39.3 | 118 |
(2) Michigan State |
1.01 | 2.48 |
8.75 | 59.3 |
271 | 0 |
(3) Villanova |
1.02 | 2.70 |
10.6 | 66.1 |
0 | 0 |
(3) Missouri |
1.03 | 2.96 |
22.0 | 0 | 0 | 0 |
(12) Arizona |
2.25 | 22.5 |
0 | 0 | 0 | 0 |
(13) Cleveland State |
14.9 | 0 | 0 |
0 | 0 | 0 |
(4) Wake Forest |
0 | 0 | 0 |
0 | 0 | 0 |
You can make a good argument that this system rewards someone who makes
the incorrect prediction that the result of the championship game would
be, say, Michigan State over Pittsburgh, more than it rewards someone who
makes the somewhat more accurate prediction that it would be North Carolina
over Connecticut. That's why I said that what you consider the best
system depends on your priorities. North Carolina was supposed to win
six games. It did win six. And yes, someone who predicted that deserves a
fair number of points for not forecasting an upset that never materialized.
But Michigan State was only supposed to win three games, and it won five.
Predicting that is, to me, a more impressive feat.
Now compare these numbers to those available in the more exciting tournament
of 2010:
Round: |
1st | 2nd | 3rd |
4th | 5th | 6th |
(1) Duke |
1.01 | 2.36 |
5.43 | 18.9 |
94.7 | 500 |
(5) Butler |
1.42 | 5.05 |
70.2 | 500 |
5330 | 0 |
(5) Michigan State |
1.11 | 3.88 |
83.3 | 381 |
0 | 0 |
(6) Tennessee |
1.32 | 10.1 |
57.1 | 0 | 0 | 0 |
(9) Northern Iowa |
1.92 | 222 |
0 | 0 | 0 | 0 |
(10) St. Mary's |
2.34 | 54.1 |
0 | 0 | 0 | 0 |
(11) Washington |
3.86 | 22.7 |
0 | 0 | 0 | 0 |
(12) Cornell |
3.03 | 17.7 |
0 | 0 | 0 | 0 |
(11) Old Dominion |
5.81 | 0 | 0 |
0 | 0 | 0 |
(14) Ohio |
41.7 | 0 | 0 |
0 | 0 | 0 |
(1) Kansas |
1.00 | 0 | 0 |
0 | 0 | 0 |
(3) Georgetown |
0 | 0 | 0 |
0 | 0 | 0 |
This isn't perfect either, but I like it. It rewards people for predicting
the key stories of the tournament. In 2010, one of those stories was that
Kansas was supposed to win six games and won one. Putting the correct
winners where most people penciled in Kansas earned huge bonus
points — Northern Iowa's eye-popping 222 is the obvious one,
but look at how Michigan State goes from 3.88 for advancing to the Sweet
Sixteen to 83.3 for continuing into the Elite Eight. You'd get more than
twice as many points for choosing Tennessee to do the former, but only
two-thirds as many for choosing Tennessee to do the latter... because
Kansas wasn't in Tennessee's octet! Another big story, of course, was
Butler. Butler went into the tournament as the #8 team in the country in
the ESPN/USA Today poll, but out of 4.8 million entries in the
espn.com contest, only 12,316 put it in the championship game. Now, did
someone who predicted a final of Butler over East Tennessee State deserve
to beat someone who, like P.B., picked the winner and put Butler in the
Final Four, but didn't put Butler in the championship game? Mm, maybe not.
So that 5330 might be pushing it. But note that it too is a product of
Kansas Effect. Putting Butler in the Final Four only gets you as many
points as picking 1-seed Duke to win the whole thing. But putting Butler
in the championship game, well, that would mean picking against Kansas!
Well over half the entrants — 59.3%! — had Kansas
there. Perhaps more would have gone for Butler had they known that
and Kansas only 27. And, as discussed, I would consider that side effect
a big plus.
To close, here's how this system would have calculated the scores of
those brackets covered above:
Round: |
1st | 2nd | 3rd |
4th | 5th | 6th | TOTAL |
chalk 2009 |
26.3 | 36.7 |
36.5 | 34.8 |
39.3 | 118 |
292 |
anti-chalk 2009 |
41.0 | 22.5 |
0 | 0 | 0 | 0 |
63.5 |
G.M. 2009 |
27.9 | 31.1 |
30.2 | 79.3 |
39.3 | 118 |
326 |
|
chalk 2010 |
27.9 | 20.3 |
21.9 | 18.9 |
0 | 0 |
89.0 |
anti-chalk 2010 |
71.3 | 317 |
0 | 0 | 0 | 0 |
388 |
A.H. 2010 |
99.2 | 367 |
86.6 | 0 | 0 | 0 |
553 |
P.B. 2010 |
26.9 | 38.7 |
85.6 | 546 |
94.7 | 500 |
1290 |
And there you have it. Chalk beats anti-chalk in a boring year, but
someone who backs the right underdog can still beat chalk: G.M. missed
the bonus points associated with Michigan State, but even the Villanova
points were enough to boost him past the all-favorites approach. In a
more interesting year, anti-chalk beats chalk, but more importantly,
someone who backs the right underdog beats both: P.B. missed the bonus
points associated with putting Butler in the championship game, but
even the Final Four points were enough to boost him past the gimmicks.
And the fact that he was able to find the correct underdog at the
end of the tournament rather than just during the first weekend
gave him the edge over A.H. as well. So if I had to vote for a system
it might well be this one.
Of course, all of this goes out the window next year when the NCAA
destroys the tournament by going to 96 teams.
Return to the Calendar page!
|