Thursday, April 01, 2010

Confessions of a Duke Alum

It is time for the truth: the reason why Duke is the #1 team in the nation is because we at the Immaculate Inning paid off Ken Pomeroy.

Since 2007, these e-pages have been graced with seemingly logical discussion of the Duke University men's basketball team. We dissect the game from various angles using the statistics-- and those stats, facts based on what happened in each game, come from one website: kenpom.com. The obsession was so obvious that one time in 2007 the blog was briefly shut down by Google because they thought it was a spam link generator:

Re: #118781655 Blogger beta non-spam review and verification request: mehmattski.blogspot.com

Hello,

Your blog has been reviewed, verified, and cleared for regular use so that
it will no longer appear as potential spam. If you sign out of Blogger and
sign back in again, you should be able to post as normal. Thanks for your
patience, and we apologize for any inconvenience this has caused.

Sincerely,
The Blogger Team

Over the years, some have wondered how it is that mediocre Duke teams were still ranked near the top of Pomeroy's rankings. All this season, fans dismissed the rankings because "everyone knew" that Kansas, Kentucky, and Syracuse were the best teams in the land. Yet, for all but a few weeks of 2010, the Duke Blue Devils were ranked #1 on kenpom.com; no ranking so obviously flawed could be trusted! Well, now you know the reason; with our beloved Blue Devils trashed in the national media for being "alarmingly unathletic," and unable to compete because of our racist coach, we wanted to feel like we were the best at something.

Some of you may ask for proof, so here is the little "Christmas Present" I gave to Mr. Pomeroy. His demands were quite specific.

Most of you will not be surprised by this revelation; it is of course common knowledge that Duke receives special treatment from the NCAA, CBS, and the officials (Georgia Tech superfreshman Derick Favors said after the ACC Championship game: "It was very frustrating. We played good defense, and the referees bailed them out.") It's such common knowledge that even after six seasons of "adjustment" by the officials, keeping Duke out of the Final Four, the first explanation for Duke's trip to Indianapolis was that "Duke Gets All the Calls." Of course we do, and there's a very good reason for that too! First, the facts:

It's no secret that referees are corruptible. Tim Donaghy, who was caught wagering on the basketball games he officiated, has claimed that the problem is not an isolated one; he claims that 13 NBA officials are involved in wagering on the game. He further accuses the NBA of "turning a blind eye," because they are more interested in the money than in fairness. The distrust this has created in the casual sports fan has trickled down to mingle with the "Duke Gets All the Calls" meme; no longer is it ridiculous tinfoil hat talk. Real referees are really swayed by real cash.

It's also no secret that Duke alums make a lot of money; in 2009 USA Today ran a "payscale bracket" which picked teams based on median graduate salary. Duke won, and it wasn't close. The Duke Endowment, despite the economic downturn, is still worth nearly $3 billion. The Duke Annual Fund employs banks of undergraduates to call previous donors; this year I made a donation that the university recorded as going to "The Nicholas School of the Environment," and they even made it look good by sending me a thank you letter. All of it was done with a wink, and an understanding.

Truth #2: Every year, Duke alums make a donation to the "Referee Fund," which goes directly to the National Association of College Basketball Referees. Our sizable contribution is made with good faith understanding that it will be paid back with whistles.

We are human, though; we do feel a slight bit of guilt every time we write that check. For six years we had to sit on our hands because the officials said that it would be too obvious to hand Duke a title in 2006, so soon after the officials handed them the 2004 and 2001 Final Four runs. Many alums canceled their contributions after the Duke-UConn game in 2004, and the Duke-LSU game in 2006. It seemed that in those games, the referees were blatantly defying their loyal contributors!

Finally, we have some retribution. Handed the easiest bracket since UNC "won" the 1924 national championship by playing exactly zero postseason games, Duke sailed to the Final Four beating a couple of high school teams and your mother's quilting club. It almost wasn't enough, so the refs did have to step in and prevent Baylor from rebounding any of their (fairly frequent) missed shots. Now, we are two games away from a fourth national championship. But at what cost? We felt it was time to stand up for what was right and come clean.

To the referees in the Final Four: May you call every touch foul and carry on Duke; may every block-charge call go against us. May you clear your conscience and hand West Virginia twice as many free throws. It is your massive control over the game-- more than the coaches, more than the players themselves, you referees always decide the outcome of every game. So, in the name of justice, and as a Duke alum and current student who will be in attendance, please make West Virginia win on Saturday. Only then can we rest in peace.

Wednesday, March 17, 2010

2010 NCAAT Fan Anxiety Matrix

Simulating the NCAA tournament has been pretty popular this year, although I imagine that many of the Duke haters out there are not very satisfied with the results. The results, as expected, agree with the discrete log5 projections published by Basketball Prospectus (Click the links for South, East, West, and Midwest region log5 predictions). The difference between those predictions and my simulations, is the type of data that can be pulled out of the simulations.

For instance, below is a list of the five teams that all 64 teams in the field are most likely to end their seasons against. For example, while Duke was found to win the whole tournament in 24% of simulations, who did they lose to in the other 76% of the one million simulations?




As you can see, California is the team most likely to send Duke home with a disappointing season (and give plenty of bloggers and media mouthpieces lots to puff their chests about). It's not surprising to see that Kansas is one of the top five teams to knock off Duke, since Duke averaged 3.5 wins per simulation, and got to the final game very often, as did Kansas.

We're calling this presentation the Fan Anxiety Matrix. Which team is the most likely to knock out your favorite team? The second tab of the spreadsheet above shows every team's chances of being knocked out by every other team. Click here to see the full spreadsheet.

With a focus on the ACC teams, here's some pie charts to gaze at. Maryland fans will be pleased to see that Duke has only a 2% chance of ending their season. Unfortunately, there is only a 2% chance of winning the national championship as well. The other four ACC teams have an uphill battle to make it until the second weekend (click a Fan Anxiety Index to enlarge):






Sunday, March 14, 2010

2010 NCAA Tournament Simulations

Be sure to also check out the "Fan Anxiety Matrix" Who will your team lose to this March?

The brackets are set, and so for college basketball stat nerds, that means simulations. A number have already popped up: 5000 simulations using Sagarin's predictor rating; an online tool for simulating one random bracket, using Pomeroy's statistics.

As I did last year, I take both approaches one step further: using Pomeroy's offensive and defensive efficiency ratings, and the log5 prediction method, I simulated the 2010 NCAA tournament one million times. My script tabulated the results below. (You can also access them in Google Spreadsheet form by clicking here).



Each column is a round of the tournament; each value is the percentage of the one million simulations that a team reached a given round. On the right-hand side is the average number of wins each team in the tournament had in the simulations.

Last year, hardly any first round games were likely to be upsets based on the simulation. This year is much different, thanks to several major discrepancies between Pomeroy's rankings and the seedings made by the committee. Brigham Young, in particular, is seventh overall at kenpom, while stuck in a 7-seed in this year's tournament. The best games to watch this Thursday and Friday will be the 6-11 games, as all four are slated to be near coin-flips.

The intuition here is to do a face-slap and say "Duh, the teams at the top of the Pomeroy rankings have the best chance in simulations using the Pomeroy rankings!" That dismissal would miss several key features of the simulation, and one interesting thing to do is to see how the simulations correspond to our gut instincts about the basketball matchups in each game. For example, Kentucky and Syracuse have rough roads to the national title because of very high (about 25%) chances of losing in the second round. Florida State is the culprit for Syracuse; if you combine that information with the possibility that Arinze Onuaku will not play this weekend, an FSU-Syracuse game on Sunday suddenly gets very interesting.

Kentucky could run into Texas in the second round, and the Longhorns are ranked much higher than in the RPI and by humans (both the selection committee and the polls). And while Texas has been a bit of an enigma to the national media this year, it is clear that the possess the talent to be efficient on both ends of the floor. Kentucky's road is further blocked by Wisconsin, a team actually ranked higher than the Wildcats. That Sweet Sixteen matchup would be really bruising on Kentucky's boards, as the nation's #2 offensive rebounder (Collins) takes on the Badgers' nation-best defensive rebounding squad.

The Pomeroy rankings are typically recognized as being successful post-season analysis of the teams, including the tournament games. All six NCAA champions since Pomeroy's website launched were ranked in the top two post-season, and in the top 15 for both offense and defense. However, the accuracy of the pre-tournament stats is a bit more rusty; last year the "best simulation" out of one million got 53 games right, and averaged 37 correct games. Is it a lack of complete data, a flaw in the system's ability to prognosticate, or just the general stochasticity of the NCAA tournament?

This will be a very interesting year for seeing the ability of the RPI rating system (used by the committee, and which does not include margin of victory) versus the Pomeroy rating system (which goes to the opposite extreme, including margin of victory with no cap). If the adjusted efficiencies are the more accurate predictor this tournament, then we are likely to have a mad, mad, mad, mad March.


I wanted to get the results out quickly, but I will have some further analysis in this post and others throughout the week. Thanks as always to Ken Pomeroy for his absolutely terrific website; without his stats, none of the fun simulators would exist!

Thursday, March 11, 2010

One Million ACC Tournament Simulations

One year ago, we had some fun with spreadsheets and used a number of different methods to predict the ACC Tournament. Unfortunately, the raw numbers had no way of knowing the status of Ty Lawson's ankle, so the team predicted to win about 30% of the time grabbed the title, while UNC rested for bigger fish. In the past year the number of Pomeroy Disciples has grown, and so traditional "log5" predictions of the conference tournaments can be found all across teh internets (although this one in particular, from Basketball Prospectus, is a must read).

I like to find my little niche here at the Immaculate Inning, and that means simulating the hell out of things. The method is the same as for last years' ACC tournament. This year, I used raw offensive and defensive efficiencies that were tabulated here. This means that a team did not have their stats adjusted for home games or for the strength of opponent: the only values in the stat is points scored (or allowed) per possession. Via the Pythagorean Expectation Formula (with KenPom's exponent for unadjusted efficiencies = 8.5), I calculated a team's "expected winning percentage."

To determine the chances that team A beats team B, a form of Bayes Formula is applied, which in the stat-head world has come to be known as "The log5 Method." The method could be applied to any scenario where the probability of a single outcome is desired, given the prior probability for each of two alternatives. Here, we have two teams, each with an expected winning percentage, and can calculate the probability of a .900 team beating an .800 team. If we assume that the result of each game is independent, then we can multiply probabilities together to get a team's overall probability of making a certain round.

Personally I find the method rather deterministic, in what essentially is a stochastic process. Instead, I run the tournament 1 million times and calculate the percentage of simulations n which each team makes it to each round. The results of my simulations for the 2010 ACC Tournament are below:




The spreadsheet has two tabs, one for a simulation done using stats from all games, while the other is for ACC games only. The way to read it is that each team (row) won a certain number of games (0,1,2,3, or 4) in a certain percentage of the 1 million ACC tournaments I simulated. For the top four seeds, the maximum number of wins is 3, while the other 12 teams could potentially win four games and the tournament.

Duke's chances of winning the tournament is severely if stats from the entire season are used, and they go from a near 2-to-1 favorite to not even winning a majority of simulations. Part of this has to do with the raw nature of the efficiencies; accounting for Duke's tough schedule (and it was one of the toughest in the country by most any measure: KenPom, Sagarin, RPI) would probably account for most of the discrepancy.

On the other end of the spectrum is Miami, which gained an incredibly high percentage (from 0.1% to 3.4%) because they had a highly positive efficiency margin for all games, while it was highly negative in ACC games only. The Canes played very very well against a bunch of schools I've barely heard of, followed by getting clobbered in ACC play. Their adjusted efficiency margin is still decent due to the ACC games they played, but it's hard to give the full season stats much regard in this instance.

The numbers for the ACC-only simulations differ from those seen at Basketball Prospectus; I imagine most of the differences here also have to do with using raw efficiencies rather than Pomeroy's adjusted numbers. The adjusted numbers, Pomeroy claims, are the best for predicting "the chance of beating an average D-1 team on a neutral floor." The raw numbers, then, are skewed based on home-court advantage, schedule (remember, the ACC is no longer "balanced"), and the overall strength of offenses and defenses a team faces. In particular the predictions differ in that Maryland's chances are reduced, at the expense of better chances for FSU and VPI. Both methods agree that fifth-seed Wake has one hellish path towards an ACC title; much worse than sixth-seeded Clemson's chances. Overall, it will be interesting to see whether raw or adjusted efficiencies do a better job predicting the ACC tournament.

Another advantage that these simulations have is the amount of fun I can have with the results. Below I present the "Fan Anxiety Matrix." Each cell in the Matrix represents the chances that a team (in the rows) loses in the ACC tournament to a specific team (in the columns):



So, Duke's "Fan Anxiety Matrix" says that, in the 37% of simulations when they didn't win the whole thing, the most common opponent taking down the Blue Devils was Maryland (Using the ACC stats here). Perhaps no surprise there, but then there were still 8.3% of the simulations in which the Blue Devils fell in the semi-finals to Virginia Tech. Duke's first round game is against either Boston College or Virgina, and the combined percentage of simulations in which the Blue Devils' ACC run ended against those two teams was six percent. It should be of some comfort that UNC's chances of taking down Duke (this would have to be in the finals) clocked in at a tiny 0.029%.

Looking at the matrix, Clemson's path is an interesting one. The Tigers are seeded sixth and must at least pass through NCSU and FSU to get to the semifinals; the Matrix has them losing to these teams 22.5% and 37.1%, respectively. Maryland (22.4%) and Duke (10.1%) also appear in the double-digit percentages as Clemson's final ACC foe, with the remaining 3.2% speaking for Clemson's ACC title chances.

Virginia Tech's bubble position would certainly be helped with a win in their quarterfinal matchup; things are looking up according to the simulations, which have them falling to Duke in the semifinals 50% of the time. Wake Forest has a rough road to the ACC title, as they must win Thursday versus Miami (losing %: 30.5), Friday versus Virginia Tech (39.5%), Saturday versus (with 94% probability) Duke, who accounted for a further 25% of Wake's losses in the simulations.

For posterity's sake, here are the official Immaculate Inning ACC Tournament Predictions:

Thursday winners: Virginia, Wake Forest, Georgia Tech, Clemson
Friday winners: Duke, Virginia Tech, Maryland, Clemson
Saturday winners: Duke, Maryland
ACC Champion: Duke 75, Maryland 60

Friday, March 05, 2010

The Last Time

The average US price for a gallon of gasoline was $1.82.

One share of Google stock was $187.40.

The HMS Scott reveals, via mapping of the seafloor, a 100 m landslide at the epicenter of the deadly 2004 earthquake/tsunami in the Indian Ocean.

Year 4702 (Year of the Rooster) began in the Chinese calendar.

The #1 song on the Hot-100 Billboard charts was "Let me love you" by Mario

The top movie at the box office was "Boogeyman." It was about to be replaced by "Hitch."

Cuba begins a ban on smoking in public places.

President George W. Bush, just two weeks into his second term, announces a tax increase.

Two months before his death, Pope John Paul II allows an American cardinal to give the Ash Wednesday address from the Vatican.

#8 Duke beat #2 North Carolina in Cameron Indoor Stadium, 71-70, after Rashad McCants dribbled the ball off his foot with two seconds left.

Duke students, acting like They Have Been There Before, stay in the stands to sing the alma mater, rather than rushing the court. Students proceed to burn shit in a disorderly fashion.

That was Wednesday, February 9, 2005. The last time Duke beat UNC at Cameron.

Carolina delenda est.

Tuesday, March 02, 2010

Five Teams That Should Not Scare Duke In March

Duke fans, I've noticed, have a tendency to be of the self-hating variety. Threads at Duke Basketball Report are filled with skepticism for (supposedly) one's own team, which seems like it just doesn't translate well into pleasurable fan experiences. Some fans even go as far as concocting reasons why a win in a given game is not a big deal: I even saw a fan state that "some years the best team doesn't win it all; if Duke won the national championship, this would be one of those years." How sad a fan existence is that?

To combat the rampant pessimism, this post will identify the types of NCAA tournament teams Duke matches up extremely well against. I will examine whether some of the teams rated top in the country by pollsters and bracketologists are also the types of teams that match up well against Duke in tempo-free statistics. Here I will look at teams that match up poorly against Duke; later I will take a look at some "scarier" teams. My approach is to examine a path towards the National Championship that includes possible opponents at each level of the NCAA Tournament, and which teams/types of teams Duke would love to see. It is unlikely that Duke will have trouble with its first round opponent this year, so that leaves five rounds, and five example teams.

Clearly, this analysis involves looking at precisely zero video. The only team I have consistently watched all year is Duke; I may have tuned into a game here or there for other teams, but that one game could bias my thoughts on non-ACC teams. From the tempo-free perspective, there are a number of things Duke does extremely well:

1) Offensive Rebounding (7th in nation at 40.7%). Brian Zoubek is no longer Duke's best-kept secret weapon, and his improved offensive efficiency rating over the last few games complements his best-in-the-nation offensive rebounding ability.

2) Free Throw Percentage (8th in nation at 75.9%). While other teams can't really control how Duke does from the line, they can certainly be a team that fouls a lot, sending Duke to the line for some free points.

3) Three-Point Shooting Defense (1st in nation at 26.7%). Mid-major teams which lack size and rely on the three-pointer are going to have long nights against Duke in March.

4) Lack of Turnovers (16.4% of possessions, 11th in nation). Duke's turnover rate has been particularly excellent over their last few ACC games (and would be even lower if not for several turnovers by the bench players in garbage time against Virginia).

5) Height. Pomeroy lists two kinds of height on his statsheet: The raw number in inches (Duke's average height is 2nd in the nation at 79.0"). Clearly, this should be weighted by playing time, and Pomeroy does this: he points out that height is correlated to offensive efficiency with r-squared = 0.27, with r-squared = 0.38 on the defensive end. Duke's "Effective Height" is +4.9, meaning they are about five inches taller than the average Division-1 team. Here are the tallest teams in the nation, and Duke has already beaten four of the top ten!

Last week, I pointed out the aspects of the tempo-free analysis that Duke struggles with, and not much has changed so refer to that post for Duke's weaknesses.

Looking at the 7,8,9,and 10 seeds in Joe Lunardi's latest bracketology is a good place to find teams Duke is likely to see in the second round of the tournament. There are a bunch of ACC teams there, but we can leave them out since the committee does like to put teams from the same conference on collision courses before the regional final. Which of these teams sticks out as matching up poorly against Duke?

1 (Thirsty Thirty-Two) Oklahoma State. Picked to be a #8 seed by Joe Lunardi, and firmly in the "Also Receiving Votes" portion of both human polls, the Cowboys are a possible second-round opponent for Duke. Their overall offensive (112.8; 31st) and defensive (94.1, 62nd) efficiencies are about the middle of the pack for possible NCAAT teams, as you might expect for a team with a middle-seed. When broken down into component factors, there's a lot to like about this team from a Duke perspective. First of all, they are a short team, with an effective height more than an inch below the D-1 average; the tallest player on the roster is 6-8 Junior Matt Pilgrim, who sees less than 40% playing time.

The most convincing matchup for Duke comes with shot selection. Oklahoma State is near the top of the pile in relying on the three-point shot: 32.7% of their points come from behind the arc, which is 45th in the nation (the highest among possible NCAAT teams). Further, they are not particularly good at making three-pointers, shooting just 35.6%, barely in the top third of all D-1 teams. Finally, the Cowboys struggle to grab offensive rebounds: a rate of 30.2% of their missed shots places them 264th in the nation. The combination of chucking up threes, not making a lot, and struggling to rebound is a strategy that plays very poorly against the Blue Devils, who have superior rebounding and 3-point defense.

2 (Sweet Sixteen) Pittsburgh. This is a team that is slated to be seeded in the 3-5 range, and therefore is a possible Sweet-Sixteen matchup, and one that the tempo-free stats say would be good for Duke. The biggest reason that jumps out is, once again, height. Pittsburgh has an effective height of a full inch below average, especially at guard and forward. They do have 6-10 center Gary McGee, who plays nearly 60% of the time, but who is not much of a factor offensively. Defensively, McGee has a good block rate and a good defensive rebounding ability, so most of Duke's height advantage in this one comes from Kyle Singler and Jon Scheyer, who tower over their likely defensive counterparts.

One thing Duke would not be doing in a game with Pittsburgh is wasting offensive possessions. While the Panthers are good at limiting their opponent's eFG% (44.1; 20th), they are mediocre-to-bad at the other three factors. Duke protects the basketball pretty well on offense, and Pittsburgh rarely forces their opponents into turnovers (Defensive turnover rate: 16.8, 335th). Pitt also don't have much success on the defensive boards (DR% 68.9, 106th), which once again sets up the scenario of Duke dominating the offensive glass and making every possession count.

Pitt also plays a rather plodding pace (62.4 poss/game, 325th), and has a rather heavy reliance on getting to the free throw line to make their points (23.9% of points come from the charity stripe, 42nd most in the nation). If this game were played in November, this could have come as an advantage for Pitt, but Duke has reduced their fouls in recent weeks, with good results. The tempo-free stats don't say much about the "quick guards" factor, but Pitt does have a high percentage of their baskets accompanied by assists (67.1%, 4th). If this is the signature of a quick team using a lot of backdoor cuts, that could potentially hurt Duke's defense, but this signal is not nearly as strong as the possibility of Duke dominating Pitt on the offensive glass.

3 (Elite Eight) New Mexico. I'm all for giving respect to mid-major teams-- when they deserve it. Ranked #8 in the nation in the polls, #7 in RPI, and given a #2 seed in Joe Lunardi's latest bracketology, things are looking up for the Lobos. With the clear caveat that I have never seen them play, I'm not really sure what all the fuss is about. Ken Pomeroy is with me, and has used New Mexico to drive home a point: as long as the NCAA fails to consider Margin of Victory when making tournament picks/seeding decisions, they will continue to not pick/seed poorly the best teams in the country. Case in point: The Sagarin rankings, which have two parts, one (ELOCHESS) that only considers wins and losses; New Mexico is ranked #8. But when points scored/allowed is taken into account (in the PREDICTOR), New Mexico drops to #33. So, really, most elite teams this year should not be scared of New Mexico.

Duke has some specific advantages, though. One is, again, height, and the Lobos are an average team (Effective height +0.1"). New Mexico has been pretty good on the defensive boards (DR% 72.7; 11th best), but in the components for mid-majors, where most teams are short, it requires a little digging to see how they did against tall teams. Indeed, New Mexico's worst game on the defensive boards was their loss at San Diego State, a team that is only elite at one thing: offensive rebounding (they are ranked 6th in the nation for this component). UNM's best rebounder is Darrington Hobson, who at 6-7, 205 is not the kind of player you really want battling against Brian Zoubek...

New Mexico has other weaknesses that play into Duke's strengths, such as an above-average reliance on three-pointers, playing a generally slow pace, and with inexperienced players (ranked 241st Pomeroy's experience stat, versus Duke, ranked 81st in experience). However, if New Mexico were given a 2-seed, they wouldn't see Duke until the regional-final at the earliest, and the stats suggest that the Lobos will lose well before that point.

4 (Final Four) Villanova. 'Nova leaves the most recent bitter taste in the mouth for Duke fans. For this reason, a Duke fan may not be able to see passed some simple truths: a) Duke, in 09-10, has a better, taller, more versatile team than in 08-09 and b) Villanova's team is about the same as it was last year. Yes, they still have "quick guards" although the tempo-free analysis doesn't really see much other than their shooting percentages and overall efficiency; Reynolds, Fisher, and Pena are all rated above 115.0, which is pretty good. The Wildcats are ranked highly enough that it would be unlikely for Duke to face them this tournament until the Final Four, so we will consider them here.

The tempo-free stats don't pull out a whole lot of things that Villanova does really well. Their free throw percentage (75.6%) is comparable to Duke's, and they pull in a good number of offensive rebounds, especially considering their effective size (which is basically average). While teams at the top of the rankings are going to be fairly good, what we're looking for is whether Villanova's particular strengths and weaknesses lie within Duke's (as defined by tempo-free stats). The biggest red flag is pace: Villanova averages 75.8 possessions/game (11th), which we identified as a potential pitfall for a Duke team that averages just 67.3 poss/g (178th). Villanova also has a balanced attack, not relying too much on 3-pointers, 2-pointers, or foul shots.

But there are more red flags for Villanova. First and foremost: they foul. A lot. A defensive free-throw-rate of 49.7 is the ninth-highest in the nation, and Duke's elite free throw ability will give the Blue Devils lots of free points in this matchup. Villanova has had some atrocious games (at Georgetown and vs UConn) in which the opposing team shot more free throws than they made field goals! That is not a very good winning strategy, especially when in general your defense, relative to other supposed NC-contenders, is lacking. They allow a higher-than-average percentage of 3-point shots (3PTA/FGA = 38.2; 316th lowest), and their opponents make an above-average amount (3PT% against = 33.4; 140th). They are not particularly great on the defensive boards (DR% = 68.5; 122nd), and they don't force many turnovers (Defensive Turnover Rate = 22.0; 100th). It would only take an average-shooting night for Duke to win this one handily.

5 (National Championship Game) Kansas State. Okay, I will fully admit that this selection is a little bit of cheating, so let me explain. Should Duke get to the National Championship game, the range of teams they could face is necessarily much better than the range of teams they could face in the earlier rounds. Examining the rest of the Top 10 in the human and computer polls, there are a lot of teams that make for match-ups that are extremely tough to call from a tempo-free perspective, because each team is pretty good at a lot of things. For teams like Syracuse and Kentucky, there aren't any aspects that scream out "Duke is good at this aspect and the opponent sucks at that aspect!" Or vice versa. There are also teams for which there is convincing evidence that Duke would struggle... but that is for the next post. Finally, picking a Hummel-less Purdue just seems too easy, and without Hummel they probably don't make it to the title game anyway. So of the consensus top teams, the one against which Duke would have the easiest time is Kansas State.

The component of Kansas State's game that speaks the loudest is the fouls. I would imagine that most Kansas State games take three hours or more, with how often the Wildcats (Free Throw Rate: 53.3; 2nd) and their opponents (Free Throw Rate Allowed: 46.7, 309th) spend on the free throw line. Kansas State gets more than a quarter of their points from free throws, way more than Pittsburgh (see above), and the 19th most in the nation. However, once they get there they only shoot 66%, with only one of their regulars (Jacob Mullen) north of 75% individually.

They get around all this fouling by having a revolving door for a frontcourt. Three players (6-9 Wally Judge, 6-10 Luis Colon, and 7-0 Jordan Henriquez-Roberts) each grab just 25% minutes (that's 10 min/game) and each manage to put up foul rates above 7.0/40 minutes. The rest of the frontcourt minutes belong to 6-8 Curtis Kelly, who has decent defensive numbers, but isn't a standout offensive star.

From the team perspective, Kansas State does a lot of similar things to Duke. They are one of the few teams better than the Blue Devils on the offensive glass, grabbing 41.2% of their missed shots, which are frequent (since they shoot 36% from beyond the arc and 50% inside it, both rather mediocre). However, Kansas State, whether by design or by ability, struggles on the defensive glass, much more so than Duke does. This has a recipe for being a physical game, in which both teams finish possessions either with offensive rebounds, putting back missed shots, or by heading to the free throw line. There is one wrinkle that makes the comparison less symmetrical: Kansas State has a high turnover rate (21.4; 216th), and while Duke's defensive turnover rate is not great (22.1; 79th), the turnover battle certainly tips the Blue Devils' way. Overall, in a foul-shooting battle, Duke has the definite advantage, given their much higher free throw abilities. The game would come down to which team lost their best interior player due to foul trouble.

This window into Duke's potential road to the national championship is far from perfect. It leaves out a lot of factors, some that are "intangible" and others that are tangible but simply not measured (unless someone wants to watch thousands of hours of tape to quantify how "quick" certain team's guards are). And while my cherry-picking of teams may seem highly beneficial to Duke, I hope that I've at least illustrated that there are teams out there against which Duke stacks up against mightily. The tempo-free statistics say that Duke is the best team in the nation, by a fairly wide margin. While self-hating fans and Duke-haters alike may not consider Duke to be on their list of "scary" teams, there is clearly room for optimism in Durham.

Monday, February 22, 2010

Duke Weaknesses (Perceived and Real)

Clearly, this year's Duke team is neither entitled nor even favored to win the National Championship. The widely respected human opinion polls put, at time of writing, half a dozen teams above a Duke team that has failed to receive a single first-place vote this entire season. Clearly, something scatters the blue sunshine. There are many common memes surrounding Duke, though only a few of them actually apply to this years' team. Let's take a look at some of the common criticisms levied against this year's Duke squad, analyzed from a tempo-free perspective (statistics, as always, from Ken Pomeroy).

1) Duke has played a soft schedule. If we go based on the tempo-free stats, nothing could be further from the truth. Pomeroy ranks Duke's schedule second in the nation, behind West Virginia. Sure, their non-conference schedule is ranked 83rd, but take a look at the teams on top of the non-conference SOS rankings: other than Butler, none of the top 20 have a prayer at an at-large tournament bid. And a number of the teams up top are probably there because they played Duke... this criticism is tired and should be sent to the land of misfit memes.

Making the argument worse is the observation that not only does Duke have the #3 offense in raw efficiency (1.17 points/possession), they have accomplished such lofty efficiency against a harder slate of defenses than anyone in the nation (adjusted efficiency of defenses Duke has played: 0.95 points/possession).

Taken more subjectively, Duke will play 15 non-conference games this regular season (counting Thursday's game against Tulsa). Of these, as many as 7 teams could make the NCAA tournament; what is a better preparation for NCAA tournament play than contests against teams that will actually be there? For comparison, Duke is #2 in the Sagarin strength of schedule as well, and played the fifth-toughest slate in this version of RPI.

2) Lack of backcourt depth. It seems only yesterday that the meme was about the lack of frontcourt depth (well, last year anyway). Assuming that Kyle Singler counts as a guard, Duke's main men play a ton of minutes, while backup shooting guard Andre Dawkins has struggled in limited minutes. I have a problem with this criticism on a number of levels. Part of this argument comes from the same area as the Duke Fade, which is that too many minutes are given to the Big Three. Here's the tempo-free stats.

Player......%Minutes........ORtg....%Possessions
Scheyer.........91.6.............130.7.........23.1
Singler.........89.5............114.3.........23.1
Smith.........82.2.............117.5.........23.1

First of all, pretty cool that all three are used in the same percentage of possessions. Second, if you have three players capable of contributing those massive tempo-free player ratings, why would it be a bad thing to play them all the time? If a player was used so much that they tired in-game, his offensive rating would plummet, due to the extended number of possessions played. Duke's top three men (by % minutes played) have a higher combined rating (362.5) than the top three men on Kansas (358.5), Syracuse (333.4), Purdue (343.1), West Virginia (354.4) and Kentucky (334.5), to pick a few. Scheyer/Singler/Smith are doing it in far more minutes than any of those teams, save perhaps WV, who do get 80%+ minutes from Butler and Jones.

For another avenue of argument, I shall paraphrase what I overheard this morning on 99.9 The Fan in Raleigh: "While Duke has the best trio of players in the country, what happens when they are all shut down?" This, of course, has an easy answer: Duke would lose. The better question is: "what's the likelihood of all three being shut down?" The three Duke stars average a combined 53.8 points/game; the fewest they've scored, combined, was 41 in the Preseason NIT championship win over UConn. In Duke's four losses, the trio has averaged about the same amount, and scored a combined 61 points in the loss at Georgetown. Indeed, it seems that Duke's wins will come down to how many points they can drag out of their frontcourt. Which leads to...

3) Duke has no talent in the frontcourt. I should hope that the last three games would silent anyone who would make such a bold claim. There is clearly some talent in the frontcourt, and that is in the (massive) form of Brian Zoubek. The man who makes girls swoon so hard that their marriage proposal signs are upside down has averaged nearly a double-double (9.7 points, 12.7 rebounds) in those games. With his new-found ability to avoid foul trouble, Zoubek has slipped back above the 40% minutes played mark. This finds him back atop the nation in offensive rebounding. Let's nail that home for the readers too lazy to click on the link:

Category: Percentage of a team's missed shots rebounded by a player while he's on the court.
Qualification: 40% of a team's minutes played (that's about 16 min/game, ignoring OTs)

1. Brian Zoubek, Duke 23.0%
2. Demarcus Cousins, Kentucky, 22.8%
3. Anthony Johnson, Fairfield, 17.4%

So, other than the stud freshman at Kentucky, no one is even close to Brian Zoubek in offensive rebounding ability. Against Maryland, Zoubek used missed shots by the Big Three to have an amazing offensive night, scoring 16 points with 17 rebounds (8 OR). Then, against Virginia Tech, Zoubek benefited from Duke's cold shooting for another 16 rebounds (8 OR). This time he tended to kick the ball out to open men, including two assists on Nolan Smith three-pointers.
The tempo-free world is only six years old, but in that time, only three players have had an OR% above 21%: DeJuan Blair was at 23.6% last year. (The next highest season total is 20.8%).

Of course, there's more to the frontcourt than Zoubek, and the tempo-free stats have mixed feelings about Duke's other big men. Miles Plumlee is a very strong defensive rebounder, and at 22.6% is currently vying for Duke's best defensive rebounding mark since Pomeroy started tracking individual stats in 2005 (Shelden Williams' best was 22.0%). Mason Plumlee currently has the lowest offensive rating on the team (94.5), thanks to a triple-whammy of low effective field goal percentage, small rebounding numbers, and a very high turnover rate. While analysts have been high on Mason's potential, the tempo-free stats haven't shown that this season.

Lance Thomas is generally seen as the "glue" of the 2009-2010 Duke team, and the captain is certainly one of the emotional leaders. As with many in this position, his contributions are said to be "intangible." Indeed, tempo-free statistics don't have much to say about LT. He has an average offensive rating, and the only stat which really stands out is his turnover rate, which is the highest on the team: Duke turns the ball over on 26.1% of his possessions. This could be partially due to Lance, who turns the ball over 1.6 times/game; it could also be unlucky that he's on the court when turnovers occur. Don't view this as a total knock on Thomas, however. These stats of course only include offense, and while Thomas' contribution may not be exactly "intangible" it is certainly "unmeasured" by this system.

Overall, this criticism is still hanging on by a few threads. Clearly, Duke's frontcourt is responsible for a massive number of rebounds, so much so that it can prop up a poor shooting night against many teams. Whether one of the Plumlees or Zoubek can be a viable offensive threat remains to be seen. For now, only Zoubek is impressive from the tempo-free perspective.

For the heck of it, let's throw in:

4) Duke gets all the calls. While pretty hilarious in most seasons, this conspiracy theory has a pretty steep piece of evidence to overcome this year: this season, Duke has a free throw rate (FTA/FGA) of 37.8, which is 171st in the nation, good for almost exactly median. In addition, Duke gets only 21.9% of their points from the charity stripe, good for 119th in the nation. This season, the refs seem to have been paid off by the fans of Kansas State (Free Throw Rate: 53.3) and Wyoming (27% of points from FTs). What Duke does have going for them, however, is free throw percentage: 8th in the nation at 75.9%.

So, what are Duke's real weaknesses, as shown by tempo-free stats?

1) Fouls. As alluded above, Duke struggles more than usual to get to the free throw line, and so cannot benefit as much from their high percentages from the line. While in most years, Duke has multiple players drawing more than 5 fouls per 40 minutes, only Scheyer is above that mark, and no one else is very close.

On the other side of the basketball, committing fouls has been a huge problem, particularly for Brian Zoubek (8.2 Fouls Committed/40 minutes) and Mason Plumlee (7.2 FC/40). In all three of Duke's January losses, they allowed their opponents to get to the free throw line at a rate of about 50 FTA/FGA.

2) Slow Pace On each team's "Game Plan" page, Pomeroy has tables of correlation coefficients. Each of these is a measure of how correlated a particular statistic is with offensive/defensive efficiency. At the moment for Duke, one of these, on offense, is the "pace" of the game, measured by the number of possessions. As Duke's number of possessions goes up, so does their offensive efficiency. For example, Duke's fastest game this season was against Pennsylvania, in which Duke scored 115 points in 75 possessions (Offensive Efficiency: 151.8!) . In Duke's slowest game, at Clemson, the Blue Devils scored just 60 points in 61 possessions, for an efficiency of 98.8. This will definitely be a trend to keep an eye on, should Duke face a team notorious for slow play in March (looking at you, Big Ten).

3) Allowing 2-pointers Duke's defense is ranked 18th in efficiency, but it has it's definite weaknesses. While the Blue Devils are second in the nation in 3-point shooting percentage against (27.4%), they allow opponents to shoot over 45% inside the arc (81st). Put another way, Duke's opponents score 60.8% of their points from 2-point range, which is the 8th most in the nation (to be fair, the 18.4% of opponent's points from beyond the arc is fourth lowest). Duke's strategy clearly makes some sense: it doesn't take a math degree to deduce that 3 > 2. However, some teams are just not built for chucking up treys. This season, Georgetown, NC State, and Wisconsin all used Duke's spread defense to their advantage, scoring backdoor layups and short-range jumpshots.

As the rankings sit today, Duke is the best team in the nation, measured from a tempo-free perspective. They have the best offense, a good defense, and have played a very strong schedule. We've seen that many of the common criticisms of Duke fall apart when seen from the perspective of tempo-free stats. Still, this is a college basketball world without perfection, and while Pomeroy's statistics can help us see that Duke is a little underrated by the human polls, there is still room for improvement as March looms.

Wednesday, February 17, 2010

The Duke Fade: A Response

Earlier today, Alex Fanaroff of Duke's daily newspaper The Chronicle posted a wonderful article, "Duke Does Decline, Objectively Speaking." It's a refreshing look at basketball in a way that we love here at the Immaculate Inning: using tempo-free statistics. Fanaroff describes the general issue:

1) Duke fades down the stretch.
2) This is due to the starters playing way too many minutes, causing them to get tired.

Fanaroff went into the article expecting to at least debunk #1 as a myth, but ended up finding a strong correlation between Duke's efficiency margin (Offensive Efficiency - Defensive Efficiency), and the number of ACC games played. This suggested that over the last seven years, the data indeed does suggest a Duke Fade. Fanaroff was unable, however, to show that "minutes by starters" had any effect on efficiency margin. Naturally, I was intrigued, and wanted to dial further into the data. Here are a number of issues:

1) Year effects do matter. In his accompanying blog post, Faranoff says "Predictably, most seasons failed to produce robust trends due to the limited sample size, though all seasons from 2004-2009 demonstrated a downward trend." This statement is misleading on two accounts. First of all, here is each year plotted individually, with the linear regression line on each:

True, there is a downward trend most years, and Faranoff does mention that Duke's trend is technically upwards this season. Most seasons, that line is not significant: there is not enough evidence to say that the regression line differs from no slope.

However, the null hypothesis can be rejected in 2004 (r-squared 0.2177, p = 0.04406) and 2008 (r-squared 0.2993, p = 0.01879). Here's my question: if a declining efficiency margin during ACC play is a bad thing, then why did the 2004 team make the Final Four, despite having one of of the few significant in-year declines?

2) Opponents matter. As many folks over at the fine institution of DBR have pointed out, Duke always plays Carolina as its last ACC regular season game. Then, while the early rounds of the ACC tourney may provide a brief dip in competition, Duke almost always advances to the end of the tournament, and they naturally find better teams there. We can look at the effect of Opponent Rank (determined by Pomeroy) on Efficiency:




The scatterplot doesn't look like much, but the trend line is there, and it is significant: Duke's opponents get tougher as the season goes on. And Duke does play much better against inferior competition:


Since Duke's opponents are harder later in the season, and Duke has lower efficiency margins against better opponents, how can we take this into effect? One way is with an Analysis of Variance (ANOVA: more info here), in which we can specify variance components. Basically, we have two main effects on efficiency margin: ACC Game # and Opponent Rank. If we account for the variance accounted for by the strength of the opponent, is the correlation still significant? For the stat-geeks out there, here's the ANOVA table:

Anova Table (Type III tests)

Response: Delta.100
......Sum Sq....... Df........ F value........ Pr(>F)
(Intercept) 0.12776 ....1 ........6.0593 ..........0.015267 *
Opp.Rank 0.48675 ..1...... 23.0858 .........4.563e-06 ***
Game 0.19808 .....1 ........9.3947 ...........0.002693 **
Residuals 2.50903 ....119


To summarize, there is still an overall significant downward trend in efficiency margin as the season progresses, although the significance is reduced.

3) Home Court Matters? When Ken Pomeroy adjusts for home court, he adds 1.4% to the home team's offensive efficiency and visiting team's defensive efficiency, and subtracts the same from the home team's DE and visiting team's OE. So the difference between Faranoff's raw data and what Pomeroy would consider "adjusted" is -2.8 for Duke's home games and +2.8 for Duke's away games. What happens to the correlation if we make the "Pomeroy Adjustment"?



The correlation coefficient decreases, as does the significance, but not to the point where the slope becomes non-significant. There is still a Duke Fade when we account for home and away games.

I must say I come away unimpressed with other explanations for the Duke Fade. Opponents do get tougher but not enough to overcome the efficiency drop. Adjusting for home and away games also doesn't have much of an effect. It is very important to note that nowhere have I suggested a causal agent for the Duke Fade. This is to avoid the common fallacy that correlation implies causation. Clearly, Duke can still have an historic season (2004) despite having one of the few significant in-year Duke Fades.

Instead, I'll take an "I Report, You Decide" kind of approach here. These are the statistical facts, and I'll be happy to attempt more rigorous investigations if they are suggested in the comments.

Saturday, January 30, 2010

Duke-Georgetown Tempo-Free Preview

Seven inches of powdery white stuff has fallen from the sky this morning, which means it's a perfect time to stay home and analyze basketball statistics. From a tempo-free standpoint, today's game between Duke and Georgetown is an elite matchup. The Hoyas are ranked 15th in the nation by Pomeroy, but have an adjusted offense (19th) and an adjusted defense (32nd) ranking below their overall level. The reason for this is that most teams around their level in the Pomeroy rankings have one excellent score and one mediocre score (e.g. Villanova: 3rd on offense, 71st on defense).

Georgetown's strength of schedule also boosts their Pomeroy ranking. The Hoyas are coming off a blow out loss at Syracuse (3rd Pomeroy), and overall they rate as the 4th toughest schedule in the nation, and have played against the second toughest slate of defenses of all the division 1 teams. While this means that G-Town is battle tested, it also means that their raw offensive and defensive scores are much lower: they have averaged 108.7 points/100 possessions (58th) while giving up 93.1 points/100 possessions (46th). Part of the low offensive output has come while on the road against good teams-- they failed to crack the 1.0 points/possession mark at both Syracaue and Villanova.

Duke, meanwhile, is a well-kept tempo free secret this year. For whatever reason, Duke always seems to be near the top of the Pomeroy rankings in January and February, and this season is no different. Duke currently sits behind only top-ranked Kansas overall, and the Blue Devils continue to have the best adjusted offense in the country. In fact, it has only been in the past week (since the loss at NC State) that Duke has not also had the highest raw offensive efficiency as well (and are still in the top five). Clearly those numbers were going to come down with ACC play, but like Gerogetown, Duke is playing against some pretty tough defenses (6th toughest defense against).

It's hard to put a finger on what exactly is making Duke's offense so good. Pomeroy lists the "four factors" he believes are most critical to consistent play, regardless of the game's pace. Duke is a top 20 team in two of these factors: turnover rate (16.8% of possessions, 14th) and offensive rebounding percentage (40.1% of missed shots rebounded, 15th). Less impressive is the effective field goal percentage (52.9%, 41st) and FTA/FGA (36.4%, 202nd). Clearly the Duke alums have not been paying the refs off enough this year, because they are not getting to the free throw line very often at all. On the bright side, Duke's 77.0% FT% is fourth in the nation.

This is a game that will be won underneath the baskets. The first issue is Duke's shooting ability: while Duke has their characteristic 3-point shooting ability, they have struggled at times shooting from close range, and rank 79th with just 50.6% from inside the arc. Georgetown, meanwhile, struggles to keep opponents' percentages down. Because this game is being played at the Verizon Center, Duke can be expected to miss their fair share of shots. This shifts the focus to a battle between an elite offensive rebounding unit (Duke) and an elite defensive rebounding unit (Georgetown). For Duke's big men, the goal should be to grab that rebound and then go up strong through the usually foul-conscious Greg Monroe. But if the Hoya sophomore can handle Zoubek and the Plumlees, it could be a long afternoon for Duke.

In the rare event that Duke has a lights-out shooting performance on the road, the final tally could get quite high-scoring. One of the things Georgetown does do well is shoot the ball, with a 55.4% eFG% (12th). However, they are not very skilled at the other four factors (offensive rebounding, limiting turnovers, and getting to the free throw line). It's a strange game to predict because if the game were played at Cameron, one could see Duke making their shots and limiting their opponent's strength on the defensive glass, while the Blue Devils dominate their own defensive glass after limiting Georgetown's shooting ability.

But the game is being played in DC's downtown Chinatown, which adds expectation to Georgetown's shooting ability and detracts from Duke's. That's bad news as Georgetown has four players (Freeman, Vaughn, Wright and Clark) who put up eFG% north of 55. A cold shooting night from Duke and road defense as dismal as shown at NC State, and Duke looks to fall in a big way. However, more recent games have shown Duke's defense to be back on solid ground, and so if Georgetown's shooters are cooled, then look for a fight to the finish.

Prediction: Mason Plumlee and Brian Zoubek, combined, have about as much playing time (86.7% minutes played combined) as Greg Monroe (84.7%). Whoever has the most rebounds (MP2 + Zoo vs Monroe) will be on the victorious team.

Friday, January 22, 2010

Worst Duke Defensive Performances

It's been a while since we've done college basketball here, and I have a few things planned for the coming months, including a systematic re-do of last year's tournament simulation. In the meantime, I want to nip any "Matt's a Duke fanboy" criticism in the proverbial bud by doing a negative post about Duke.

The tempo-free era of college basketball began with the 2003-2004 season as Ken Pomeroy started putting posting his rankings based on offensive and defensive efficiency. Duke, for whatever reason, has always done pretty well during the regular season in Pomeroy's adjusted (for opponent offensive and defensive strength) rankings:

Year/Offensive Efficiency/ Defensive Efficiency
2010/1/18
2009/10/20
2008/11/9
2007/40/5
2006/5/13
2005/15/1
2004/2/4

Nevertheless, Duke has put up some hard-to-believe individual games on both sides of the ball. Defense, however, is so intricately tied to Coach K's philosophy that it is the better choice for a comprehensive breakdown. Defensive ability puts players on the floor for Coach K, and can limit the minutes of potentially explosive offensive perimeter players, if they cannot grasp Duke's defensive philosophy (c.f. Taylor King, Elliot Williams, Andre Dawkins). Usually, the result is a solid defensive gameplan that has consistently been in the top 20 the past six and a half seasons.

There have been great defensive efforts, and not always against inferior competition. In fact, the best defensive performance of the 228 games recorded on kenpom.com is this one, played December 19, 2009 against Gonzaga in Madison Square Garden. Duke held the Zags to an efficiency of 57.1, which is absolutely stellar when you consider that Gonzaga's 2009-2010 average is 111.3! But it has not been all good news for Duke this year, and it is an historic defensive lapse that is the reason for this post. Here are the top 10 worst defensive performances since 2003-2004:

#10 (118.4): January 14 2006 at Clemson (Duke W 87-77). We start with a surprising performance in that not only is it a Duke win, but also a win on the road. Although the 2005-2006 Duke team was frequently spotlighted for it's excellent defense, led by Shelden Williams' shotblocking ability, on this night at Littlejohn, things did not go well. Giving up more than a point per possession to a Clemson team that did not make the NCAA tournament and finished 94th in the nation in adjusted defense was simply not acceptable. Duke had some other defensive clunkers along the way, but eventually fell in the NCAA tournament due to their worst offensive performance in the tempo-free era. (But as Alton Brown might say, that's another post...)

#9 (119.0) November 16 2008 vs Rhode Island (Duke W 82-79). This was the night that RIU's Jimmy Barron almost got a permanent middle name from Duke fans. Barron hit seven straight three-pointers before Dave McClure came off the bench to get a hand in Barron's face, causing him to heave an air-ball with 1:24 remaining and Duke down by 2 (ESPN's play-by-play calls the shot a 2-pointer, but I was at the game and remember differently). This was by far the loudest I've heard Cameron in a non-conference game. I also had the pleasure of high-fiving Jimmy Barron as he ran off the court, and I have not washed that hand since.

#8 (119.1) March 8 2009 at North Carolina (Duke L 79-71). This was the closer of the two Duke-UNC games last year, and both teams were had worse efficiencies than the earlier matchup. The pace was 13 possessions slower as well. The biggest standout is that the Tar Heels rebounded over 40% of their missed shots, while Duke's offensive rebounding percentage was a dismal 18.8%. Eww.

#6t (121.3) February 18 2004 at Wake Forest (Duke L 90-84). Duke's best team during the Tempo-Free era only lost six games all season, and every time they did, they allowed more than a point per possession. It was Duke's second loss in a week (they lost at NC State earlier) and is perhaps the best lesson for Chicken-Little Duke fans and Duke haters alike, concerning this year's Duke team. Despite the efforts of the media and the common fan, "consistency" just doesn't mean anything. Especially in ACC road games. A team can put up clunkers in consecutive games in January and still get within an Okafor of the national championship game.

#6t (121.3) January 21 2006 at Georgetown (Duke L 87-84). From the second paragraph of that ESPN recap: "That's my child," the elder Thompson said. "I love my child. After all he's had to go through, he deserves this." Duke never led in a classic execution of the Princeton offense, and JJ Redick scored 41 points, and with Duke almost matching the Hoyas' offensive efficiency with a 117.3 rating. Free throws were the difference, as Georgetown got to the line more often than Duke. It is unclear what the elder Thompson said of Duke's destruction of Georgetown in 2009.

#5 (124.5) February 22 2009 vs Wake Forest (Duke W 101-91). Last year was Duke's worst from a defensive standpoint, and it shows in this top ten list. This game was the polar opposite of the game a few weeks earlier in Winston-Salem, a two point loss for Duke in which neither team topped 95 in efficiency. The rematch in Cameron however had a similar number of possessions and basically no defense.

#4 (125.8) February 20 2005 vs Wake Forest (Duke W 102-92). An eerily similar game to the one above, although this rematch played out exactly like the preceding game at Wake Forest (see below); the only difference was the result for Duke. Interesting point about the 2004-2005 Duke team; if you check here, you can see all the factors that were correlated to Duke's defensive performance that year. Interestingly, nearly all the factors are in bold, indicating a significant (at the 95% confidence level) correlation. Duke's defensive performance that year was strongly tied to their opponents' shooting, rebounding ability, turnover rate, and ability to get to the free throw line. Defense was also significantly correlated to Duke's own offensive ability that night, especially on Duke's offensive glass. It should come as no surprise, then, that Duke was knocked off in the Sweet 16 in a game to a Michigan State team that was among the best in offensive rebounding, among other things.

#3 (127.0) January 19, 2010 at NC State (Duke L 88-74). Ah, the inspiration for this post. Duke was outplayed on their defensive end in a big way, and NC State was helped by a healthy 62.7 effective field goal percentage. They also protected the ball, turning it over on just 13% of their possessions, and turnover rate seems to be a relative strength of this year's Wolfpack team. Offensive efficiency, however, has not exactly been consistent for NC State, and while they put up similar numbers against Georgia Southern and UNC-G, an effort like this against a top-20 defense was completely unexpected. I see no reason to see the game as anything other than a fluke for Duke, and while winning at Littlejohn tomorrow will be a tough task, if Duke loses it will not be due to another defensive calamity. For this, in K, I trust.

#2 (127.6) February 11 2009 vs North Carolina (Duke L 102-87). Easily the worst Duke game I have ever attended, as Duke was never really in this game. Unlike some of the other games, this was not a defensive lapse against an otherwise mediocre offense. This was a lashing at the hands of the eventual champs who could not stop two All-Americans from doing whatever they want on the court. Yes, I'm a little bitter that I slept outside in a tent to watch this game. Next.

#1 (131.6) February 2 2005 at Wake Forest (Duke L 92-89). So many surprising things about this list, including the number of home games (4) and the number of Duke wins (4) and the number of appearances by the Demon Deacons (4). There is quite a wide gap between #2 and this game, and Duke almost pulled off the win! This was a collision of elite squads, as Duke finished the 04-05 season with the best defense in the nation, while Chris Paul's team was the #2 offense. Wake's inability to stop anyone came back to bite them in March, as they couldn't shoot their way out of early exits in both the ACC and NCAA tournaments.

To be fair, Wake Forest dominated the the middle 36 minutes of the game and it was only the 3-point abilities of the talented Mr. Redick that brought the game close at the end: JJ hit three from beyond the arc and Sean Dockery made it 90-89 with two seconds left. Taron Downey hit both of his free throws and Redick missed a running three to end the game. Notably, Duke's offensive efficiency in this game was 127.3, and so one must conclude that it was indeed Duke's defense that prevented the win. As the rematch (see #4) showed, home court has a lot to do with the outcome of games like these.

The biggest absence from this top 10 list: games in March. For all of the hate heaped upon Duke for its struggles in the month of March, not once did they make an NCAA tournament exit due to a sudden lack of defense. In fact, the worst NCAA tournament performance is (predictably) the 2007 first-round loss to VCU, which is 27th on the list at 112.0, followed by last year's win over Texas (36th at 109.3). But those games are miles from the ones above, and allowing around 110 points/100 possessions is probably to be expected from some of the elite offenses in the nation. Coach K gets his teams prepared on the defensive end for March, combined with playing non-ACC teams unfamiliar with Duke's defensive style. There are relatively more games in ACC tournament on the list.

I have compiled the stats from kenpom.com from all years into one spreadsheet, which can be found here for your sorting pleasure. Tempo-free stats can tell us a lot about a team that we weren't expecting to hear. This would be not-so-subtle foreshadowing of a future post about Duke. For a further hint, see this and this.

Monday, December 28, 2009

2009 AFC Playoff Scenarios

One year ago, we constructed a popular post about the playoff permutations in the AFC East. This year, a logjam in the middle of the AFC leaves many meaningful games for week 17.

It is possible that five teams could finish 9-7 (Jets, Denver, Baltimore, Pittsburgh, Houston). In this scenario, strength of conference record ranks the teams first. This eliminates the Texans and Steelers, since both teams would finish at 6-6 in conference, while the other three teams would finish 7-5. Next, record in common games applies:

Versus NE, OAK, IND, and CIN:
Jets: 4-1
Broncos: 3-2
Ravens: 1-3

This gives the Jets the #5 seed and the Ravens would take the #6 seed thanks to their victory over the Broncos.

A different, crazier, possibility exists. With one week to play, it is possible that EIGHT AFC teams can finish at a mediocre 8-8, which in addition to giving ex-commisioner (and lover of parity) Paul Tagliabue a spring in his step, it brings upon the NFL the full power of the sometimes mysterious NFL Tiebreaking procedures. So what if the stars align and all eight teams finish at 8-8?

The key is that ties within divisions are broken first, with only one team per division advancing:

Within Division:
East: Mia > NYJ (head2head)
North: Bal > Pit (Division record)
West: Den
South: Jax (3-way h2h):
Jax 3-1 vs Hou, Ten
Ten 2-2 vs Hou, Jax
Hou 1-3 vs Ten, Jax

In a multi-team tiebreaker, a head to head sweep would prevail but the teams did not all play each other. Baltimore's win over Denver is irrelevant now.

This lands with the games-within-conference tiebreaker, and an 8-8 Jacksonville would have 7 wins in conference, so they're in.
The remaining three teams would be 6-6 in conference.
The next tiebreaker is record in common games (minimum four), and each team has played NE, PIT, IND, and SD:

Denver 2-3
Baltimore 2-3
Miami 2-3

This, troublesomely, leaves each team at 2-3, causing us to enter the magical land of Strength of Victory. Luckily, in the Everyone 8-8 Scenario, many games have been decided for us.

Denver beat: CIN (11), CLE (4), OAK (6), DAL (10, Play PHI), NE (11), SD (12, Play WAS), NYG (8, Play MIN), KC (4)
Ravens beat: KC (4), SD (12, Play WAS) , CLE (4), DEN (8), CLE (4), PIT (8), DET (2, Play CHI), CHI (5, play MIN and DET)
Miami beat: BUF (5, Play IND), NYJ (8), NYJ (8), TB (3, Play ATL), CAR (7, Play NO), NE (11), JAX (8), PIT (8)

So the games we would know for sure:

DEN: 66
BAL: 47
MIA: 58

The superior record of teams the Broncos have beaten is insurmountable, and so the Broncos would get the final playoff spot.

Meanwhile, the wins by the Patriots (over Houston) and Bengals (over the Jets) would put them both at 11-5 and at 8-4 in the conference. Moving on to record in common games against the Jets, Ravens, Texans, and Broncos:

NE: 3-2
CIN: 3-2
Which leaves Strength of Victory:

NE beat: NYJ (8), BAL (8), HOU (8), BUF (5, play IND), ATL (3, play TB), TEN (8), TB (3, play ATL), MIA (8), CAR (7, play NO), BUF (5, play IND), JAX (8) (71 wins)
CIN beat: NYJ (8), BAL (8), BAL (8), PIT (8), GB (10, play ARI), CLE (4), CHI (5, play MIN and DET), PIT (8), CLE (4), DET (2, Play CHI), KC (4) (69 wins)

So New England would have multiple ways to increase its lead over Cincinnati in the Strength of Victory department next week, while the Bengals will have to wait for everything to fall into place in order to grab the 3rd seed.


In summary, the Great Mediocrity Scenario of 2009 would create the following playoff matchups:

1. Indianapolis (bye)
2. San Diego (bye)
3. New England vs 6. Denver
4. Cincinnati vs 5. Jacksonville

Feel free to point out any errors I have made in the comments.

Update 1: Fixed Ravens-Broncos error pointed out in the comments. I blame NFL.com's difficult to read schedule page...

Monday, September 07, 2009

Congrats to Ross Ohlendorf(and belatedly to A.J. Burnett)

On Saturday, Ross Ohlendorf pitched an immaculate seventh inning against the Cardinals. You can watch it here. I'd have to defer to mehmattski on this, but I would bet that it's the first time that an Immaculate Inning has been throw where every strikeout was a dropped third strike.

On June 20th, AJ Burnett threw an immaculate inning of his own. You would think that a blog written by a Yankees fan and a Marlins fan who also follows Former Marlins, would've noticed when a Former Marlin throws an Immaculate Inning for the Yankees, against the Marlins. To our 6 readers and AJ Burnett, we wholeheartedly apologize for not posting about it when it happened.

Tuesday, April 28, 2009

Immaculate Inning: Daniel Bard

We at Immaculate Inning take a lot of pride in chronicling the rare feat which gives our blog its name; that is, striking out three batters in an inning using just nine pitches. It has only happened 41 times in major league history, but unfortunately we have no idea how common the feat is at the minor leagues. A few years ago it came to our attention that Chris Mason twirled an Immaculate Inning in a AA game. The immaculate inning fires take a while to get stroked with these minor league games, but we are proud to recount the feat of Red Sox prospect Daniel Bard. Major hat-tip to the Projo Sox Blog for bringing the performance to my attention.

Daniel Bard is a 23-year old righthander pitching for the Pawtucket Sox in AAA. He finished last season at AA, retiring 20 of his final 23 batters; that domination has continued into this season, as he sports a 1.69 ERA and has struck out 18 in 10.3 innings. One of those innings, and three of those strikeouts, came against the Rochester Red Wings on April 22. Some video of the immaculate inning can be found here.

The batters were Jason Pridie, Matt Tolbert, and Luke Hughes, the first three batters in the Rochester order. All of them are career minor leaguers, although the 23 year old Hughes appears to be a legit prospect. Pridie appears to go down swinging on three straight fastballs right down the pipe. Tolbert follows the advice of a nearby heckler ("swing!") and misses at three more fastballs from Bard. Some kind of offspeed pitch (curveball) is taken for a strike by Hughes before he swings way late on two fastballs, the second around his eyes. Bard, meanwhile, looks to be rather bored with AAA pitching, and should expect a call up to the majors sometime this season. Regardless of his future, Bard has solidified a place in history with his immaculate inning, and we offer him the highest congratulations!

Tuesday, March 24, 2009

Sweet Sixteen Predictions by Simulation

Now that I've taken a day to recover from watching some 40+ hours of basketball over the weekend, let's revisit the predictions made by my NCAA Tournament Simulator. Here's a link to bracket that I picked based on the highest number of average wins in the tournament. As you can see, the picks did pretty well, landing in the 72nd percentile overall on ESPN. Thirteen of the sweet Sixteen teams were picked correctly, and the bracket lost zero Elite Eight teams over the first weekend of play. The three most notable exceptions were West Virginia, UCLA and Wake Forest. The simulation could not have taken into account how absolutely uninspired these teams would play. It also missed the Western Kentucky over Illinois, since the simulation didn't know about the injury to Chester Frazier.

West Virginia did replace Michigan State in the Most Likely Elite Eight according to the one million simulations. How likely was the first round overall? I wrote a script to count the number of times the simulation predicted the exact first round results in each region:

West = YES! 41733 times!
Midwest = YES! 2325 times!
East = YES! 84894 times!
South = YES! 13648 times!
Overall = Nope. 0 matches.

Upsets of Wake Forest, Utah, and West Virginia at the same time in the Midwest region rarely occurred in the same simulation, and when they did, that simulation did not get one of the other regions correct. In fact, in my pool of 1 million simulations, just 66 produced the correct first round results in three of the four regions. It seems that even if I could have entered all one million simulations, it would not be enough to win Yahoo's Perfect Bracket $1 million. Oh well.

So what do the Pomeroy ratings tell us about the Sweet Sixteen and beyond? To answer that I have two different approaches. One is to simply report the results of the final simulation from Sunday night, the results of which can be found in the data and graphs in this post. Those results are based on the Pythagorean Winning Percentages posted before the first round of the tournament. Four days and forty-eight games (not counting NIT games) later, the rankings are a bit different. How does the added information enhance or suppress the national title chances of each team left in the tournament?

Elite Eight Chances (Click for Chart)
Final Four Chances (Click for Chart)
Championship Game Chances (Click for Chart)
National Title Chances (Click for Chart)

Basically, the inclusion of all the statistics from the tournament games has improved the chances of Connecticut and Memphis winning the national championship, and hurt the chances for nearly everyone else. For Thursday and Friday's games, the teams that most improved were Connecticut (+8.2%), Villanova (+5.5%), North Carolina (+4.5%), and Kansas (+4%). Predictably, the teams that were most hurt by the newer statistics were the immediate opponents of those four teams. UNC-Gonzaga has gone from a tossup (51%-49%) to a more solid favoring of the top seed (55%-45%). The closest game of the Sweet Sixteen now projects to be Oklahoma-Syracuse, with the third-seeded Orange winning 52% of the time.

In the Final Four, Connecticut has actually seen its chances decrease, due to a much higher proportion alocated to Memphis and Missouri, but the Huskies still win the West region in 35% of the one million simulations. From the Midwest, Louisville is still the favorite with a slight edge over Kansas; Michigan State saw a drop in their chances with the inclusion of the new stats. The South is just as open as it was to start the tournament, but Syracuse maintains a healthy advantage, followed by Oklahoma. There is then a huge dropoff between those two and North Carolina and Gonzaga. Finally, the East regional still projects a showdown between Pittsburgh and Duke, with the Blue Devils giving an ever so slight edge (29.00% to 28.28% for Pitt).

The updated stats say that the national title game is less likely to have a representative from the East region, compared with pre-Tourney stats. This is because the four remaining South regional teams all improved their title-game chances, while Duke had the biggest drop of all the teams (from 17.01% to 14.82%). The other half of the title game is still most likely to come from the West, which had Connecticut, Memphis, and Missouri all increase their chances with the inclusion of new stats.

It has, so far, been a tournament small on upsets. The simulator predicts that this trend will continue, with one small exception (#3 Syracuse over #2 Oklahoma), although many of the games project to be very close. One thing that could be improved in the model is the log5 predictions for teams with such similar Pythagorean Winning Percentages. This is one of the things I will be taking a look at in the offseason. In the meantime, it's only two more days until things get kicked off in Glendale, Arizona. Hooray basketball!

Friday, March 20, 2009

Progression of Final Four Chances

At the Immaculate Inning we've been playing all week with different ways to present data generated by our NCAA Tournament simulation. Here is what I feel is the most dynamic view of things: after every set of games on both Thursday and Friday, I set the probability of the losing teams to "0" and re-ran the simulation. I've then graphed the final four chances of every team, by section. You can see the results below (also click here to view the whole source spreadsheet):



You can click on each tab to view the chart for each region. There lots of interesting in trends in each region. The re-simulations are based on the Pomeroy rankings from Thursday, and do not take into account statistics from the first round games themselves.

South Regional: The story here is the Final Four chances of #1 seed North Carolina. Note that these statistics do not take into account Ty Lawson's injury, and yet UNC has had their chances drop over the last two days. More specifically, they've stayed in about the same place, while four other teams have passed them in Final Four chances. Oklahoma is now the odds-on favorite, their chances jumping tremendously with the upset of Clemson elsewhere in the bracket. That is the story throughout; teams rarely improve their own Final Four chances with a win. Instead it's other teams losing that sends waves through the simulations. Three of the four remaining teams in the bottom half of the regional have a better final four shot than UNC, as does Gonzaga in the top half. Arizona State, meanwhile, has climbed from sixth to second in terms of Final Four chances, because they are favored in their matchup with Syracuse (52%-48%).

East Regional: Not much movement going on here, just some strengthening of chances for the favorites as the upsets just don't come. Remember, Wisconsin was heavily favored over FSU in the simulation, so the Seminoles' overtime loss doesn't have much effect on the rest of the regional. Basically, Wisconsin is now at 9%, having added FSU's original 4% to the Badgers' own 5% chances. Among the remainging teams, Texas has the worst chances, since they could have to go through Duke, UCLA, and Pitt (the top 3 teams, statistically), to make it to Detroit. Pittsburgh has the best chance of winning their second round game over Oklahoma St, while the Xavier-Wisconsin game should prove to be the closest of the second round.

Midwest Regional: One of the biggest jumps of the first round was in this regional-- Louisville is no longer the favorite, but instead Kansas wins the West 25% of the time. This only had a little bit to do with Kansas' win over North Dakota St. As you can see from the graph, the gigantic jump came at 5 PM, when simulation favorite West Virginia went down in an uninspiring performance against Dayton. Louisville, the #1 seed, also reached a Final Four chance of 25% by the end of the day, thanks to the upset of Wake Forest by Cleveland State (an upset we predicted in this post). There seems to be two types of games in the second round-- Kansas and Louisville are 80% favorites, while Michigan State and Arizona are favored at the 60-65% rate. If a Sweet Sixteen berth for a low-seeded, mid-major team is your defnition of "Cinderella," then Cleveland State's 38% chance of beating Arizona is the best slipper bet.

West Regional: Not much going on here, because there haven't been that many upsets. Our model did not see Maryland taking out Cal, but clearly they are a different team than the one which put up very mediocre numbers throughout the season. If Maryland can click their offense to the tune of 1.23 points/possession like against Cal, Memphis is going to be in for a long day. Purdue vs Washington is a coinflip (50.9% to 49.1%) and should be a very good game, while Missouri could have a tough time with Marquette.

Final Four Picture: There are tossups in pretty much every regional now, with Louisville-Kansas joining Pittsburgh-Duke and Memphis-Connecticut in the two-dog races. The South regional is as open as ever, and sees Oklahoma as the most likely representative. A UConn-Pittsburgh final still seems to be the most likely, while UConn and Memphis are the only teams winning more than 8% of the time (both are over 11%). This dynamic should change considerably after this weekend; currently the only major change was the elimination of West Virginia. Gonzaga and Kansas have slipped past Duke and are the fourth and fifth most likely championship teams.

So that's where we stand after the first thirty-two games of the 2009 NCAA tournament. Tomorrow and Sunday I'll be updating frequently with the chances of each team's advancement, and I will follow next week with a new simulation from the Sweet-Sixteen onwards! Till then, may your brackets be less busted than mine!