Tuesday, August 24, 2010

Immaculate: Rafael Soriano

My google alert screamed this afternoon with a new hit for Immaculate Inning: this article by Gregg Found at ESPN, which mentions the 44th Immaculate Inning in Major League History. Congratulations to Rafael Soriano, the Tampa Bay Rays closer who effortlessly dispatched the Los Angeles Angels of Anaheim, California, USA, Earth by striking out the side on nine pitches last night. His victims were Erik Aybar, pinch hitter Mike Napoli, and Peter Bourjos.

Soriano was not the first one to throw an Immaculate Inning to end a game, which has happened eight times previously, though two ended complete games with Immaculate Innings (by Ron Guidry and Trevor Wilson). Closers, meanwhile, don't need to play the cat-and-mouse game of wasting pitches, and there is a high priority placed on not walking anyone. Therefore it makes a bit of sense that Soriano joins closers such as Jason Isringhausen by finishing off the game with nine straight strikes.

We honor the Immaculate Inning here because it serves as a type of dominance a pitcher can have over the batters in that inning, and along those lines, Soriano's feat stands out. The Rays' closer got seven swinging strikes (including one foul by Napoli) out of the nine strikes, and all three batters swung through the final pitch. Napoli's at bat was also interesting, as a pinch hitter he seemed predestined to swing-- missing wildly on breaking pitches for strikes one and three. For his part, he just missed a fastball right down the middle on Soriano's second pitch-- a few centimeters over and we're not talking about an Immaculate Inning.

Soriano picked up his league-leading 38th save for the effort, and the victory secured a tie in the AL East with the Yankees, meaning that this Immaculate Inning is one of the most "clutch" in the history of the feat, the first in nearly a year. Honestly, despite two perfect games, how could we call 2010 the Year of the Pitcher without at least one Immaculate Inning?

Finally, we'll have to agree with Found as he notes that an Immaculate Inning is "a feat with a cool-sounding moniker to match its impressiveness."

Saturday, June 26, 2010

The Worst No-Hitter

Yesterday evening I was packing and decided to throw on a baseball game as background noise. Since the Yankees were playing on the West Coast, I went for the Arizona-Tampa Bay contest. I turned off the game in the third inning, after Edwin Jackson weaseled his way out of a bases-loaded jam. I thought nothing of the game until I saw on Baseball Tonight that he had thrown a no-hitter! My first thought was, is this the worst no-hitter of all time?

Jackson walked eight batters in the game, and threw a career-high 149 pitches, just 79 of them for strikes. At one point in the third inning, the Win Probability actually favored Tampa Bay, thanks to Jackson walking the bases loaded! With two outs in the ninth, Jackson walked pinch hitter Willy Aybar on four pitches, which was Aybar's seventh walk of the season. No doubt about it, Jackson pitched rather poorly and still picked up the no-no. Is it possible to have a worse no hitter than Jackson?

First of all, some ground rules: Major League Baseball defines a no-hitter as a "a game in which a pitcher, or pitchers, gives up no hits while pitching at least nine innings. A pitcher may give up a run or runs so long as he pitches nine innings or more and does not give up a hit."

This excludes some rather infamous no-hit performances, such as this one by Andy Hawkins, who allowed four (unearned) runs in a 4-0 loss while pitching for the hapless 1990 Yankees. In a baseball-reference play index search, I asked for games of 9 IP or more since 1920 with zero hits. I then sorted this list by ascending Game Score, a measure invented by Bill James to assign a single number to a starting pitchers' performance. The best nine-inning performance since 1920, according to Game Score, is Kerry Wood's 20K game in 1998, with a score of 105. The measure is very results-oriented, since it places high value on the number of innings pitched and the number of runs-- shutouts are practically guaranteed to be above 80.

Click the link for the results, which show Jackson's game last night to be the fourth-worst no-hitter on record, in Game Score. Three games tied with a game score one unit worse:

George Culver, July 29, 1968 Pitching for the Reds, his second of what would be five teams in the pre-free agency era, Culver was nothing short of mediocre in this game, for someone who pitched a no-hitter. He walked four in the game, but faced 34 batters, one of the extra men (Dick Allen) getting aboard thanks to a throwing error by third baseman Tony Perez, and reaching second on another error by shortstop Woody Woodward on the same play. Allen then reached third base on a groundout and scored on a sac-fly by Cookie Rojas.

Culver allowed another unearned baserunner in the third inning, with Phillies' starter Chris Short reaching on Catchers Interference by Pat Corrales. Culver then retired eleven straight before walking two men with two outs in the sixth, needing a strikeout to get out of that jam. Culver's final two baserunners got on via walk to open the eighth inning, but Culver induced some ground balls to get out of that one, and breezed through the ninth for the no-hitter.

Bill James calculated how likely it was for each pitcher to have thrown a no-hitter, and George Culver came out as one of the ten men least likely to have a no-no.

Ken Holtzman- August 19, 1969. Of the three no-hitters with a Game Score of 84, Holtzman's seems the most impressive. Facing Phil Neikro and staked to a 3-0 lead after the first inning, Holtzman walked just three in his no-hitter. The reason this game has such a low game score is that Holtzman struck out precisely zero hitters! Not exactly Nolan Ryan, Holtzman struck out just 5.0 per nine innings in his career, despite playing in an extremely pitcher-friendly era. Only one other no-hitter since 1920 has featured zero strikeouts, by Sad Sam Jones in 1923.

Sabermetrics has taught us in the Gospel of Three True Outcomes; that a pitcher can only (really) control three results of an at bat: a strikeout, a walk, and a home run. There are fine tunings in there, such as GB/FB rate, and his own fielding ability, but once the ball is put in play, a lot is left up to his defense. On that day in 1969, the Cubs' defense shined, and Holtzman never allowed more than one baserunner in an inning. Twelve groundouts, twelve flyouts, and three pop-outs formed an even split among the batted balls in this game. Interestingly, while giving up fly-balls tends to influence home-run rate and therefore is bad for the overall success of a pitcher, if one wants to pitch a no-hitter, a flyball is far more likely to turn into an out.

The shimmering defense was on display no finer than on the last out of the seventh inning, in which outfielder Billy Williams climbed the ivy at Wrigley to pluck a home run away from Henry Aaron. Holtzman wrote in Chicago Cubs: Memorable Stories of Cubs Baseball that the home fans were reminding him that he had a no-hitter every inning following the third (so much for jinxes!) Holtzman would go on to throw another no-hitter two years later for the Cubs, and was later elected to the Jewish Sports Hall of Fame.

Joe Cowley, September 19, 1986
From the box-score, this is my vote for the worst official no-hitter on record. Cowley registered seven walks (and eight strikeouts), and among the 145 no-hitters on record with Game Scores below 100, Cowley's is only one of two in which the no-no-man gave up an earned run. Much like Jackson, Cowley walked the bases loaded to start the sixth, and then gave up three straight fly balls, the second of which scored Reggie Jackson. Cowley also had two men on in the third, and like Jackson walked a man in the ninth, though this runner was erased on the game-ending double play.

In fact, Cowley threw the worst no-hiter of all time, and then never won another big league game. He lost his next six decisions and was out of baseball within a year of this game. Hopefully, Edwin Jackson can avoid this same fate.

Thursday, April 01, 2010

Confessions of a Duke Alum

It is time for the truth: the reason why Duke is the #1 team in the nation is because we at the Immaculate Inning paid off Ken Pomeroy.

Since 2007, these e-pages have been graced with seemingly logical discussion of the Duke University men's basketball team. We dissect the game from various angles using the statistics-- and those stats, facts based on what happened in each game, come from one website: kenpom.com. The obsession was so obvious that one time in 2007 the blog was briefly shut down by Google because they thought it was a spam link generator:

Re: #118781655 Blogger beta non-spam review and verification request: mehmattski.blogspot.com

Hello,

Your blog has been reviewed, verified, and cleared for regular use so that
it will no longer appear as potential spam. If you sign out of Blogger and
sign back in again, you should be able to post as normal. Thanks for your
patience, and we apologize for any inconvenience this has caused.

Sincerely,
The Blogger Team

Over the years, some have wondered how it is that mediocre Duke teams were still ranked near the top of Pomeroy's rankings. All this season, fans dismissed the rankings because "everyone knew" that Kansas, Kentucky, and Syracuse were the best teams in the land. Yet, for all but a few weeks of 2010, the Duke Blue Devils were ranked #1 on kenpom.com; no ranking so obviously flawed could be trusted! Well, now you know the reason; with our beloved Blue Devils trashed in the national media for being "alarmingly unathletic," and unable to compete because of our racist coach, we wanted to feel like we were the best at something.

Some of you may ask for proof, so here is the little "Christmas Present" I gave to Mr. Pomeroy. His demands were quite specific.

Most of you will not be surprised by this revelation; it is of course common knowledge that Duke receives special treatment from the NCAA, CBS, and the officials (Georgia Tech superfreshman Derick Favors said after the ACC Championship game: "It was very frustrating. We played good defense, and the referees bailed them out.") It's such common knowledge that even after six seasons of "adjustment" by the officials, keeping Duke out of the Final Four, the first explanation for Duke's trip to Indianapolis was that "Duke Gets All the Calls." Of course we do, and there's a very good reason for that too! First, the facts:

It's no secret that referees are corruptible. Tim Donaghy, who was caught wagering on the basketball games he officiated, has claimed that the problem is not an isolated one; he claims that 13 NBA officials are involved in wagering on the game. He further accuses the NBA of "turning a blind eye," because they are more interested in the money than in fairness. The distrust this has created in the casual sports fan has trickled down to mingle with the "Duke Gets All the Calls" meme; no longer is it ridiculous tinfoil hat talk. Real referees are really swayed by real cash.

It's also no secret that Duke alums make a lot of money; in 2009 USA Today ran a "payscale bracket" which picked teams based on median graduate salary. Duke won, and it wasn't close. The Duke Endowment, despite the economic downturn, is still worth nearly $3 billion. The Duke Annual Fund employs banks of undergraduates to call previous donors; this year I made a donation that the university recorded as going to "The Nicholas School of the Environment," and they even made it look good by sending me a thank you letter. All of it was done with a wink, and an understanding.

Truth #2: Every year, Duke alums make a donation to the "Referee Fund," which goes directly to the National Association of College Basketball Referees. Our sizable contribution is made with good faith understanding that it will be paid back with whistles.

We are human, though; we do feel a slight bit of guilt every time we write that check. For six years we had to sit on our hands because the officials said that it would be too obvious to hand Duke a title in 2006, so soon after the officials handed them the 2004 and 2001 Final Four runs. Many alums canceled their contributions after the Duke-UConn game in 2004, and the Duke-LSU game in 2006. It seemed that in those games, the referees were blatantly defying their loyal contributors!

Finally, we have some retribution. Handed the easiest bracket since UNC "won" the 1924 national championship by playing exactly zero postseason games, Duke sailed to the Final Four beating a couple of high school teams and your mother's quilting club. It almost wasn't enough, so the refs did have to step in and prevent Baylor from rebounding any of their (fairly frequent) missed shots. Now, we are two games away from a fourth national championship. But at what cost? We felt it was time to stand up for what was right and come clean.

To the referees in the Final Four: May you call every touch foul and carry on Duke; may every block-charge call go against us. May you clear your conscience and hand West Virginia twice as many free throws. It is your massive control over the game-- more than the coaches, more than the players themselves, you referees always decide the outcome of every game. So, in the name of justice, and as a Duke alum and current student who will be in attendance, please make West Virginia win on Saturday. Only then can we rest in peace.

Wednesday, March 17, 2010

2010 NCAAT Fan Anxiety Matrix

Simulating the NCAA tournament has been pretty popular this year, although I imagine that many of the Duke haters out there are not very satisfied with the results. The results, as expected, agree with the discrete log5 projections published by Basketball Prospectus (Click the links for South, East, West, and Midwest region log5 predictions). The difference between those predictions and my simulations, is the type of data that can be pulled out of the simulations.

For instance, below is a list of the five teams that all 64 teams in the field are most likely to end their seasons against. For example, while Duke was found to win the whole tournament in 24% of simulations, who did they lose to in the other 76% of the one million simulations?




As you can see, California is the team most likely to send Duke home with a disappointing season (and give plenty of bloggers and media mouthpieces lots to puff their chests about). It's not surprising to see that Kansas is one of the top five teams to knock off Duke, since Duke averaged 3.5 wins per simulation, and got to the final game very often, as did Kansas.

We're calling this presentation the Fan Anxiety Matrix. Which team is the most likely to knock out your favorite team? The second tab of the spreadsheet above shows every team's chances of being knocked out by every other team. Click here to see the full spreadsheet.

With a focus on the ACC teams, here's some pie charts to gaze at. Maryland fans will be pleased to see that Duke has only a 2% chance of ending their season. Unfortunately, there is only a 2% chance of winning the national championship as well. The other four ACC teams have an uphill battle to make it until the second weekend (click a Fan Anxiety Index to enlarge):






Sunday, March 14, 2010

2010 NCAA Tournament Simulations

Be sure to also check out the "Fan Anxiety Matrix" Who will your team lose to this March?

The brackets are set, and so for college basketball stat nerds, that means simulations. A number have already popped up: 5000 simulations using Sagarin's predictor rating; an online tool for simulating one random bracket, using Pomeroy's statistics.

As I did last year, I take both approaches one step further: using Pomeroy's offensive and defensive efficiency ratings, and the log5 prediction method, I simulated the 2010 NCAA tournament one million times. My script tabulated the results below. (You can also access them in Google Spreadsheet form by clicking here).



Each column is a round of the tournament; each value is the percentage of the one million simulations that a team reached a given round. On the right-hand side is the average number of wins each team in the tournament had in the simulations.

Last year, hardly any first round games were likely to be upsets based on the simulation. This year is much different, thanks to several major discrepancies between Pomeroy's rankings and the seedings made by the committee. Brigham Young, in particular, is seventh overall at kenpom, while stuck in a 7-seed in this year's tournament. The best games to watch this Thursday and Friday will be the 6-11 games, as all four are slated to be near coin-flips.

The intuition here is to do a face-slap and say "Duh, the teams at the top of the Pomeroy rankings have the best chance in simulations using the Pomeroy rankings!" That dismissal would miss several key features of the simulation, and one interesting thing to do is to see how the simulations correspond to our gut instincts about the basketball matchups in each game. For example, Kentucky and Syracuse have rough roads to the national title because of very high (about 25%) chances of losing in the second round. Florida State is the culprit for Syracuse; if you combine that information with the possibility that Arinze Onuaku will not play this weekend, an FSU-Syracuse game on Sunday suddenly gets very interesting.

Kentucky could run into Texas in the second round, and the Longhorns are ranked much higher than in the RPI and by humans (both the selection committee and the polls). And while Texas has been a bit of an enigma to the national media this year, it is clear that the possess the talent to be efficient on both ends of the floor. Kentucky's road is further blocked by Wisconsin, a team actually ranked higher than the Wildcats. That Sweet Sixteen matchup would be really bruising on Kentucky's boards, as the nation's #2 offensive rebounder (Collins) takes on the Badgers' nation-best defensive rebounding squad.

The Pomeroy rankings are typically recognized as being successful post-season analysis of the teams, including the tournament games. All six NCAA champions since Pomeroy's website launched were ranked in the top two post-season, and in the top 15 for both offense and defense. However, the accuracy of the pre-tournament stats is a bit more rusty; last year the "best simulation" out of one million got 53 games right, and averaged 37 correct games. Is it a lack of complete data, a flaw in the system's ability to prognosticate, or just the general stochasticity of the NCAA tournament?

This will be a very interesting year for seeing the ability of the RPI rating system (used by the committee, and which does not include margin of victory) versus the Pomeroy rating system (which goes to the opposite extreme, including margin of victory with no cap). If the adjusted efficiencies are the more accurate predictor this tournament, then we are likely to have a mad, mad, mad, mad March.


I wanted to get the results out quickly, but I will have some further analysis in this post and others throughout the week. Thanks as always to Ken Pomeroy for his absolutely terrific website; without his stats, none of the fun simulators would exist!

Thursday, March 11, 2010

One Million ACC Tournament Simulations

One year ago, we had some fun with spreadsheets and used a number of different methods to predict the ACC Tournament. Unfortunately, the raw numbers had no way of knowing the status of Ty Lawson's ankle, so the team predicted to win about 30% of the time grabbed the title, while UNC rested for bigger fish. In the past year the number of Pomeroy Disciples has grown, and so traditional "log5" predictions of the conference tournaments can be found all across teh internets (although this one in particular, from Basketball Prospectus, is a must read).

I like to find my little niche here at the Immaculate Inning, and that means simulating the hell out of things. The method is the same as for last years' ACC tournament. This year, I used raw offensive and defensive efficiencies that were tabulated here. This means that a team did not have their stats adjusted for home games or for the strength of opponent: the only values in the stat is points scored (or allowed) per possession. Via the Pythagorean Expectation Formula (with KenPom's exponent for unadjusted efficiencies = 8.5), I calculated a team's "expected winning percentage."

To determine the chances that team A beats team B, a form of Bayes Formula is applied, which in the stat-head world has come to be known as "The log5 Method." The method could be applied to any scenario where the probability of a single outcome is desired, given the prior probability for each of two alternatives. Here, we have two teams, each with an expected winning percentage, and can calculate the probability of a .900 team beating an .800 team. If we assume that the result of each game is independent, then we can multiply probabilities together to get a team's overall probability of making a certain round.

Personally I find the method rather deterministic, in what essentially is a stochastic process. Instead, I run the tournament 1 million times and calculate the percentage of simulations n which each team makes it to each round. The results of my simulations for the 2010 ACC Tournament are below:




The spreadsheet has two tabs, one for a simulation done using stats from all games, while the other is for ACC games only. The way to read it is that each team (row) won a certain number of games (0,1,2,3, or 4) in a certain percentage of the 1 million ACC tournaments I simulated. For the top four seeds, the maximum number of wins is 3, while the other 12 teams could potentially win four games and the tournament.

Duke's chances of winning the tournament is severely if stats from the entire season are used, and they go from a near 2-to-1 favorite to not even winning a majority of simulations. Part of this has to do with the raw nature of the efficiencies; accounting for Duke's tough schedule (and it was one of the toughest in the country by most any measure: KenPom, Sagarin, RPI) would probably account for most of the discrepancy.

On the other end of the spectrum is Miami, which gained an incredibly high percentage (from 0.1% to 3.4%) because they had a highly positive efficiency margin for all games, while it was highly negative in ACC games only. The Canes played very very well against a bunch of schools I've barely heard of, followed by getting clobbered in ACC play. Their adjusted efficiency margin is still decent due to the ACC games they played, but it's hard to give the full season stats much regard in this instance.

The numbers for the ACC-only simulations differ from those seen at Basketball Prospectus; I imagine most of the differences here also have to do with using raw efficiencies rather than Pomeroy's adjusted numbers. The adjusted numbers, Pomeroy claims, are the best for predicting "the chance of beating an average D-1 team on a neutral floor." The raw numbers, then, are skewed based on home-court advantage, schedule (remember, the ACC is no longer "balanced"), and the overall strength of offenses and defenses a team faces. In particular the predictions differ in that Maryland's chances are reduced, at the expense of better chances for FSU and VPI. Both methods agree that fifth-seed Wake has one hellish path towards an ACC title; much worse than sixth-seeded Clemson's chances. Overall, it will be interesting to see whether raw or adjusted efficiencies do a better job predicting the ACC tournament.

Another advantage that these simulations have is the amount of fun I can have with the results. Below I present the "Fan Anxiety Matrix." Each cell in the Matrix represents the chances that a team (in the rows) loses in the ACC tournament to a specific team (in the columns):



So, Duke's "Fan Anxiety Matrix" says that, in the 37% of simulations when they didn't win the whole thing, the most common opponent taking down the Blue Devils was Maryland (Using the ACC stats here). Perhaps no surprise there, but then there were still 8.3% of the simulations in which the Blue Devils fell in the semi-finals to Virginia Tech. Duke's first round game is against either Boston College or Virgina, and the combined percentage of simulations in which the Blue Devils' ACC run ended against those two teams was six percent. It should be of some comfort that UNC's chances of taking down Duke (this would have to be in the finals) clocked in at a tiny 0.029%.

Looking at the matrix, Clemson's path is an interesting one. The Tigers are seeded sixth and must at least pass through NCSU and FSU to get to the semifinals; the Matrix has them losing to these teams 22.5% and 37.1%, respectively. Maryland (22.4%) and Duke (10.1%) also appear in the double-digit percentages as Clemson's final ACC foe, with the remaining 3.2% speaking for Clemson's ACC title chances.

Virginia Tech's bubble position would certainly be helped with a win in their quarterfinal matchup; things are looking up according to the simulations, which have them falling to Duke in the semifinals 50% of the time. Wake Forest has a rough road to the ACC title, as they must win Thursday versus Miami (losing %: 30.5), Friday versus Virginia Tech (39.5%), Saturday versus (with 94% probability) Duke, who accounted for a further 25% of Wake's losses in the simulations.

For posterity's sake, here are the official Immaculate Inning ACC Tournament Predictions:

Thursday winners: Virginia, Wake Forest, Georgia Tech, Clemson
Friday winners: Duke, Virginia Tech, Maryland, Clemson
Saturday winners: Duke, Maryland
ACC Champion: Duke 75, Maryland 60

Friday, March 05, 2010

The Last Time

The average US price for a gallon of gasoline was $1.82.

One share of Google stock was $187.40.

The HMS Scott reveals, via mapping of the seafloor, a 100 m landslide at the epicenter of the deadly 2004 earthquake/tsunami in the Indian Ocean.

Year 4702 (Year of the Rooster) began in the Chinese calendar.

The #1 song on the Hot-100 Billboard charts was "Let me love you" by Mario

The top movie at the box office was "Boogeyman." It was about to be replaced by "Hitch."

Cuba begins a ban on smoking in public places.

President George W. Bush, just two weeks into his second term, announces a tax increase.

Two months before his death, Pope John Paul II allows an American cardinal to give the Ash Wednesday address from the Vatican.

#8 Duke beat #2 North Carolina in Cameron Indoor Stadium, 71-70, after Rashad McCants dribbled the ball off his foot with two seconds left.

Duke students, acting like They Have Been There Before, stay in the stands to sing the alma mater, rather than rushing the court. Students proceed to burn shit in a disorderly fashion.

That was Wednesday, February 9, 2005. The last time Duke beat UNC at Cameron.

Carolina delenda est.

Tuesday, March 02, 2010

Five Teams That Should Not Scare Duke In March

Duke fans, I've noticed, have a tendency to be of the self-hating variety. Threads at Duke Basketball Report are filled with skepticism for (supposedly) one's own team, which seems like it just doesn't translate well into pleasurable fan experiences. Some fans even go as far as concocting reasons why a win in a given game is not a big deal: I even saw a fan state that "some years the best team doesn't win it all; if Duke won the national championship, this would be one of those years." How sad a fan existence is that?

To combat the rampant pessimism, this post will identify the types of NCAA tournament teams Duke matches up extremely well against. I will examine whether some of the teams rated top in the country by pollsters and bracketologists are also the types of teams that match up well against Duke in tempo-free statistics. Here I will look at teams that match up poorly against Duke; later I will take a look at some "scarier" teams. My approach is to examine a path towards the National Championship that includes possible opponents at each level of the NCAA Tournament, and which teams/types of teams Duke would love to see. It is unlikely that Duke will have trouble with its first round opponent this year, so that leaves five rounds, and five example teams.

Clearly, this analysis involves looking at precisely zero video. The only team I have consistently watched all year is Duke; I may have tuned into a game here or there for other teams, but that one game could bias my thoughts on non-ACC teams. From the tempo-free perspective, there are a number of things Duke does extremely well:

1) Offensive Rebounding (7th in nation at 40.7%). Brian Zoubek is no longer Duke's best-kept secret weapon, and his improved offensive efficiency rating over the last few games complements his best-in-the-nation offensive rebounding ability.

2) Free Throw Percentage (8th in nation at 75.9%). While other teams can't really control how Duke does from the line, they can certainly be a team that fouls a lot, sending Duke to the line for some free points.

3) Three-Point Shooting Defense (1st in nation at 26.7%). Mid-major teams which lack size and rely on the three-pointer are going to have long nights against Duke in March.

4) Lack of Turnovers (16.4% of possessions, 11th in nation). Duke's turnover rate has been particularly excellent over their last few ACC games (and would be even lower if not for several turnovers by the bench players in garbage time against Virginia).

5) Height. Pomeroy lists two kinds of height on his statsheet: The raw number in inches (Duke's average height is 2nd in the nation at 79.0"). Clearly, this should be weighted by playing time, and Pomeroy does this: he points out that height is correlated to offensive efficiency with r-squared = 0.27, with r-squared = 0.38 on the defensive end. Duke's "Effective Height" is +4.9, meaning they are about five inches taller than the average Division-1 team. Here are the tallest teams in the nation, and Duke has already beaten four of the top ten!

Last week, I pointed out the aspects of the tempo-free analysis that Duke struggles with, and not much has changed so refer to that post for Duke's weaknesses.

Looking at the 7,8,9,and 10 seeds in Joe Lunardi's latest bracketology is a good place to find teams Duke is likely to see in the second round of the tournament. There are a bunch of ACC teams there, but we can leave them out since the committee does like to put teams from the same conference on collision courses before the regional final. Which of these teams sticks out as matching up poorly against Duke?

1 (Thirsty Thirty-Two) Oklahoma State. Picked to be a #8 seed by Joe Lunardi, and firmly in the "Also Receiving Votes" portion of both human polls, the Cowboys are a possible second-round opponent for Duke. Their overall offensive (112.8; 31st) and defensive (94.1, 62nd) efficiencies are about the middle of the pack for possible NCAAT teams, as you might expect for a team with a middle-seed. When broken down into component factors, there's a lot to like about this team from a Duke perspective. First of all, they are a short team, with an effective height more than an inch below the D-1 average; the tallest player on the roster is 6-8 Junior Matt Pilgrim, who sees less than 40% playing time.

The most convincing matchup for Duke comes with shot selection. Oklahoma State is near the top of the pile in relying on the three-point shot: 32.7% of their points come from behind the arc, which is 45th in the nation (the highest among possible NCAAT teams). Further, they are not particularly good at making three-pointers, shooting just 35.6%, barely in the top third of all D-1 teams. Finally, the Cowboys struggle to grab offensive rebounds: a rate of 30.2% of their missed shots places them 264th in the nation. The combination of chucking up threes, not making a lot, and struggling to rebound is a strategy that plays very poorly against the Blue Devils, who have superior rebounding and 3-point defense.

2 (Sweet Sixteen) Pittsburgh. This is a team that is slated to be seeded in the 3-5 range, and therefore is a possible Sweet-Sixteen matchup, and one that the tempo-free stats say would be good for Duke. The biggest reason that jumps out is, once again, height. Pittsburgh has an effective height of a full inch below average, especially at guard and forward. They do have 6-10 center Gary McGee, who plays nearly 60% of the time, but who is not much of a factor offensively. Defensively, McGee has a good block rate and a good defensive rebounding ability, so most of Duke's height advantage in this one comes from Kyle Singler and Jon Scheyer, who tower over their likely defensive counterparts.

One thing Duke would not be doing in a game with Pittsburgh is wasting offensive possessions. While the Panthers are good at limiting their opponent's eFG% (44.1; 20th), they are mediocre-to-bad at the other three factors. Duke protects the basketball pretty well on offense, and Pittsburgh rarely forces their opponents into turnovers (Defensive turnover rate: 16.8, 335th). Pitt also don't have much success on the defensive boards (DR% 68.9, 106th), which once again sets up the scenario of Duke dominating the offensive glass and making every possession count.

Pitt also plays a rather plodding pace (62.4 poss/game, 325th), and has a rather heavy reliance on getting to the free throw line to make their points (23.9% of points come from the charity stripe, 42nd most in the nation). If this game were played in November, this could have come as an advantage for Pitt, but Duke has reduced their fouls in recent weeks, with good results. The tempo-free stats don't say much about the "quick guards" factor, but Pitt does have a high percentage of their baskets accompanied by assists (67.1%, 4th). If this is the signature of a quick team using a lot of backdoor cuts, that could potentially hurt Duke's defense, but this signal is not nearly as strong as the possibility of Duke dominating Pitt on the offensive glass.

3 (Elite Eight) New Mexico. I'm all for giving respect to mid-major teams-- when they deserve it. Ranked #8 in the nation in the polls, #7 in RPI, and given a #2 seed in Joe Lunardi's latest bracketology, things are looking up for the Lobos. With the clear caveat that I have never seen them play, I'm not really sure what all the fuss is about. Ken Pomeroy is with me, and has used New Mexico to drive home a point: as long as the NCAA fails to consider Margin of Victory when making tournament picks/seeding decisions, they will continue to not pick/seed poorly the best teams in the country. Case in point: The Sagarin rankings, which have two parts, one (ELOCHESS) that only considers wins and losses; New Mexico is ranked #8. But when points scored/allowed is taken into account (in the PREDICTOR), New Mexico drops to #33. So, really, most elite teams this year should not be scared of New Mexico.

Duke has some specific advantages, though. One is, again, height, and the Lobos are an average team (Effective height +0.1"). New Mexico has been pretty good on the defensive boards (DR% 72.7; 11th best), but in the components for mid-majors, where most teams are short, it requires a little digging to see how they did against tall teams. Indeed, New Mexico's worst game on the defensive boards was their loss at San Diego State, a team that is only elite at one thing: offensive rebounding (they are ranked 6th in the nation for this component). UNM's best rebounder is Darrington Hobson, who at 6-7, 205 is not the kind of player you really want battling against Brian Zoubek...

New Mexico has other weaknesses that play into Duke's strengths, such as an above-average reliance on three-pointers, playing a generally slow pace, and with inexperienced players (ranked 241st Pomeroy's experience stat, versus Duke, ranked 81st in experience). However, if New Mexico were given a 2-seed, they wouldn't see Duke until the regional-final at the earliest, and the stats suggest that the Lobos will lose well before that point.

4 (Final Four) Villanova. 'Nova leaves the most recent bitter taste in the mouth for Duke fans. For this reason, a Duke fan may not be able to see passed some simple truths: a) Duke, in 09-10, has a better, taller, more versatile team than in 08-09 and b) Villanova's team is about the same as it was last year. Yes, they still have "quick guards" although the tempo-free analysis doesn't really see much other than their shooting percentages and overall efficiency; Reynolds, Fisher, and Pena are all rated above 115.0, which is pretty good. The Wildcats are ranked highly enough that it would be unlikely for Duke to face them this tournament until the Final Four, so we will consider them here.

The tempo-free stats don't pull out a whole lot of things that Villanova does really well. Their free throw percentage (75.6%) is comparable to Duke's, and they pull in a good number of offensive rebounds, especially considering their effective size (which is basically average). While teams at the top of the rankings are going to be fairly good, what we're looking for is whether Villanova's particular strengths and weaknesses lie within Duke's (as defined by tempo-free stats). The biggest red flag is pace: Villanova averages 75.8 possessions/game (11th), which we identified as a potential pitfall for a Duke team that averages just 67.3 poss/g (178th). Villanova also has a balanced attack, not relying too much on 3-pointers, 2-pointers, or foul shots.

But there are more red flags for Villanova. First and foremost: they foul. A lot. A defensive free-throw-rate of 49.7 is the ninth-highest in the nation, and Duke's elite free throw ability will give the Blue Devils lots of free points in this matchup. Villanova has had some atrocious games (at Georgetown and vs UConn) in which the opposing team shot more free throws than they made field goals! That is not a very good winning strategy, especially when in general your defense, relative to other supposed NC-contenders, is lacking. They allow a higher-than-average percentage of 3-point shots (3PTA/FGA = 38.2; 316th lowest), and their opponents make an above-average amount (3PT% against = 33.4; 140th). They are not particularly great on the defensive boards (DR% = 68.5; 122nd), and they don't force many turnovers (Defensive Turnover Rate = 22.0; 100th). It would only take an average-shooting night for Duke to win this one handily.

5 (National Championship Game) Kansas State. Okay, I will fully admit that this selection is a little bit of cheating, so let me explain. Should Duke get to the National Championship game, the range of teams they could face is necessarily much better than the range of teams they could face in the earlier rounds. Examining the rest of the Top 10 in the human and computer polls, there are a lot of teams that make for match-ups that are extremely tough to call from a tempo-free perspective, because each team is pretty good at a lot of things. For teams like Syracuse and Kentucky, there aren't any aspects that scream out "Duke is good at this aspect and the opponent sucks at that aspect!" Or vice versa. There are also teams for which there is convincing evidence that Duke would struggle... but that is for the next post. Finally, picking a Hummel-less Purdue just seems too easy, and without Hummel they probably don't make it to the title game anyway. So of the consensus top teams, the one against which Duke would have the easiest time is Kansas State.

The component of Kansas State's game that speaks the loudest is the fouls. I would imagine that most Kansas State games take three hours or more, with how often the Wildcats (Free Throw Rate: 53.3; 2nd) and their opponents (Free Throw Rate Allowed: 46.7, 309th) spend on the free throw line. Kansas State gets more than a quarter of their points from free throws, way more than Pittsburgh (see above), and the 19th most in the nation. However, once they get there they only shoot 66%, with only one of their regulars (Jacob Mullen) north of 75% individually.

They get around all this fouling by having a revolving door for a frontcourt. Three players (6-9 Wally Judge, 6-10 Luis Colon, and 7-0 Jordan Henriquez-Roberts) each grab just 25% minutes (that's 10 min/game) and each manage to put up foul rates above 7.0/40 minutes. The rest of the frontcourt minutes belong to 6-8 Curtis Kelly, who has decent defensive numbers, but isn't a standout offensive star.

From the team perspective, Kansas State does a lot of similar things to Duke. They are one of the few teams better than the Blue Devils on the offensive glass, grabbing 41.2% of their missed shots, which are frequent (since they shoot 36% from beyond the arc and 50% inside it, both rather mediocre). However, Kansas State, whether by design or by ability, struggles on the defensive glass, much more so than Duke does. This has a recipe for being a physical game, in which both teams finish possessions either with offensive rebounds, putting back missed shots, or by heading to the free throw line. There is one wrinkle that makes the comparison less symmetrical: Kansas State has a high turnover rate (21.4; 216th), and while Duke's defensive turnover rate is not great (22.1; 79th), the turnover battle certainly tips the Blue Devils' way. Overall, in a foul-shooting battle, Duke has the definite advantage, given their much higher free throw abilities. The game would come down to which team lost their best interior player due to foul trouble.

This window into Duke's potential road to the national championship is far from perfect. It leaves out a lot of factors, some that are "intangible" and others that are tangible but simply not measured (unless someone wants to watch thousands of hours of tape to quantify how "quick" certain team's guards are). And while my cherry-picking of teams may seem highly beneficial to Duke, I hope that I've at least illustrated that there are teams out there against which Duke stacks up against mightily. The tempo-free statistics say that Duke is the best team in the nation, by a fairly wide margin. While self-hating fans and Duke-haters alike may not consider Duke to be on their list of "scary" teams, there is clearly room for optimism in Durham.

Monday, February 22, 2010

Duke Weaknesses (Perceived and Real)

Clearly, this year's Duke team is neither entitled nor even favored to win the National Championship. The widely respected human opinion polls put, at time of writing, half a dozen teams above a Duke team that has failed to receive a single first-place vote this entire season. Clearly, something scatters the blue sunshine. There are many common memes surrounding Duke, though only a few of them actually apply to this years' team. Let's take a look at some of the common criticisms levied against this year's Duke squad, analyzed from a tempo-free perspective (statistics, as always, from Ken Pomeroy).

1) Duke has played a soft schedule. If we go based on the tempo-free stats, nothing could be further from the truth. Pomeroy ranks Duke's schedule second in the nation, behind West Virginia. Sure, their non-conference schedule is ranked 83rd, but take a look at the teams on top of the non-conference SOS rankings: other than Butler, none of the top 20 have a prayer at an at-large tournament bid. And a number of the teams up top are probably there because they played Duke... this criticism is tired and should be sent to the land of misfit memes.

Making the argument worse is the observation that not only does Duke have the #3 offense in raw efficiency (1.17 points/possession), they have accomplished such lofty efficiency against a harder slate of defenses than anyone in the nation (adjusted efficiency of defenses Duke has played: 0.95 points/possession).

Taken more subjectively, Duke will play 15 non-conference games this regular season (counting Thursday's game against Tulsa). Of these, as many as 7 teams could make the NCAA tournament; what is a better preparation for NCAA tournament play than contests against teams that will actually be there? For comparison, Duke is #2 in the Sagarin strength of schedule as well, and played the fifth-toughest slate in this version of RPI.

2) Lack of backcourt depth. It seems only yesterday that the meme was about the lack of frontcourt depth (well, last year anyway). Assuming that Kyle Singler counts as a guard, Duke's main men play a ton of minutes, while backup shooting guard Andre Dawkins has struggled in limited minutes. I have a problem with this criticism on a number of levels. Part of this argument comes from the same area as the Duke Fade, which is that too many minutes are given to the Big Three. Here's the tempo-free stats.

Player......%Minutes........ORtg....%Possessions
Scheyer.........91.6.............130.7.........23.1
Singler.........89.5............114.3.........23.1
Smith.........82.2.............117.5.........23.1

First of all, pretty cool that all three are used in the same percentage of possessions. Second, if you have three players capable of contributing those massive tempo-free player ratings, why would it be a bad thing to play them all the time? If a player was used so much that they tired in-game, his offensive rating would plummet, due to the extended number of possessions played. Duke's top three men (by % minutes played) have a higher combined rating (362.5) than the top three men on Kansas (358.5), Syracuse (333.4), Purdue (343.1), West Virginia (354.4) and Kentucky (334.5), to pick a few. Scheyer/Singler/Smith are doing it in far more minutes than any of those teams, save perhaps WV, who do get 80%+ minutes from Butler and Jones.

For another avenue of argument, I shall paraphrase what I overheard this morning on 99.9 The Fan in Raleigh: "While Duke has the best trio of players in the country, what happens when they are all shut down?" This, of course, has an easy answer: Duke would lose. The better question is: "what's the likelihood of all three being shut down?" The three Duke stars average a combined 53.8 points/game; the fewest they've scored, combined, was 41 in the Preseason NIT championship win over UConn. In Duke's four losses, the trio has averaged about the same amount, and scored a combined 61 points in the loss at Georgetown. Indeed, it seems that Duke's wins will come down to how many points they can drag out of their frontcourt. Which leads to...

3) Duke has no talent in the frontcourt. I should hope that the last three games would silent anyone who would make such a bold claim. There is clearly some talent in the frontcourt, and that is in the (massive) form of Brian Zoubek. The man who makes girls swoon so hard that their marriage proposal signs are upside down has averaged nearly a double-double (9.7 points, 12.7 rebounds) in those games. With his new-found ability to avoid foul trouble, Zoubek has slipped back above the 40% minutes played mark. This finds him back atop the nation in offensive rebounding. Let's nail that home for the readers too lazy to click on the link:

Category: Percentage of a team's missed shots rebounded by a player while he's on the court.
Qualification: 40% of a team's minutes played (that's about 16 min/game, ignoring OTs)

1. Brian Zoubek, Duke 23.0%
2. Demarcus Cousins, Kentucky, 22.8%
3. Anthony Johnson, Fairfield, 17.4%

So, other than the stud freshman at Kentucky, no one is even close to Brian Zoubek in offensive rebounding ability. Against Maryland, Zoubek used missed shots by the Big Three to have an amazing offensive night, scoring 16 points with 17 rebounds (8 OR). Then, against Virginia Tech, Zoubek benefited from Duke's cold shooting for another 16 rebounds (8 OR). This time he tended to kick the ball out to open men, including two assists on Nolan Smith three-pointers.
The tempo-free world is only six years old, but in that time, only three players have had an OR% above 21%: DeJuan Blair was at 23.6% last year. (The next highest season total is 20.8%).

Of course, there's more to the frontcourt than Zoubek, and the tempo-free stats have mixed feelings about Duke's other big men. Miles Plumlee is a very strong defensive rebounder, and at 22.6% is currently vying for Duke's best defensive rebounding mark since Pomeroy started tracking individual stats in 2005 (Shelden Williams' best was 22.0%). Mason Plumlee currently has the lowest offensive rating on the team (94.5), thanks to a triple-whammy of low effective field goal percentage, small rebounding numbers, and a very high turnover rate. While analysts have been high on Mason's potential, the tempo-free stats haven't shown that this season.

Lance Thomas is generally seen as the "glue" of the 2009-2010 Duke team, and the captain is certainly one of the emotional leaders. As with many in this position, his contributions are said to be "intangible." Indeed, tempo-free statistics don't have much to say about LT. He has an average offensive rating, and the only stat which really stands out is his turnover rate, which is the highest on the team: Duke turns the ball over on 26.1% of his possessions. This could be partially due to Lance, who turns the ball over 1.6 times/game; it could also be unlucky that he's on the court when turnovers occur. Don't view this as a total knock on Thomas, however. These stats of course only include offense, and while Thomas' contribution may not be exactly "intangible" it is certainly "unmeasured" by this system.

Overall, this criticism is still hanging on by a few threads. Clearly, Duke's frontcourt is responsible for a massive number of rebounds, so much so that it can prop up a poor shooting night against many teams. Whether one of the Plumlees or Zoubek can be a viable offensive threat remains to be seen. For now, only Zoubek is impressive from the tempo-free perspective.

For the heck of it, let's throw in:

4) Duke gets all the calls. While pretty hilarious in most seasons, this conspiracy theory has a pretty steep piece of evidence to overcome this year: this season, Duke has a free throw rate (FTA/FGA) of 37.8, which is 171st in the nation, good for almost exactly median. In addition, Duke gets only 21.9% of their points from the charity stripe, good for 119th in the nation. This season, the refs seem to have been paid off by the fans of Kansas State (Free Throw Rate: 53.3) and Wyoming (27% of points from FTs). What Duke does have going for them, however, is free throw percentage: 8th in the nation at 75.9%.

So, what are Duke's real weaknesses, as shown by tempo-free stats?

1) Fouls. As alluded above, Duke struggles more than usual to get to the free throw line, and so cannot benefit as much from their high percentages from the line. While in most years, Duke has multiple players drawing more than 5 fouls per 40 minutes, only Scheyer is above that mark, and no one else is very close.

On the other side of the basketball, committing fouls has been a huge problem, particularly for Brian Zoubek (8.2 Fouls Committed/40 minutes) and Mason Plumlee (7.2 FC/40). In all three of Duke's January losses, they allowed their opponents to get to the free throw line at a rate of about 50 FTA/FGA.

2) Slow Pace On each team's "Game Plan" page, Pomeroy has tables of correlation coefficients. Each of these is a measure of how correlated a particular statistic is with offensive/defensive efficiency. At the moment for Duke, one of these, on offense, is the "pace" of the game, measured by the number of possessions. As Duke's number of possessions goes up, so does their offensive efficiency. For example, Duke's fastest game this season was against Pennsylvania, in which Duke scored 115 points in 75 possessions (Offensive Efficiency: 151.8!) . In Duke's slowest game, at Clemson, the Blue Devils scored just 60 points in 61 possessions, for an efficiency of 98.8. This will definitely be a trend to keep an eye on, should Duke face a team notorious for slow play in March (looking at you, Big Ten).

3) Allowing 2-pointers Duke's defense is ranked 18th in efficiency, but it has it's definite weaknesses. While the Blue Devils are second in the nation in 3-point shooting percentage against (27.4%), they allow opponents to shoot over 45% inside the arc (81st). Put another way, Duke's opponents score 60.8% of their points from 2-point range, which is the 8th most in the nation (to be fair, the 18.4% of opponent's points from beyond the arc is fourth lowest). Duke's strategy clearly makes some sense: it doesn't take a math degree to deduce that 3 > 2. However, some teams are just not built for chucking up treys. This season, Georgetown, NC State, and Wisconsin all used Duke's spread defense to their advantage, scoring backdoor layups and short-range jumpshots.

As the rankings sit today, Duke is the best team in the nation, measured from a tempo-free perspective. They have the best offense, a good defense, and have played a very strong schedule. We've seen that many of the common criticisms of Duke fall apart when seen from the perspective of tempo-free stats. Still, this is a college basketball world without perfection, and while Pomeroy's statistics can help us see that Duke is a little underrated by the human polls, there is still room for improvement as March looms.

Wednesday, February 17, 2010

The Duke Fade: A Response

Earlier today, Alex Fanaroff of Duke's daily newspaper The Chronicle posted a wonderful article, "Duke Does Decline, Objectively Speaking." It's a refreshing look at basketball in a way that we love here at the Immaculate Inning: using tempo-free statistics. Fanaroff describes the general issue:

1) Duke fades down the stretch.
2) This is due to the starters playing way too many minutes, causing them to get tired.

Fanaroff went into the article expecting to at least debunk #1 as a myth, but ended up finding a strong correlation between Duke's efficiency margin (Offensive Efficiency - Defensive Efficiency), and the number of ACC games played. This suggested that over the last seven years, the data indeed does suggest a Duke Fade. Fanaroff was unable, however, to show that "minutes by starters" had any effect on efficiency margin. Naturally, I was intrigued, and wanted to dial further into the data. Here are a number of issues:

1) Year effects do matter. In his accompanying blog post, Faranoff says "Predictably, most seasons failed to produce robust trends due to the limited sample size, though all seasons from 2004-2009 demonstrated a downward trend." This statement is misleading on two accounts. First of all, here is each year plotted individually, with the linear regression line on each:

True, there is a downward trend most years, and Faranoff does mention that Duke's trend is technically upwards this season. Most seasons, that line is not significant: there is not enough evidence to say that the regression line differs from no slope.

However, the null hypothesis can be rejected in 2004 (r-squared 0.2177, p = 0.04406) and 2008 (r-squared 0.2993, p = 0.01879). Here's my question: if a declining efficiency margin during ACC play is a bad thing, then why did the 2004 team make the Final Four, despite having one of of the few significant in-year declines?

2) Opponents matter. As many folks over at the fine institution of DBR have pointed out, Duke always plays Carolina as its last ACC regular season game. Then, while the early rounds of the ACC tourney may provide a brief dip in competition, Duke almost always advances to the end of the tournament, and they naturally find better teams there. We can look at the effect of Opponent Rank (determined by Pomeroy) on Efficiency:




The scatterplot doesn't look like much, but the trend line is there, and it is significant: Duke's opponents get tougher as the season goes on. And Duke does play much better against inferior competition:


Since Duke's opponents are harder later in the season, and Duke has lower efficiency margins against better opponents, how can we take this into effect? One way is with an Analysis of Variance (ANOVA: more info here), in which we can specify variance components. Basically, we have two main effects on efficiency margin: ACC Game # and Opponent Rank. If we account for the variance accounted for by the strength of the opponent, is the correlation still significant? For the stat-geeks out there, here's the ANOVA table:

Anova Table (Type III tests)

Response: Delta.100
......Sum Sq....... Df........ F value........ Pr(>F)
(Intercept) 0.12776 ....1 ........6.0593 ..........0.015267 *
Opp.Rank 0.48675 ..1...... 23.0858 .........4.563e-06 ***
Game 0.19808 .....1 ........9.3947 ...........0.002693 **
Residuals 2.50903 ....119


To summarize, there is still an overall significant downward trend in efficiency margin as the season progresses, although the significance is reduced.

3) Home Court Matters? When Ken Pomeroy adjusts for home court, he adds 1.4% to the home team's offensive efficiency and visiting team's defensive efficiency, and subtracts the same from the home team's DE and visiting team's OE. So the difference between Faranoff's raw data and what Pomeroy would consider "adjusted" is -2.8 for Duke's home games and +2.8 for Duke's away games. What happens to the correlation if we make the "Pomeroy Adjustment"?



The correlation coefficient decreases, as does the significance, but not to the point where the slope becomes non-significant. There is still a Duke Fade when we account for home and away games.

I must say I come away unimpressed with other explanations for the Duke Fade. Opponents do get tougher but not enough to overcome the efficiency drop. Adjusting for home and away games also doesn't have much of an effect. It is very important to note that nowhere have I suggested a causal agent for the Duke Fade. This is to avoid the common fallacy that correlation implies causation. Clearly, Duke can still have an historic season (2004) despite having one of the few significant in-year Duke Fades.

Instead, I'll take an "I Report, You Decide" kind of approach here. These are the statistical facts, and I'll be happy to attempt more rigorous investigations if they are suggested in the comments.

Saturday, January 30, 2010

Duke-Georgetown Tempo-Free Preview

Seven inches of powdery white stuff has fallen from the sky this morning, which means it's a perfect time to stay home and analyze basketball statistics. From a tempo-free standpoint, today's game between Duke and Georgetown is an elite matchup. The Hoyas are ranked 15th in the nation by Pomeroy, but have an adjusted offense (19th) and an adjusted defense (32nd) ranking below their overall level. The reason for this is that most teams around their level in the Pomeroy rankings have one excellent score and one mediocre score (e.g. Villanova: 3rd on offense, 71st on defense).

Georgetown's strength of schedule also boosts their Pomeroy ranking. The Hoyas are coming off a blow out loss at Syracuse (3rd Pomeroy), and overall they rate as the 4th toughest schedule in the nation, and have played against the second toughest slate of defenses of all the division 1 teams. While this means that G-Town is battle tested, it also means that their raw offensive and defensive scores are much lower: they have averaged 108.7 points/100 possessions (58th) while giving up 93.1 points/100 possessions (46th). Part of the low offensive output has come while on the road against good teams-- they failed to crack the 1.0 points/possession mark at both Syracaue and Villanova.

Duke, meanwhile, is a well-kept tempo free secret this year. For whatever reason, Duke always seems to be near the top of the Pomeroy rankings in January and February, and this season is no different. Duke currently sits behind only top-ranked Kansas overall, and the Blue Devils continue to have the best adjusted offense in the country. In fact, it has only been in the past week (since the loss at NC State) that Duke has not also had the highest raw offensive efficiency as well (and are still in the top five). Clearly those numbers were going to come down with ACC play, but like Gerogetown, Duke is playing against some pretty tough defenses (6th toughest defense against).

It's hard to put a finger on what exactly is making Duke's offense so good. Pomeroy lists the "four factors" he believes are most critical to consistent play, regardless of the game's pace. Duke is a top 20 team in two of these factors: turnover rate (16.8% of possessions, 14th) and offensive rebounding percentage (40.1% of missed shots rebounded, 15th). Less impressive is the effective field goal percentage (52.9%, 41st) and FTA/FGA (36.4%, 202nd). Clearly the Duke alums have not been paying the refs off enough this year, because they are not getting to the free throw line very often at all. On the bright side, Duke's 77.0% FT% is fourth in the nation.

This is a game that will be won underneath the baskets. The first issue is Duke's shooting ability: while Duke has their characteristic 3-point shooting ability, they have struggled at times shooting from close range, and rank 79th with just 50.6% from inside the arc. Georgetown, meanwhile, struggles to keep opponents' percentages down. Because this game is being played at the Verizon Center, Duke can be expected to miss their fair share of shots. This shifts the focus to a battle between an elite offensive rebounding unit (Duke) and an elite defensive rebounding unit (Georgetown). For Duke's big men, the goal should be to grab that rebound and then go up strong through the usually foul-conscious Greg Monroe. But if the Hoya sophomore can handle Zoubek and the Plumlees, it could be a long afternoon for Duke.

In the rare event that Duke has a lights-out shooting performance on the road, the final tally could get quite high-scoring. One of the things Georgetown does do well is shoot the ball, with a 55.4% eFG% (12th). However, they are not very skilled at the other four factors (offensive rebounding, limiting turnovers, and getting to the free throw line). It's a strange game to predict because if the game were played at Cameron, one could see Duke making their shots and limiting their opponent's strength on the defensive glass, while the Blue Devils dominate their own defensive glass after limiting Georgetown's shooting ability.

But the game is being played in DC's downtown Chinatown, which adds expectation to Georgetown's shooting ability and detracts from Duke's. That's bad news as Georgetown has four players (Freeman, Vaughn, Wright and Clark) who put up eFG% north of 55. A cold shooting night from Duke and road defense as dismal as shown at NC State, and Duke looks to fall in a big way. However, more recent games have shown Duke's defense to be back on solid ground, and so if Georgetown's shooters are cooled, then look for a fight to the finish.

Prediction: Mason Plumlee and Brian Zoubek, combined, have about as much playing time (86.7% minutes played combined) as Greg Monroe (84.7%). Whoever has the most rebounds (MP2 + Zoo vs Monroe) will be on the victorious team.

Friday, January 22, 2010

Worst Duke Defensive Performances

It's been a while since we've done college basketball here, and I have a few things planned for the coming months, including a systematic re-do of last year's tournament simulation. In the meantime, I want to nip any "Matt's a Duke fanboy" criticism in the proverbial bud by doing a negative post about Duke.

The tempo-free era of college basketball began with the 2003-2004 season as Ken Pomeroy started putting posting his rankings based on offensive and defensive efficiency. Duke, for whatever reason, has always done pretty well during the regular season in Pomeroy's adjusted (for opponent offensive and defensive strength) rankings:

Year/Offensive Efficiency/ Defensive Efficiency
2010/1/18
2009/10/20
2008/11/9
2007/40/5
2006/5/13
2005/15/1
2004/2/4

Nevertheless, Duke has put up some hard-to-believe individual games on both sides of the ball. Defense, however, is so intricately tied to Coach K's philosophy that it is the better choice for a comprehensive breakdown. Defensive ability puts players on the floor for Coach K, and can limit the minutes of potentially explosive offensive perimeter players, if they cannot grasp Duke's defensive philosophy (c.f. Taylor King, Elliot Williams, Andre Dawkins). Usually, the result is a solid defensive gameplan that has consistently been in the top 20 the past six and a half seasons.

There have been great defensive efforts, and not always against inferior competition. In fact, the best defensive performance of the 228 games recorded on kenpom.com is this one, played December 19, 2009 against Gonzaga in Madison Square Garden. Duke held the Zags to an efficiency of 57.1, which is absolutely stellar when you consider that Gonzaga's 2009-2010 average is 111.3! But it has not been all good news for Duke this year, and it is an historic defensive lapse that is the reason for this post. Here are the top 10 worst defensive performances since 2003-2004:

#10 (118.4): January 14 2006 at Clemson (Duke W 87-77). We start with a surprising performance in that not only is it a Duke win, but also a win on the road. Although the 2005-2006 Duke team was frequently spotlighted for it's excellent defense, led by Shelden Williams' shotblocking ability, on this night at Littlejohn, things did not go well. Giving up more than a point per possession to a Clemson team that did not make the NCAA tournament and finished 94th in the nation in adjusted defense was simply not acceptable. Duke had some other defensive clunkers along the way, but eventually fell in the NCAA tournament due to their worst offensive performance in the tempo-free era. (But as Alton Brown might say, that's another post...)

#9 (119.0) November 16 2008 vs Rhode Island (Duke W 82-79). This was the night that RIU's Jimmy Barron almost got a permanent middle name from Duke fans. Barron hit seven straight three-pointers before Dave McClure came off the bench to get a hand in Barron's face, causing him to heave an air-ball with 1:24 remaining and Duke down by 2 (ESPN's play-by-play calls the shot a 2-pointer, but I was at the game and remember differently). This was by far the loudest I've heard Cameron in a non-conference game. I also had the pleasure of high-fiving Jimmy Barron as he ran off the court, and I have not washed that hand since.

#8 (119.1) March 8 2009 at North Carolina (Duke L 79-71). This was the closer of the two Duke-UNC games last year, and both teams were had worse efficiencies than the earlier matchup. The pace was 13 possessions slower as well. The biggest standout is that the Tar Heels rebounded over 40% of their missed shots, while Duke's offensive rebounding percentage was a dismal 18.8%. Eww.

#6t (121.3) February 18 2004 at Wake Forest (Duke L 90-84). Duke's best team during the Tempo-Free era only lost six games all season, and every time they did, they allowed more than a point per possession. It was Duke's second loss in a week (they lost at NC State earlier) and is perhaps the best lesson for Chicken-Little Duke fans and Duke haters alike, concerning this year's Duke team. Despite the efforts of the media and the common fan, "consistency" just doesn't mean anything. Especially in ACC road games. A team can put up clunkers in consecutive games in January and still get within an Okafor of the national championship game.

#6t (121.3) January 21 2006 at Georgetown (Duke L 87-84). From the second paragraph of that ESPN recap: "That's my child," the elder Thompson said. "I love my child. After all he's had to go through, he deserves this." Duke never led in a classic execution of the Princeton offense, and JJ Redick scored 41 points, and with Duke almost matching the Hoyas' offensive efficiency with a 117.3 rating. Free throws were the difference, as Georgetown got to the line more often than Duke. It is unclear what the elder Thompson said of Duke's destruction of Georgetown in 2009.

#5 (124.5) February 22 2009 vs Wake Forest (Duke W 101-91). Last year was Duke's worst from a defensive standpoint, and it shows in this top ten list. This game was the polar opposite of the game a few weeks earlier in Winston-Salem, a two point loss for Duke in which neither team topped 95 in efficiency. The rematch in Cameron however had a similar number of possessions and basically no defense.

#4 (125.8) February 20 2005 vs Wake Forest (Duke W 102-92). An eerily similar game to the one above, although this rematch played out exactly like the preceding game at Wake Forest (see below); the only difference was the result for Duke. Interesting point about the 2004-2005 Duke team; if you check here, you can see all the factors that were correlated to Duke's defensive performance that year. Interestingly, nearly all the factors are in bold, indicating a significant (at the 95% confidence level) correlation. Duke's defensive performance that year was strongly tied to their opponents' shooting, rebounding ability, turnover rate, and ability to get to the free throw line. Defense was also significantly correlated to Duke's own offensive ability that night, especially on Duke's offensive glass. It should come as no surprise, then, that Duke was knocked off in the Sweet 16 in a game to a Michigan State team that was among the best in offensive rebounding, among other things.

#3 (127.0) January 19, 2010 at NC State (Duke L 88-74). Ah, the inspiration for this post. Duke was outplayed on their defensive end in a big way, and NC State was helped by a healthy 62.7 effective field goal percentage. They also protected the ball, turning it over on just 13% of their possessions, and turnover rate seems to be a relative strength of this year's Wolfpack team. Offensive efficiency, however, has not exactly been consistent for NC State, and while they put up similar numbers against Georgia Southern and UNC-G, an effort like this against a top-20 defense was completely unexpected. I see no reason to see the game as anything other than a fluke for Duke, and while winning at Littlejohn tomorrow will be a tough task, if Duke loses it will not be due to another defensive calamity. For this, in K, I trust.

#2 (127.6) February 11 2009 vs North Carolina (Duke L 102-87). Easily the worst Duke game I have ever attended, as Duke was never really in this game. Unlike some of the other games, this was not a defensive lapse against an otherwise mediocre offense. This was a lashing at the hands of the eventual champs who could not stop two All-Americans from doing whatever they want on the court. Yes, I'm a little bitter that I slept outside in a tent to watch this game. Next.

#1 (131.6) February 2 2005 at Wake Forest (Duke L 92-89). So many surprising things about this list, including the number of home games (4) and the number of Duke wins (4) and the number of appearances by the Demon Deacons (4). There is quite a wide gap between #2 and this game, and Duke almost pulled off the win! This was a collision of elite squads, as Duke finished the 04-05 season with the best defense in the nation, while Chris Paul's team was the #2 offense. Wake's inability to stop anyone came back to bite them in March, as they couldn't shoot their way out of early exits in both the ACC and NCAA tournaments.

To be fair, Wake Forest dominated the the middle 36 minutes of the game and it was only the 3-point abilities of the talented Mr. Redick that brought the game close at the end: JJ hit three from beyond the arc and Sean Dockery made it 90-89 with two seconds left. Taron Downey hit both of his free throws and Redick missed a running three to end the game. Notably, Duke's offensive efficiency in this game was 127.3, and so one must conclude that it was indeed Duke's defense that prevented the win. As the rematch (see #4) showed, home court has a lot to do with the outcome of games like these.

The biggest absence from this top 10 list: games in March. For all of the hate heaped upon Duke for its struggles in the month of March, not once did they make an NCAA tournament exit due to a sudden lack of defense. In fact, the worst NCAA tournament performance is (predictably) the 2007 first-round loss to VCU, which is 27th on the list at 112.0, followed by last year's win over Texas (36th at 109.3). But those games are miles from the ones above, and allowing around 110 points/100 possessions is probably to be expected from some of the elite offenses in the nation. Coach K gets his teams prepared on the defensive end for March, combined with playing non-ACC teams unfamiliar with Duke's defensive style. There are relatively more games in ACC tournament on the list.

I have compiled the stats from kenpom.com from all years into one spreadsheet, which can be found here for your sorting pleasure. Tempo-free stats can tell us a lot about a team that we weren't expecting to hear. This would be not-so-subtle foreshadowing of a future post about Duke. For a further hint, see this and this.