<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Tom Flesher</title>
	<atom:link href="http://tomflesher.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://tomflesher.com</link>
	<description>Mercenary Educator and Bad Economist</description>
	<lastBuildDate>Thu, 06 Dec 2012 22:35:01 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='tomflesher.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Tom Flesher</title>
		<link>http://tomflesher.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://tomflesher.com/osd.xml" title="Tom Flesher" />
	<atom:link rel='hub' href='http://tomflesher.com/?pushpress=hub'/>
		<item>
		<title>Why the difference in voting?</title>
		<link>http://tomflesher.com/2011/01/05/why-the-difference-in-voting/</link>
		<comments>http://tomflesher.com/2011/01/05/why-the-difference-in-voting/#comments</comments>
		<pubDate>Wed, 05 Jan 2011 15:51:48 +0000</pubDate>
		<dc:creator>tomflesher</dc:creator>
				<category><![CDATA[Baseball]]></category>
		<category><![CDATA[Economics]]></category>
		<category><![CDATA[Cy Young]]></category>
		<category><![CDATA[Felix Hernandez]]></category>
		<category><![CDATA[Jered Weaver]]></category>
		<category><![CDATA[marginal value]]></category>

		<guid isPermaLink="false">http://tomflesher.com/?p=527</guid>
		<description><![CDATA[This blog has moved. Please visit The World&#8217;s Worst Sports Blog. As much as I love the Angels, I can&#8217;t take Jered&#8217;s side on this one. Today, I was browsing the voting results from the various awards being voted on. Each league&#8217;s Cy Young award voting included the requisite two closers. No surprises there. There [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=tomflesher.com&#038;blog=20518139&#038;post=527&#038;subd=tomflesher&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><em>This blog has moved. Please visit <a href="http://worldsworstsportsblog.com/">The World&#8217;s Worst Sports Blog</a>.</em></p>
<p>As much as I love the Angels, I can&#8217;t take Jered&#8217;s side on this one.</p>
<p>Today, I was browsing the voting results from the various awards being voted on. Each league&#8217;s Cy Young award voting included the requisite two closers. No surprises there. There was also a beautiful case study of the AL Cy Young winner, <strong><a href="http://www.baseball-reference.com/players/h/hernafe02.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Felix  Hernandez</a></strong>, versus <strong><a href="http://www.baseball-reference.com/players/w/weaveje02.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Jered  Weaver</a></strong>. They had identical records (13-12) in an identical number of starts (34) and similar strikeouts (233 for Weaver versus 232 for Hernandez). What explains Hernandez&#8217; winning total of 167 points contra Weaver&#8217;s fifth-place 24?</p>
<p>A few things come to mind:</p>
<ul>
<li><strong>Hernandez went longer.</strong> In the same number of games, wins, and losses, King Felix pitched 249 2/3 innings, whereas Weaver pitched 224 1/3. Those extra 25 1/3 innings show not only that Hernandez was considered more reliable by his manager but that he was, in fact, more reliable (since the extra innings didn&#8217;t result in his stats taking a hit). Hernandez also pitched a formidable 6 complete games with one shutout, whereas Weaver had no pips in either category.</li>
<li><strong>Hernandez was more effective.</strong> Felix gave up fewer runs (80 versus 83) and had a much higher proportion of unearned runs &#8211; fully 21.25% of his runs were unearned, whereas Weaver had about 9.6% of runs unearned. That means that more of Hernandez&#8217;s runs are attributable to defensive mishaps than Weaver&#8217;s. That leaves Felix with a minuscule 2.27 ERA, much lower than Weaver&#8217;s respectable 3.01, and 6 wins above replacement compared with Weaver&#8217;s 5.4.</li>
<li><strong>Hernandez was marginally more effective.</strong> He had six Tough Losses and no Cheap Wins, while Weaver had five Tough Losses and one Cheap Win. Felix couldn&#8217;t rely on his team to supply him with significant run support, while Weaver got that support in his one cheap win.</li>
<li>However, <strong>Hernandez&#8217;s control wasn&#8217;t as good.</strong> Felix walked 70 batters for a control ratio (walks over strikeouts, so lower is better) of .30 and threw 14 wild pitches. Jered, on the other hand, walked only 54 batters, for a control ratio of .23, and threw only 7 wild pitches. Still, it seems reasonable to assume that control suffers exponentially as innings increase, so part of the apparent lack of control can be explained by Hernandez&#8217;s extra innings.</li>
</ul>
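<p>The ratios in the list can be checked directly from the counts quoted in this post. Here is a quick sketch in R; the variable names are just labels picked for illustration. Note that the .30 and .23 figures correspond to walks per strikeout, so the more familiar strikeouts-per-walk figure is shown alongside.</p>

```r
# Counts quoted in the post for the 2010 season
so = c(Hernandez = 232, Weaver = 233)   # strikeouts
bb = c(Hernandez = 70,  Weaver = 54)    # walks

# Walks per strikeout (lower is better)
bb_per_so = round(bb / so, 2)
# Strikeouts per walk (higher is better)
so_per_bb = round(so / bb, 2)

# Innings pitched: 249 2/3 versus 224 1/3
ip = c(Hernandez = 249 + 2/3, Weaver = 224 + 1/3)
extra_innings = ip[["Hernandez"]] - ip[["Weaver"]]

print(bb_per_so)       # Hernandez 0.30, Weaver 0.23
print(so_per_bb)       # Hernandez 3.31, Weaver 4.31
print(extra_innings)   # 25 1/3 innings
```

<p>Either way the ratio is expressed, Weaver comes out ahead on control, which matches the point made in the last bullet.</p>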
<p>Overall, Felix&#8217;s marginal value over Weaver more than explains the difference in voting.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomflesher.com/2011/01/05/why-the-difference-in-voting/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4cc81c8ef60cdc1c146147aed58a6174?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Tom</media:title>
		</media:content>
	</item>
		<item>
		<title>Utility Pitchers II: Alternate Definition</title>
		<link>http://tomflesher.com/2011/01/03/utility-pitchers-ii-alternate-definition/</link>
		<comments>http://tomflesher.com/2011/01/03/utility-pitchers-ii-alternate-definition/#comments</comments>
		<pubDate>Mon, 03 Jan 2011 17:12:12 +0000</pubDate>
		<dc:creator>tomflesher</dc:creator>
				<category><![CDATA[Baseball]]></category>
		<category><![CDATA[Brian Bruney]]></category>
		<category><![CDATA[Bruce Chen]]></category>
		<category><![CDATA[Carlos Zambrano]]></category>
		<category><![CDATA[David Hernandez]]></category>
		<category><![CDATA[Francisco Rodriguez]]></category>
		<category><![CDATA[Hisanori Takahashi]]></category>
		<category><![CDATA[Joe Girardi]]></category>
		<category><![CDATA[Matt Garza]]></category>
		<category><![CDATA[Matt Harrison]]></category>
		<category><![CDATA[Mike Pelfrey]]></category>
		<category><![CDATA[Neftali Feliz]]></category>
		<category><![CDATA[Nelson Figueroa]]></category>
		<category><![CDATA[quality starts]]></category>
		<category><![CDATA[saves]]></category>
		<category><![CDATA[Tom Gorzelanny]]></category>

		<guid isPermaLink="false">http://tomflesher.com/?p=523</guid>
		<description><![CDATA[In the previous post, I discussed utility pitchers, which I defined as players who primarily play a defensive position who are called on to pitch. It never occurred to me that Bleacher Report had previously defined it otherwise &#8211; as a pitcher who can perform well in any role. How can I quantify that? Well, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=tomflesher.com&#038;blog=20518139&#038;post=523&#038;subd=tomflesher&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>In the previous post, I discussed utility pitchers, which I defined as players who primarily play a defensive position who are called on to pitch. It never occurred to me that <a href="http://bleacherreport.com/articles/36370-brad-thompson-utility-pitcher">Bleacher Report</a> had previously defined it otherwise &#8211; as a pitcher who can perform well in any role.</p>
<p>How can I quantify that? Well, it seems to me that a sign of quality as a starter is the vaunted quality start (game score above 50, or six innings with three or fewer runs allowed, depending who you ask), and a sign of quality as a reliever is the save. Thus, a good utility pitcher is one who can muster at least one quality start and at least one save in a given season. It&#8217;s not perfect, since it relies on the manager being willing to insert a primary starter at the right point in a game to earn a save (or starting a primary reliever, as Joe Girardi did with <strong><a href="http://www.baseball-reference.com/players/b/brunebr01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Brian  Bruney</a></strong> back in 2008). Nonetheless, <a href="http://bbref.com/pi/shareit/iroKz">eight pitchers</a> managed that feat this year.</p>
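<p>As a rough sketch, the definition above can be written as two small checks in R. The function names and the two made-up pitchers (&#8220;Starter A&#8221; and &#8220;Closer B&#8221;) are hypothetical and only there for illustration; the Takahashi line uses the totals mentioned in this post.</p>

```r
# One of the two quality-start definitions in the post:
# six or more innings with three or fewer runs allowed
is_quality_start = function(innings, runs) innings >= 6 & 3 >= runs

# A utility pitcher needs at least one quality start and at least one save
is_utility_pitcher = function(quality_starts, saves) {
  quality_starts >= 1 & saves >= 1
}

seasons = data.frame(
  pitcher        = c("Takahashi", "Starter A", "Closer B"),
  quality_starts = c(6, 20, 0),   # second and third rows are invented
  saves          = c(8, 0, 35)
)
seasons$utility = is_utility_pitcher(seasons$quality_starts, seasons$saves)

print(is_quality_start(innings = 7, runs = 2))
print(seasons)
```

<p>Only the pitcher with both a quality start and a save is flagged, which is the whole point of the definition: a pure starter or a pure closer does not qualify no matter how good he is.</p>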
<p>By far the most versatile was <strong><a href="http://www.baseball-reference.com/players/t/takahhi01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Hisanori  Takahashi</a></strong> of the Mets. Tak managed six quality starts, a handful of appearances as a left-handed specialist, and eight saves when he stepped in as the Mets&#8217; closer after <strong><a href="http://www.baseball-reference.com/players/r/rodrifr03.shtml?utm_medium=linker&amp;utm_campaign=Linker">Francisco  Rodriguez</a></strong> <a href="http://sports.espn.go.com/new-york/mlb/news/story?id=5457861">became unavailable</a>.</p>
<p><strong><a href="http://www.baseball-reference.com/players/p/pelfrmi01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Mike  Pelfrey</a></strong> also represented for the Mets, although he made only one relief appearance (in the crazy <a href="http://www.baseball-reference.com/boxes/SLN/SLN201004170.shtml">20-inning game against the Cardinals</a>).</p>
<p><strong><a href="http://www.baseball-reference.com/players/g/garzama01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Matt  Garza</a></strong> of the Rays <a href="http://www.baseball-reference.com/blog/archives/7204">made some news</a> this July when he showed his versatility by starting and saving games in the same series.</p>
<p>The other five pitchers were <strong><a href="http://www.baseball-reference.com/players/c/chenbr01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Bruce  Chen</a></strong>, <strong><a href="http://www.baseball-reference.com/players/f/figuene01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Nelson  Figueroa</a></strong>, <strong><a href="http://www.baseball-reference.com/players/g/gorzeto01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Tom  Gorzelanny</a></strong>, <strong><a href="http://www.baseball-reference.com/players/h/harrima01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Matt  Harrison</a></strong>, and <strong><a href="http://www.baseball-reference.com/player_search.cgi?search=David+Hernandez&amp;utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">David  Hernandez</a></strong>.</p>
<p>Shockingly, <strong><a href="http://www.baseball-reference.com/players/z/zambrca01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Carlos  Zambrano</a></strong> wasn&#8217;t among the pitchers listed, even though he spent some time in the bullpen for the Cubs and some time as a starter. (Big Z was briefly the highest-paid setup man in the league.)</p>
<p>My guess for the 2011 season? <strong><a href="http://www.baseball-reference.com/players/f/felizne01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Neftali  Feliz</a></strong> of the Rangers was among the best closers this year but has the ability to start games as well. Most likely, though, it&#8217;ll be someone like Pelfrey, who was pressed into service in relief for an extra-inning game.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomflesher.com/2011/01/03/utility-pitchers-ii-alternate-definition/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4cc81c8ef60cdc1c146147aed58a6174?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Tom</media:title>
		</media:content>
	</item>
		<item>
		<title>A Utility Pitcher Sidebar</title>
		<link>http://tomflesher.com/2010/12/30/a-utility-pitcher-sidebar/</link>
		<comments>http://tomflesher.com/2010/12/30/a-utility-pitcher-sidebar/#comments</comments>
		<pubDate>Thu, 30 Dec 2010 12:13:26 +0000</pubDate>
		<dc:creator>tomflesher</dc:creator>
				<category><![CDATA[Baseball]]></category>
		<category><![CDATA[Aaron Miles]]></category>
		<category><![CDATA[Andy Marte]]></category>
		<category><![CDATA[Bill Hall]]></category>
		<category><![CDATA[Felipe Lopez]]></category>
		<category><![CDATA[Joe Inglett]]></category>
		<category><![CDATA[Joe Mather]]></category>
		<category><![CDATA[Jonathan Van Every]]></category>
		<category><![CDATA[Jose Canseco]]></category>
		<category><![CDATA[Kevin Cash]]></category>
		<category><![CDATA[position players pitching]]></category>
		<category><![CDATA[Spectrum Club]]></category>
		<category><![CDATA[Wade Boggs]]></category>

		<guid isPermaLink="false">http://tomflesher.com/?p=507</guid>
		<description><![CDATA[The joys of the position player pitching were well represented this year. A whopping eight players came in from the infield or outfield and stood on the mound, more often than not looking pretty comfortable. Two of them &#8211; Aaron Miles and Andy Marte &#8211; joined the Spectrum Club by pitching and being the designated [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=tomflesher.com&#038;blog=20518139&#038;post=507&#038;subd=tomflesher&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>The joys of the position player pitching were well represented this year. A whopping eight players came in from the infield or outfield and stood on the mound, more often than not looking pretty comfortable. Two of them &#8211; <strong><a href="http://www.baseball-reference.com/players/m/milesaa01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Aaron  Miles</a></strong> and <strong><a href="http://www.baseball-reference.com/players/m/martean01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Andy  Marte</a></strong> &#8211; joined the Spectrum Club by pitching and being the designated hitter in the same season, as we discussed in a previous post. Miles&#8217; achievement was even more unlikely because he played for a National League team, so he had to get lucky and DH an interleague game.</p>
<p>Let&#8217;s talk about the average utility pitcher, which is a phrase I just made up to avoid saying &#8220;position player called on to pitch&#8221; over and over again.</p>
<ol>
<li><strong>He&#8217;s a journeyman.</strong> <strong><a href="http://www.baseball-reference.com/players/l/lopezfe01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Felipe  Lopez</a></strong>, who pitched for the Cardinals on <a href="http://www.baseball-reference.com/boxes/SLN/SLN201004170.shtml">April 17</a> in a 20-inning game against the Mets, has played for six teams since 2001. <strong><a href="http://www.baseball-reference.com/players/i/inglejo01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Joe  Inglett</a></strong> has played for three different teams since 2006, and he pitched for the Brewers in a loss on <a href="http://www.baseball-reference.com/boxes/MIL/MIL201007270.shtml">July 27</a>. Backup catcher <strong><a href="http://www.baseball-reference.com/players/c/cashke01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Kevin  Cash</a></strong> has played for five teams since 2002, including Houston, where he pitched in a loss on <a href="http://www.baseball-reference.com/boxes/CIN/CIN201005280.shtml">May 28</a>.</li>
<li><strong>He&#8217;s expendable.</strong> <strong><a href="http://www.baseball-reference.com/players/v/vanevjo01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Jonathan  Van  Every</a></strong>, who pitched for Boston in a <a href="http://www.baseball-reference.com/boxes/BOS/BOS201005080.shtml">May 8</a> loss to the Yankees, has played 39 games over three seasons of bouncing between the minors and the majors. <strong><a href="http://www.baseball-reference.com/players/h/hallbi03.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Bill  Hall</a></strong>, his teammate, pitched on <a href="http://www.baseball-reference.com/boxes/BOS/BOS201005280.shtml">May 28</a> (in a different game than Cash did!) and played six utility positions for Boston during 2010 &#8211; second base, third base, shortstop, and all three outfield positions &#8211; in addition to pitching. <strong><a href="http://www.baseball-reference.com/players/m/mathejo02.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Joe  Mather</a></strong>, who pitched in the same game as Lopez and took the loss, played all three outfield positions and both infield corners. These are guys who are marginal enough that they have to learn a million positions just to be on the roster.</li>
<li><strong>He played for Boston at some point.</strong> Okay, okay, Inglett, Miles, Marte and Mather never did. Fine. But Van Every and Hall both pitched for Boston, Cash has done two unrelated stints with the Red Sox, and Lopez ended the season as Terry Francona&#8217;s utility man. That&#8217;s quite the coincidence, wouldn&#8217;t you agree?</li>
</ol>
<p>Before anyone gripes, there&#8217;s one other type of utility pitcher, but he wasn&#8217;t represented this season. That, of course, is the star who gets his jollies pitching. This includes two prime varieties: the Wade Boggs (wily vet who taught himself a knuckleball) and the <strong><a href="http://www.baseball-reference.com/players/c/cansejo01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Jose  Canseco</a></strong> (idiot who hurts himself).</p>
]]></content:encoded>
			<wfw:commentRss>http://tomflesher.com/2010/12/30/a-utility-pitcher-sidebar/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4cc81c8ef60cdc1c146147aed58a6174?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Tom</media:title>
		</media:content>
	</item>
		<item>
		<title>The Spectrum Club</title>
		<link>http://tomflesher.com/2010/12/28/the-spectrum-club/</link>
		<comments>http://tomflesher.com/2010/12/28/the-spectrum-club/#comments</comments>
		<pubDate>Tue, 28 Dec 2010 14:39:53 +0000</pubDate>
		<dc:creator>tomflesher</dc:creator>
				<category><![CDATA[Baseball]]></category>
		<category><![CDATA[Aaron Miles]]></category>
		<category><![CDATA[Andy Marte]]></category>
		<category><![CDATA[designated hitter]]></category>
		<category><![CDATA[Felipe Lopez]]></category>
		<category><![CDATA[Ike Davis]]></category>
		<category><![CDATA[Jeff Kunkel]]></category>
		<category><![CDATA[Joe Mather]]></category>
		<category><![CDATA[Mark Loretta]]></category>
		<category><![CDATA[Nick Swisher]]></category>
		<category><![CDATA[position players pitching]]></category>
		<category><![CDATA[Spectrum Club]]></category>
		<category><![CDATA[Wade Boggs]]></category>

		<guid isPermaLink="false">http://tomflesher.com/?p=505</guid>
		<description><![CDATA[This year, I get to induct two more players into the prestigious* Spectrum Club. *not a guarantee The Spectrum Club is the elite group of players who play, in one season, at both ends of the Defensive Spectrum. At the end of a season, a player is inducted if he pitches in at least one [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=tomflesher.com&#038;blog=20518139&#038;post=505&#038;subd=tomflesher&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>This year, I get to induct two more players into the prestigious* Spectrum Club.<br />
*not a guarantee</p>
<p><a href="http://bbref.com/pi/shareit/QcwRJ">The Spectrum Club</a> is the elite group of players who play, in one season, at both ends of the Defensive Spectrum. At the end of a season, a player is inducted if he pitches in at least one game and appears as designated hitter in at least one game. As it stands, that leaves about ten pitchers who only served as placeholder DHs but never made a plate appearance on the rolls, but that&#8217;s okay.</p>
<p>Three players have joined the Spectrum Club twice &#8211; <strong><a href="http://www.baseball-reference.com/minors/player.cgi?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker&amp;id=kunkel001jef">Jeff  Kunkel</a></strong> in 1988 and 1989 for Texas, <strong><a href="http://www.baseball-reference.com/players/l/loretma01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Mark  Loretta</a></strong> in 2001 for the Brewers and 2009 for the Dodgers, and Wade Boggs in 1997 for the Yankees and 1999 for Tampa Bay. Baltimore leads the club in inductees with six.</p>
<p>This year&#8217;s first inductee is <strong><a href="http://www.baseball-reference.com/players/m/milesaa01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Aaron  Miles</a></strong> of the Cardinals, who actually pitched twice (August 3 in <a href="http://www.baseball-reference.com/boxes/SLN/SLN201008030.shtml">a loss</a> to Houston and September 28 in <a href="http://www.baseball-reference.com/boxes/SLN/SLN201009280.shtml">a loss</a> to Pittsburgh). Making it more impressive, Miles DHed only once, in an interleague win over Kansas City on <a href="http://www.baseball-reference.com/boxes/KCA/KCA201006260.shtml">June 26</a>. Miles is an experienced pitcher, having tossed twice in 2007 and once in 2008. Tony La Russa has quite the commodity there, and I bet he wishes he&#8217;d had Miles on hand for <a href="http://www.baseball-reference.com/boxes/SLN/SLN201004170.shtml">that crazy 20-inning game against the Mets</a> on April 17.</p>
<p>The second player to join the club is <strong><a href="http://www.baseball-reference.com/players/m/martean01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Andy  Marte</a></strong> of Cleveland. Marte DHed twice, once on <a href="http://www.baseball-reference.com/boxes/TBA/TBA201007100.shtml">July 10</a> in a loss to the Rays and once on <a href="http://www.baseball-reference.com/boxes/ANA/ANA201009070.shtml">September 7</a>. His single inning pitched came as part of <a title="The Best Game Ever" href="http://tomflesher.com/2010/07/30/the-best-game-ever/">the Best Game Ever</a>, a <a href="http://www.baseball-reference.com/boxes/CLE/CLE201007290.shtml">July 29</a> loss to the Yankees in which the Yankees lost their DH and Marte struck out <strong><a href="http://www.baseball-reference.com/players/s/swishni01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Nick  Swisher</a></strong>.</p>
<p>Who&#8217;s the smart money on for Spectrum Club inductions in 2011? <strong><a href="http://www.baseball-reference.com/players/m/mathejo02.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Joe  Mather</a></strong> and <strong><a href="http://www.baseball-reference.com/players/l/lopezfe01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Felipe  Lopez</a></strong> are both reasonable hitters and both pitched for Tony La Russa in the Mets-Cardinals game. If Lopez stays with the Red Sox, he might be called on to DH an odd late game, and Terry Francona has been known to use position players in emergencies. <strong><a href="http://www.baseball-reference.com/players/d/davisik02.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Ike  Davis</a></strong> may well be asked to DH interleague games for the Mets, and he was a closer in college, so he&#8217;d be a solid emergency reliever. If I had to guess, though, I&#8217;d figure that the next Spectrum Club inductee will be <strong><a href="http://www.baseball-reference.com/players/s/swishni01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Nick  Swisher</a></strong> getting his second induction for the Yankees.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomflesher.com/2010/12/28/the-spectrum-club/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4cc81c8ef60cdc1c146147aed58a6174?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Tom</media:title>
		</media:content>
	</item>
		<item>
		<title>Hit Batsman Roundup, 2010</title>
		<link>http://tomflesher.com/2010/12/26/hit-batsman-roundup-2010/</link>
		<comments>http://tomflesher.com/2010/12/26/hit-batsman-roundup-2010/#comments</comments>
		<pubDate>Sun, 26 Dec 2010 20:56:14 +0000</pubDate>
		<dc:creator>tomflesher</dc:creator>
				<category><![CDATA[Baseball]]></category>
		<category><![CDATA[Brett Carroll]]></category>
		<category><![CDATA[hit batsman]]></category>
		<category><![CDATA[hit by pitch]]></category>
		<category><![CDATA[Hunter Pence]]></category>
		<category><![CDATA[Kevin Youkilis]]></category>
		<category><![CDATA[Omar Infante]]></category>
		<category><![CDATA[Raul Ibanez]]></category>
		<category><![CDATA[regression]]></category>
		<category><![CDATA[Rickie Weeks]]></category>
		<category><![CDATA[Scott Podsednik]]></category>
		<category><![CDATA[spurious correlation]]></category>
		<category><![CDATA[Victor Martinez]]></category>

		<guid isPermaLink="false">http://tomflesher.com/?p=498</guid>
		<description><![CDATA[There&#8217;s very little more subtle and involved than the quiet elegance of a batter getting beaned. In fact, that particular strategy was invoked 1549 times in 2010, with 419 batters getting plunked at least once. The absolute leader this season was not Kevin Youkilis or Brett Carroll but Rickie Weeks, who led with 25 HBP [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=tomflesher.com&#038;blog=20518139&#038;post=498&#038;subd=tomflesher&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>There&#8217;s very little more subtle and involved than the quiet elegance of a batter getting beaned. In fact, that particular strategy was invoked 1549 times in 2010, with 419 batters getting plunked at least once.</p>
<p>The <a href="http://bbref.com/pi/shareit/DKZvv">absolute leader</a> this season was not <strong><a href="http://www.baseball-reference.com/players/y/youklke01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Kevin  Youkilis</a></strong> or <strong><a href="http://www.baseball-reference.com/players/c/carrobr01.shtml">Brett  Carroll</a></strong> but <strong><a href="http://www.baseball-reference.com/players/w/weeksri01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Rickie  Weeks</a></strong>, who led with 25 HBP in 754 plate appearances. Put another way, Weeks got hit in 3.32% of his plate appearances. That&#8217;s almost once every 30 plate appearances, or nearly four times the MLB-wide rate of 0.83%. (Incidentally, that league-wide figure is total HBP divided by total plate appearances. The more skewed mean of the individual players&#8217; percentages is 0.58%.) What leads to such a high number of plunkings?</p>
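<p>For anyone who wants to check the arithmetic, here is a small R sketch. The Weeks numbers and the 0.83% league rate come from this post; the four-row <code>toy</code> table is invented purely to show why a pooled rate (total HBP over total PA) and a mean of per-player rates can differ.</p>

```r
# Rickie Weeks, 2010 (figures from the post)
hbp = 25
pa  = 754
weeks_rate   = hbp / pa
weeks_pct    = round(100 * weeks_rate, 2)      # share of PA ending in a HBP
pa_per_hbp   = round(pa / hbp, 1)              # plate appearances per plunking
times_league = round(weeks_rate / 0.0083, 1)   # multiple of the MLB-wide rate

print(c(weeks_pct, pa_per_hbp, times_league))  # 3.32 30.2 4.0

# Invented players: one regular, one part-timer, two with a handful of PA
toy = data.frame(HBP = c(25, 2, 0, 0), PA = c(754, 400, 12, 3))
pooled_rate   = sum(toy$HBP) / sum(toy$PA)     # weights each player by PA
mean_of_rates = mean(toy$HBP / toy$PA)         # weights each player equally

print(c(pooled = pooled_rate, mean_of_rates = mean_of_rates))
```

<p>In the toy table the players with very few plate appearances drag the unweighted mean down, which is the same direction as the gap between 0.83% and 0.58% above.</p>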
<p>I would assume that a few things would go into the decision to hit a batter intentionally:</p>
<ul>
<li>Pitchers are less likely to be hit by other pitchers.</li>
<li>If a hitter is likely to get on base anyway, he&#8217;s more likely to be hit &#8211; you don&#8217;t lose anything by putting him on base, and you control the damage by limiting him to one base.</li>
<li>If a batter is likely to hit for extra bases, he&#8217;s more likely to be hit.</li>
<li>If a batter is likely to steal a base, he&#8217;s less likely to be hit, but there is an offsetting effect for caught stealing.</li>
<li>American League batters are more likely to be hit because of the moral hazard effect of pitchers not having to bat.</li>
</ul>
<p>With that in mind, I set up a regression in R using every player who had at least one plate appearance in 2010. I added binary variables for Pitcher (1 if the player&#8217;s primary position is pitcher, 0 otherwise) and Lg (1 if the player played the entire season in the American League, 0 otherwise), then regressed <em>HBP/PA</em> on <em>Pitcher, Lg, BB, HR, OBP, SLG, SB,</em> and <em>CS</em>. The results were somewhat surprising:</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;">Call:
<a href="http://inside-r.org/r-doc/stats/lm"><span style="color:#003399;font-weight:bold;">lm</span></a><span style="color:#009900;">(</span><a href="http://inside-r.org/r-doc/stats/formula"><span style="color:#003399;font-weight:bold;">formula</span></a> = hbppa ~ Pitcher + Lg + BB + HR + OBP + SLG + SB +
    CS<span style="color:#009900;">)</span>
 
Residuals:
       Min         1Q     Median         3Q        Max
-<span style="color:#cc66cc;">0.0154027</span> -<span style="color:#cc66cc;">0.0059081</span> -<span style="color:#cc66cc;">0.0018096</span>  <span style="color:#cc66cc;">0.0001845</span>  <span style="color:#cc66cc;">0.1397065</span>
 
Coefficients:
              Estimate Std. Error t value Pr<span style="color:#009900;">(</span>&gt;|t|<span style="color:#009900;">)</span>
<span style="color:#009900;">(</span>Intercept<span style="color:#009900;">)</span>  6.847e-03  9.815e-04   <span style="color:#cc66cc;">6.975</span> 5.77e-12 ***
Pitcher     -5.399e-03  9.136e-04  -<span style="color:#cc66cc;">5.909</span> 4.81e-09 ***
Lg          -1.614e-03  7.054e-04  -<span style="color:#cc66cc;">2.289</span>   <span style="color:#cc66cc;">0.0223</span> *
BB          -1.412e-05  3.257e-05  -<span style="color:#cc66cc;">0.434</span>   <span style="color:#cc66cc;">0.6647</span>
HR           1.122e-04  7.956e-05   <span style="color:#cc66cc;">1.411</span>   <span style="color:#cc66cc;">0.1587</span>
OBP          8.570e-03  3.477e-03   <span style="color:#cc66cc;">2.465</span>   <span style="color:#cc66cc;">0.0139</span> *
SLG         -3.451e-03  2.468e-03  -<span style="color:#cc66cc;">1.398</span>   <span style="color:#cc66cc;">0.1624</span>
SB          -6.749e-05  8.693e-05  -<span style="color:#cc66cc;">0.776</span>   <span style="color:#cc66cc;">0.4377</span>
CS           1.770e-04  2.646e-04   <span style="color:#cc66cc;">0.669</span>   <span style="color:#cc66cc;">0.5036</span>
---
Signif. codes:  <span style="color:#cc66cc;">0</span> ‘***’ <span style="color:#cc66cc;">0.001</span> ‘**’ <span style="color:#cc66cc;">0.01</span> ‘*’ <span style="color:#cc66cc;">0.05</span> ‘.’ <span style="color:#cc66cc;">0.1</span> ‘ ’ <span style="color:#cc66cc;">1</span>
 
Residual standard error: <span style="color:#cc66cc;">0.01042</span> on <span style="color:#cc66cc;">935</span> degrees of freedom
Multiple R-squared: <span style="color:#cc66cc;">0.08839</span><span style="color:#339933;">,</span>    Adjusted R-squared: <span style="color:#cc66cc;">0.08059</span>
F-statistic: <span style="color:#cc66cc;">11.33</span> on <span style="color:#cc66cc;">8</span> and <span style="color:#cc66cc;">935</span> DF<span style="color:#339933;">,</span>  p-value: 2.07e-15</pre>
</div>
</div>
<p><a title="Created by Pretty R at inside-R.org" href="http://www.inside-r.org/pretty-r">Created by Pretty R at inside-R.org</a></p>
<p>That&#8217;s right &#8211; only <em>Pitcher, Lg, OBP, HR,</em> and <em>SLG</em> are even marginally significant (80% level), and of those only <em>Pitcher, Lg,</em> and <em>OBP</em> clear the 95% level. <em>BB, SB,</em> and <em>CS</em> aren&#8217;t even close. Why not?</p>
<p>Well, for one, the number of stolen bases and times caught stealing are relatively small no matter what, so there probably isn&#8217;t enough data to pick up an effect. For another, there probably just isn&#8217;t as much intent to hit batters as we&#8217;d like to pretend.</p>
<p>The other surprise is that American Leaguers are <strong>less</strong> likely to be hit. This baffles me a little bit.</p>
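<p>As a quick cross-check, the significance cutoffs can be applied mechanically to the p-values printed in the coefficient table. This is only a sketch: the vector name <code>p</code> is made up for illustration, and the values are copied by hand from the output above.</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># p-values copied from the Pr(&gt;|t|) column of the regression output
p &lt;- c(Pitcher = 4.81e-09, Lg = 0.0223, BB = 0.6647, HR = 0.1587,
       OBP = 0.0139, SLG = 0.1624, SB = 0.4377, CS = 0.5036)

# Terms significant at the 95% level (p below 0.05)
names(p)[p &lt; 0.05]

# Terms that clear only the looser 80% level (p from 0.05 up to 0.20)
names(p)[p &gt;= 0.05 &amp; p &lt; 0.20]</pre>
</div>
</div>
<p>The first call picks out <em>Pitcher, Lg,</em> and <em>OBP</em>; the second picks out <em>HR</em> and <em>SLG</em>.</p>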
<p>Also, keep in mind that this model shouldn&#8217;t be expected to, and cannot, explain all or even most of the variation in hit batsmen. The R-squared is about .09, meaning that it explains about 9% of the variation. It ignores probably the most important factor, physics, entirely. (That is, the model doesn&#8217;t have any way to account for accidental plunkings.) As a side note, other regressions show there might be an effect for plate appearances, meaning you&#8217;re more likely to get hit by chance alone if you take enough pitches.</p>
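<p>The adjusted R-squared in the output can be reproduced from the multiple R-squared and the degrees of freedom, which is a handy sanity check on the sample size. A minimal sketch follows; the variable names are chosen here for illustration, and the inputs are read off the summary above.</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;">r2 &lt;- 0.08839      # Multiple R-squared from the output
k  &lt;- 8            # regressors, not counting the intercept
df &lt;- 935          # residual degrees of freedom
n  &lt;- df + k + 1   # implied number of players in the sample

# Adjusted R-squared penalizes each extra regressor
adj_r2 &lt;- 1 - (1 - r2) * (n - 1) / df
round(adj_r2, 5)   # 0.08059, the value reported by summary()</pre>
</div>
</div>
<p>The implied sample is 944 players, and with that many observations the penalty for eight regressors is small, so the adjusted figure stays close to the raw 9%.</p>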
<p>Finally, there are some guys who manage to do the opposite of Weeks&#8217; feat. Houston outfielder <strong><a href="http://www.baseball-reference.com/players/p/pencehu01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Hunter  Pence</a></strong> went 156 games and 658 plate appearances without getting plunked at all. Honorable mentions go to <strong><a href="http://www.baseball-reference.com/players/i/ibanera01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Raul  Ibanez</a></strong>, <strong><a href="http://www.baseball-reference.com/players/p/podsesc01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Scott  Podsednik</a></strong>, <strong><a href="http://www.baseball-reference.com/players/m/martivi01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Victor  Martinez</a></strong>, and <strong><a href="http://www.baseball-reference.com/players/i/infanom01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Omar  Infante</a></strong>, all of whom went over 500 plate appearances without a beaning. Now THAT&#8217;S plate discipline.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomflesher.com/2010/12/26/hit-batsman-roundup-2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4cc81c8ef60cdc1c146147aed58a6174?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Tom</media:title>
		</media:content>
	</item>
		<item>
		<title>Weird Pitching Decisions Almanac in 2010</title>
		<link>http://tomflesher.com/2010/12/24/weird-pitching-decisions-almanac-in-2010/</link>
		<comments>http://tomflesher.com/2010/12/24/weird-pitching-decisions-almanac-in-2010/#comments</comments>
		<pubDate>Sat, 25 Dec 2010 02:50:25 +0000</pubDate>
		<dc:creator>tomflesher</dc:creator>
				<category><![CDATA[Baseball]]></category>
		<category><![CDATA[baseball-reference.com]]></category>
		<category><![CDATA[Carl Pavano]]></category>
		<category><![CDATA[Cheap Wins]]></category>
		<category><![CDATA[Clayton Kershaw]]></category>
		<category><![CDATA[Colby Lewis]]></category>
		<category><![CDATA[Cubs]]></category>
		<category><![CDATA[Felix Hernandez]]></category>
		<category><![CDATA[Francisco Rodriguez]]></category>
		<category><![CDATA[Hiroki Kuroda]]></category>
		<category><![CDATA[Jeremy Affeldt]]></category>
		<category><![CDATA[John Lackey]]></category>
		<category><![CDATA[Justin Verlander]]></category>
		<category><![CDATA[Mariners]]></category>
		<category><![CDATA[Phil Hughes]]></category>
		<category><![CDATA[Red Sox]]></category>
		<category><![CDATA[Rodrigo Lopez]]></category>
		<category><![CDATA[Roy Oswalt]]></category>
		<category><![CDATA[Royals]]></category>
		<category><![CDATA[Tommy Hanson]]></category>
		<category><![CDATA[Tough Losses]]></category>
		<category><![CDATA[Tyler Clippard]]></category>
		<category><![CDATA[vulture wins]]></category>

		<guid isPermaLink="false">http://tomflesher.com/?p=494</guid>
		<description><![CDATA[I&#8217;m a big fan of weird pitching decisions. A pitcher with a lot of tough losses pitches effectively but stands behind a team with crappy run support. A pitcher with a high proportion of cheap wins gets lucky more often than not. A reliever with a lot of vulture wins might as well be taking [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=tomflesher.com&#038;blog=20518139&#038;post=494&#038;subd=tomflesher&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I&#8217;m a big fan of weird pitching decisions. A pitcher with a lot of tough losses pitches effectively but stands behind a team with crappy run support. A pitcher with a high proportion of cheap wins gets lucky more often than not. A reliever with a lot of vulture wins might as well be taking the loss.</p>
<p>In an earlier post, I defined a <a title="Tough Losses" href="http://worldsworstsportsblog.com/2010/07/08/tough-losses/">tough loss</a> two ways. The official definition is a loss in which the starting pitcher made a quality start &#8211; that is, six or more innings with three or fewer earned runs. The Bill James definition is the same, except that James defines a quality start as having a <a href="http://en.wikipedia.org/wiki/Game_score">game score</a> of 50 or higher. In either case, tough losses result from solid pitching combined with anemic run support.</p>
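<p>The two definitions can be sketched in R as follows. The function and argument names are hypothetical, and the sample pitching line is invented for illustration rather than taken from any real game. The game score arithmetic follows Bill James&#8217;s original formula, and the quality start test counts earned runs, as the official statistic does.</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># Bill James game score: start at 50, reward outs, innings completed
# after the fourth, and strikeouts; penalize hits, runs, and walks.
game_score &lt;- function(outs, k, h, er, uer, bb) {
  innings_after_4th &lt;- max(0, outs %/% 3 - 4)
  50 + outs + 2 * innings_after_4th + k - 2 * h - 4 * er - 2 * uer - bb
}

# Official quality start: at least six innings (18 outs), at most three earned runs
quality_start &lt;- function(outs, er) outs &gt;= 18 &amp;&amp; er &lt;= 3

# A tough loss is a loss in a quality start, under either definition
tough_loss_official &lt;- function(outs, er, lost) lost &amp;&amp; quality_start(outs, er)
tough_loss_james &lt;- function(outs, k, h, er, uer, bb, lost) {
  lost &amp;&amp; game_score(outs, k, h, er, uer, bb) &gt;= 50
}

# Invented line: 7 IP, 6 H, 2 ER, 0 unearned, 1 BB, 5 K, taking the loss
game_score(outs = 21, k = 5, h = 6, er = 2, uer = 0, bb = 1)   # 61
tough_loss_official(outs = 21, er = 2, lost = TRUE)            # TRUE
tough_loss_james(21, 5, 6, 2, 0, 1, lost = TRUE)               # TRUE</pre>
</div>
</div>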
<p>This year&#8217;s <a href="http://bbref.com/pi/shareit/UfTqj">Tough Loss leaderboard</a> had 457 games spread around 183 pitchers across both leagues. The Dodgers&#8217; <strong><a href="http://www.baseball-reference.com/players/k/kurodhi01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Hiroki  Kuroda</a></strong> led the league with a whopping eight losses in starts with game scores of 50 or more. He was followed by eight players with six tough losses each: <strong><a href="http://www.baseball-reference.com/players/v/verlaju01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Justin  Verlander</a></strong>, <strong><a href="http://www.baseball-reference.com/players/p/pavanca01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Carl  Pavano</a></strong>, <strong><a href="http://www.baseball-reference.com/players/o/oswalro01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Roy  Oswalt</a></strong>, <strong><a href="http://www.baseball-reference.com/players/l/lopezro02.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Rodrigo  Lopez</a></strong>, <strong><a href="http://www.baseball-reference.com/players/l/lewisco01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Colby  Lewis</a></strong>, <strong><a href="http://www.baseball-reference.com/players/k/kershcl01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Clayton  Kershaw</a></strong>, <strong><a href="http://www.baseball-reference.com/players/h/hernafe02.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Felix  Hernandez</a></strong>, and <strong><a href="http://www.baseball-reference.com/players/h/hansoto01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Tommy  Hanson</a></strong>. Kuroda&#8217;s Dodgers led the league with 23 tough losses, followed by the Mariners and the Cubs with 22 each.</p>
<p>There were fewer <a title="Cheap Wins" href="http://worldsworstsportsblog.com/2010/07/16/cheap-wins/">cheap wins</a>, in which a pitcher does not make a quality start but does earn the win. The <a href="http://bbref.com/pi/shareit/JZtWR">Cheap Win leaderboard</a> had 248 games and 136 pitchers, led by <strong><a href="http://www.baseball-reference.com/players/l/lackejo01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">John  Lackey</a></strong> with six and <strong><a href="http://www.baseball-reference.com/players/h/hugheph01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Phil  Hughes</a></strong> with 5. Hughes pitched to 18 wins, but Lackey&#8217;s six cheap wins were almost half of his 14-win total this year. That really shows what kind of run support he had. The Royals and the Red Sox were tied for first place with 15 team cheap wins each.</p>
<p>Finally, a <a href="http://bbref.com/pi/shareit/BWevR">vulture win</a> is one for the relievers. I define a vulture win as a blown save and a win in the same game, so I searched Baseball Reference for players with blown saves and then looked for the largest number of wins. <strong><a href="http://www.baseball-reference.com/players/c/clippty01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Tyler  Clippard</a></strong> was the clear winner here. In six blown saves, he got 5 vulture wins. <strong><a href="http://www.baseball-reference.com/player_search.cgi?search=Francisco+Rodriguez&amp;utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Francisco  Rodriguez</a></strong> and <strong><a href="http://www.baseball-reference.com/players/a/affelje01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Jeremy  Affeldt</a></strong> each deserve credit, though &#8211; each had three blown saves and converted all three for vulture wins. (When I say &#8220;converted,&#8221; I mean &#8220;waited it out for their team to score more runs.&#8221;)</p>
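<p>The vulture win counts quoted above can be put side by side with a conversion rate. This is a small sketch using only the three relievers and the figures named in the paragraph; the data frame and column names are made up for illustration.</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># Blown saves and vulture wins as quoted in the text
relievers &lt;- data.frame(
  pitcher      = c("Clippard", "Rodriguez", "Affeldt"),
  blown_saves  = c(6, 3, 3),
  vulture_wins = c(5, 3, 3)
)

# Share of blown saves that ended up as wins for the same pitcher
relievers$vulture_rate &lt;- relievers$vulture_wins / relievers$blown_saves

# Sort by raw vulture wins, most first
relievers[order(-relievers$vulture_wins), ]</pre>
</div>
</div>
<p>Clippard leads on volume with five, while Rodriguez and Affeldt lead on rate at three for three.</p>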
]]></content:encoded>
			<wfw:commentRss>http://tomflesher.com/2010/12/24/weird-pitching-decisions-almanac-in-2010/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4cc81c8ef60cdc1c146147aed58a6174?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Tom</media:title>
		</media:content>
	</item>
		<item>
		<title>Pitchers Hit This Year (or, Two Guys Named Buchholz)</title>
		<link>http://tomflesher.com/2010/12/23/pitchers-hit-this-year-or-two-guys-named-buchholz/</link>
		<comments>http://tomflesher.com/2010/12/23/pitchers-hit-this-year-or-two-guys-named-buchholz/#comments</comments>
		<pubDate>Thu, 23 Dec 2010 16:48:42 +0000</pubDate>
		<dc:creator>tomflesher</dc:creator>
				<category><![CDATA[Baseball]]></category>
		<category><![CDATA[baseball-reference.com]]></category>
		<category><![CDATA[Bruce Chen]]></category>
		<category><![CDATA[Clay Buccholz]]></category>
		<category><![CDATA[Evan Meek]]></category>
		<category><![CDATA[George Sherrill]]></category>
		<category><![CDATA[Gustavo Chacin]]></category>
		<category><![CDATA[hit by pitch]]></category>
		<category><![CDATA[Jack Taschner]]></category>
		<category><![CDATA[Joe Blanton]]></category>
		<category><![CDATA[Kenley Jansen]]></category>
		<category><![CDATA[Manny Aybar]]></category>
		<category><![CDATA[Matt Reynolds]]></category>
		<category><![CDATA[Pitchers batting]]></category>
		<category><![CDATA[Taylor Buccholz]]></category>
		<category><![CDATA[weird lines]]></category>
		<category><![CDATA[Yovani Gallardo]]></category>

		<guid isPermaLink="false">http://tomflesher.com/?p=484</guid>
		<description><![CDATA[Okay, I admit it. This post was originally conceived as a way to talk about the supremely weird line put up by Gustavo Chacin, who in his only plate appearance for Houston hit a home run to leave him with the maximum season OPS of 5.0. Unfortunately, Raphy at Baseball Reference beat me to it. Instead, I noticed while [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=tomflesher.com&#038;blog=20518139&#038;post=484&#038;subd=tomflesher&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Okay, I admit it. This post was originally conceived as a way to talk about the supremely weird line put up by <strong><a href="http://www.baseball-reference.com/players/c/chacigu01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Gustavo  Chacin</a></strong>, who in his only plate appearance for Houston hit a home run to leave him with the maximum season OPS of 5.0. Unfortunately, Raphy at Baseball Reference <a href="http://www.baseball-reference.com/blog/archives/9556">beat me to it</a>. Instead, I noticed while I was browsing the NL&#8217;s home run log to prepare to run some diagnostics on it that <strong><a href="http://www.baseball-reference.com/players/j/janseke01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Kenley  Jansen</a></strong> had two plate appearances comprising one hit and one walk. (Seriously, is there anything this kid can&#8217;t do?)</p>
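<p>The 5.0 ceiling falls straight out of the definitions: OPS is on-base percentage plus slugging average, and a home run in a player&#8217;s only at-bat maxes out both. A short sketch of the arithmetic, with variable names chosen here for illustration:</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># One plate appearance, one at-bat, one home run
ab &lt;- 1; h &lt;- 1; bb &lt;- 0; hbp &lt;- 0; sf &lt;- 0
tb &lt;- 4                                         # a home run is worth four total bases

obp &lt;- (h + bb + hbp) / (ab + bb + hbp + sf)    # 1.000
slg &lt;- tb / ab                                  # 4.000
ops &lt;- obp + slg
ops                                             # 5</pre>
</div>
</div>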
<p>In Kenley&#8217;s case, that&#8217;s not entirely surprising, since he was a catcher until this season. His numbers weren&#8217;t great, but <a href="http://www.baseball-reference.com/minors/player.cgi?id=jansen001ken#standard_batting::none">he was competent</a>. What surprised me was that <a href="http://www.baseball-reference.com/play-index/season_finder.cgi?as=result_batter&amp;offset=0&amp;sum=0&amp;min_year_season=2000&amp;max_year_season=2010&amp;min_season=1&amp;max_season=-1&amp;min_age=0&amp;max_age=99&amp;lg_ID=lgAny&amp;lgAL_team=tmAny&amp;lgNL_team=tmAny&amp;lgFL_team=tmAny&amp;lgAA_team=tmAny&amp;lgPL_team=tmAny&amp;lgUA_team=tmAny&amp;lgNA_team=tmAny&amp;isFA=either&amp;isActive=either&amp;isHOF=either&amp;isAllstar=either&amp;bats=any&amp;throws=any&amp;games_min_max=min&amp;games_prop=50&amp;games_tot=&amp;exactness=anymarked&amp;pos_1=1&amp;qualifiersSeason=nomin&amp;minpasValS=502&amp;mingamesValS=100&amp;qualifiersCareer=nomin&amp;minpasValC=3000&amp;mingamesValC=1000&amp;orderby=batting_avg&amp;submitter=1&amp;c1criteria=PA&amp;c1gtlt=gt&amp;c1val=1&amp;c2gtlt=eq&amp;c2val=0&amp;c3gtlt=eq&amp;c3val=0&amp;c4gtlt=eq&amp;c4val=0&amp;c5gtlt=eq&amp;c5val=1.0&amp;location=pob&amp;locationMatch=is&amp;pob=&amp;pod=&amp;pcanada=&amp;pusa=#ajax_result_table::none">75 pitchers since 2000 have finished the season with a perfect batting average</a>. Nine were from this year, including <strong><a href="http://www.baseball-reference.com/players/b/buchhcl01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Clay  Buchholz</a></strong> and his distant cousin <strong><a href="http://www.baseball-reference.com/players/b/buchhta01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Taylor  Buchholz</a></strong>. <strong><a href="http://www.baseball-reference.com/players/m/meekev01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Evan  Meek</a></strong> and <strong><a href="http://www.baseball-reference.com/players/c/chenbr01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Bruce  Chen</a></strong> matched Jansen&#8217;s two plate appearances without an out. None of the perfect batting average crowd had an extra-base hit except for Chacin.</p>
<p>Since 2000, the most plate appearances by a pitcher who kept a perfect batting average was four, by <strong><a href="http://www.baseball-reference.com/players/a/aybarma01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Manny  Aybar</a></strong> in 2000.</p>
<p>At the other end of the spectrum, this year only three pitchers managed a perfect 1.000 on-base percentage without getting any hits at all. <strong><a href="http://www.baseball-reference.com/players/s/sherrge01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">George  Sherrill</a></strong> and <strong><a href="http://www.baseball-reference.com/players/r/reynoma02.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Matt  Reynolds</a></strong> both walked in their only plate appearances; <strong><a href="http://www.baseball-reference.com/players/t/taschja01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Jack  Taschner</a></strong> went them one better by recording a sacrifice hit in a second plate appearance.</p>
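<p>Taschner&#8217;s line works because a sacrifice hit counts as a plate appearance but is left out of the on-base percentage denominator. A sketch of that bookkeeping, assuming (as the paragraph implies but does not state outright) that his other plate appearance was a walk; the list name and fields are made up for illustration:</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># Assumed season line: one walk and one sacrifice hit, no official at-bats
taschner &lt;- list(ab = 0, h = 0, bb = 1, hbp = 0, sf = 0, sh = 1)

# Plate appearances include the sacrifice hit...
pa &lt;- with(taschner, ab + bb + hbp + sf + sh)

# ...but the OBP denominator does not
obp &lt;- with(taschner, (h + bb + hbp) / (ab + bb + hbp + sf))

c(PA = pa, OBP = obp)   # two plate appearances, OBP of 1</pre>
</div>
</div>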
<p>Finally, to round things out, this year saw <strong><a href="http://www.baseball-reference.com/players/b/blantjo01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Joe  Blanton</a></strong> and <em>Heureusement, ici, c&#8217;est le Blog</em>&#8217;s favorite pitcher, <strong><a href="http://www.baseball-reference.com/players/g/gallayo01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Yovani  Gallardo</a></strong>, each get hit by two pitches. Gallardo had clearly angered other pitchers by being so much more awesome than they were.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomflesher.com/2010/12/23/pitchers-hit-this-year-or-two-guys-named-buchholz/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4cc81c8ef60cdc1c146147aed58a6174?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Tom</media:title>
		</media:content>
	</item>
		<item>
		<title>Are This Year&#039;s Home Runs Really That Different?</title>
		<link>http://tomflesher.com/2010/12/22/are-this-years-home-runs-really-that-different/</link>
		<comments>http://tomflesher.com/2010/12/22/are-this-years-home-runs-really-that-different/#comments</comments>
		<pubDate>Thu, 23 Dec 2010 01:23:06 +0000</pubDate>
		<dc:creator>tomflesher</dc:creator>
				<category><![CDATA[Baseball]]></category>
		<category><![CDATA[Economics]]></category>
		<category><![CDATA[Carlos Pena]]></category>
		<category><![CDATA[Carlos Quentin]]></category>
		<category><![CDATA[home run distributions]]></category>
		<category><![CDATA[home runs]]></category>
		<category><![CDATA[Jose Bautista]]></category>
		<category><![CDATA[kurtosis]]></category>
		<category><![CDATA[Mark Teixeira]]></category>
		<category><![CDATA[Miguel Cabrera]]></category>
		<category><![CDATA[Paul Konerko]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[skewness]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://tomflesher.com/?p=470</guid>
		<description><![CDATA[This year&#8217;s home runs are quite confounding. On the one hand, home runs per game in the AL have dropped precipitously (as noted and examined in the two previous posts). On the other hand, Jose Bautista had an absolutely outstanding year. How much different is this year&#8217;s distribution than those of previous years? To answer [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=tomflesher.com&#038;blog=20518139&#038;post=470&#038;subd=tomflesher&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://tomflesher.files.wordpress.com/2010/12/001u4334_josc3a9_bautista1.jpg"><img class="size-thumbnail wp-image-469 alignleft" title="001U4334" src="http://tomflesher.files.wordpress.com/2010/12/001u4334_josc3a9_bautista1.jpg?w=135&#038;h=150" alt="" width="135" height="150" /></a>This year&#8217;s home runs are quite confounding. On the one hand, home runs per game in the AL have dropped precipitously (as noted and examined in the two previous posts). On the other hand, <strong><a href="http://www.baseball-reference.com/players/b/bautijo02.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Jose  Bautista</a></strong> had an absolutely outstanding year. How different is this year&#8217;s distribution from those of previous years? To answer that question, I took off to Baseball Reference and found the list of all players with at least one plate appearance, sorted by home runs.</p>
<p>There are several parameters that are of interest when discussing the distribution of events. <a href="http://tomflesher.files.wordpress.com/2010/12/alhr2010dist1.jpg"><img class="alignright size-thumbnail wp-image-468" title="alhr2010dist" src="http://tomflesher.files.wordpress.com/2010/12/alhr2010dist1.jpg?w=150&#038;h=150" alt="" width="150" height="150" /></a>The first is the mean. This year&#8217;s mean was 5.43, meaning that of the players with at least one plate appearance, on average each one hit 5.43 homers. That&#8217;s down from 6.53 last year and 5.66 in 2008.</p>
<p>Next, consider the <a href="http://en.wikipedia.org/wiki/Variance">variance</a> and <a href="http://en.wikipedia.org/wiki/Standard_deviation">standard deviation</a>. (The variance is the standard deviation squared, so the numbers derive similarly.) A low variance means that the numbers are clumped tightly around the mean. This year&#8217;s variance was 68.4, down from last year&#8217;s 84.64 but up from 2008&#8217;s 66.44.</p>
<p>The <a href="http://en.wikipedia.org/wiki/Skewness">skewness</a> and <a href="http://en.wikipedia.org/wiki/">kurtosis</a> represent the length and thickness of the tails, respectively. Since a lot of people have very <a href="http://tomflesher.files.wordpress.com/2010/12/alhr2009dist1.jpg"><img class="size-thumbnail wp-image-467 alignleft" title="alhr2009dist" src="http://tomflesher.files.wordpress.com/2010/12/alhr2009dist1.jpg?w=150&#038;h=150" alt="" width="150" height="150" /></a>few home runs, the skewness of every year&#8217;s distribution is going to be positive. Roughly, that means that there are observations far larger than the mean, but very few that are far smaller. That makes sense, since there&#8217;s no such thing as a negative home run total. The kurtosis number represents how pointy the distribution is, or alternatively how much of the distribution is found in the tail.</p>
<p>For example, in 2009, <strong><a href="http://www.baseball-reference.com/players/t/teixema01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Mark  Teixeira</a></strong> and <strong><a href="http://www.baseball-reference.com/players/p/penaca01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Carlos  Pena</a></strong> jointly led the American League in home runs with 39. There was a high mean, but the tail was relatively thin with a <a href="http://tomflesher.files.wordpress.com/2010/12/alhr2008dist1.jpg"><img class="alignright size-thumbnail wp-image-466" title="alhr2008dist" src="http://tomflesher.files.wordpress.com/2010/12/alhr2008dist1.jpg?w=150&#038;h=150" alt="" width="150" height="150" /></a>high variance. Compared with this year, when Bautista led his nearest competitor (<strong><a href="http://www.baseball-reference.com/players/k/konerpa01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Paul  Konerko</a></strong>) by 15 home runs and only 8 players were over 30 home runs, 2009 saw 15 players above 30 home runs with a pretty tight race for the lead. Kurtosis in 2010 was 7.72 compared with 2009&#8217;s 4.56 and 2008&#8217;s 5.55. (In 2008, 11 players were above the 30-mark, and <strong><a href="http://www.baseball-reference.com/players/c/cabremi01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Miguel  Cabrera</a></strong>&#8217;s 37 home runs edged <strong><a href="http://www.baseball-reference.com/players/q/quentca01.shtml?utm_source=direct&amp;utm_medium=linker&amp;utm_campaign=Linker">Carlos  Quentin</a></strong> by just one.)</p>
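<p>For anyone who wants to reproduce these summary statistics, all four can be computed in base R from a vector of home run totals. The sketch below uses a short made-up vector, not the actual 2010 data, and it computes moment-based skewness and raw kurtosis; the post does not say which kurtosis convention produced the figures above, so the excess version (raw minus 3) is shown as well.</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># Made-up home run totals for illustration only (not the real 2010 AL data)
hr &lt;- c(0, 0, 0, 1, 2, 3, 5, 8, 12, 20, 31, 54)

m  &lt;- mean(hr)
m2 &lt;- mean((hr - m)^2)              # second central moment
skew &lt;- mean((hr - m)^3) / m2^1.5   # positive when the right tail is long
kurt &lt;- mean((hr - m)^4) / m2^2     # raw kurtosis; a normal curve gives 3

c(mean = m, variance = var(hr), sd = sd(hr),
  skewness = skew, kurtosis = kurt, excess_kurtosis = kurt - 3)</pre>
</div>
</div>
<p>Note that <code>var()</code> and <code>sd()</code> use the sample (n &#8211; 1) denominator, while the skewness and kurtosis lines use plain moments. With several hundred players the difference is negligible.</p>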
<p>The numbers say that 2008 and 2009 were much more similar than either of them is to 2010. A quick look at the distributions bears that out &#8211; this was a weird year.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomflesher.com/2010/12/22/are-this-years-home-runs-really-that-different/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4cc81c8ef60cdc1c146147aed58a6174?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Tom</media:title>
		</media:content>

		<media:content url="http://tomflesher.files.wordpress.com/2010/12/001u4334_josc3a9_bautista1.jpg?w=135" medium="image">
			<media:title type="html">001U4334</media:title>
		</media:content>

		<media:content url="http://tomflesher.files.wordpress.com/2010/12/alhr2010dist1.jpg?w=150" medium="image">
			<media:title type="html">alhr2010dist</media:title>
		</media:content>

		<media:content url="http://tomflesher.files.wordpress.com/2010/12/alhr2009dist1.jpg?w=150" medium="image">
			<media:title type="html">alhr2009dist</media:title>
		</media:content>

		<media:content url="http://tomflesher.files.wordpress.com/2010/12/alhr2008dist1.jpg?w=150" medium="image">
			<media:title type="html">alhr2008dist</media:title>
		</media:content>
	</item>
		<item>
		<title>Diagnosing the AL</title>
		<link>http://tomflesher.com/2010/12/22/diagnosing-the-al/</link>
		<comments>http://tomflesher.com/2010/12/22/diagnosing-the-al/#comments</comments>
		<pubDate>Wed, 22 Dec 2010 21:20:26 +0000</pubDate>
		<dc:creator>tomflesher</dc:creator>
				<category><![CDATA[Baseball]]></category>
		<category><![CDATA[Economics]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[American League]]></category>
		<category><![CDATA[baseball-reference.com]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[regression]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[Year of the Pitcher]]></category>

		<guid isPermaLink="false">http://tomflesher.com/?p=463</guid>
		<description><![CDATA[In the previous post, I crunched some numbers on a previous forecast I&#8217;d made and figured out that it was a pretty crappy forecast. (That&#8217;s the fun of forecasting, of course &#8211; sometimes you&#8217;re right and sometimes you&#8217;re wrong.) The funny part of it, though, is that the predicted home runs per game for the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=tomflesher.com&#038;blog=20518139&#038;post=463&#038;subd=tomflesher&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>In the previous post, I crunched some numbers on a previous forecast I&#8217;d made and figured out that it was a pretty crappy forecast. (That&#8217;s the fun of forecasting, of course &#8211; sometimes you&#8217;re right and sometimes you&#8217;re wrong.) The funny part of it, though, is that the forecast of home runs per game for the American League was so far off &#8211; the actual figure came in 3.4 standard errors below the predicted value &#8211; that it&#8217;s highly unlikely that the regression model I used controls for all relevant variables. That&#8217;s not surprising, since it was only a time trend with a dummy variable for the designated hitter.</p>
<p>There are a couple of things to check for immediately. The first is the most common explanation thrown around when home runs drop &#8211; steroids. It seems to me that if the drop in home runs were due to better control of performance-enhancing drugs, then it should mostly be home runs that are affected. For example, intentional walks should probably be below expectation, since intentional walks are used to protect against a home run hitter. Unintentional walks should probably be about as expected, since walks are a function of plate discipline and pitcher control, not of strength. On-base percentage should probably drop by a smaller magnitude than home runs, since some hits that would have been home runs will stay in the park as singles, doubles, or triples. Finally, slugging average should drop because a loss in power without a corresponding increase in speed will lower total bases.</p>
<p>I&#8217;ll analyze these with pretty new R code behind the cut.</p>
<p><span id="more-463"></span>Using R, I fitted time-series models of the same functional form as the home runs per game model. I pulled the data from the Baseball-Reference.com AL Batting Encyclopedia and regressed the variable of interest on a time trend, its square, and a dummy for the designated hitter.</p>
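<p>For readers following along, the three regressors might be built along these lines. This is a sketch under stated assumptions, not the exact setup used here: the data frame name <code>al</code>, the 1955 starting year, and the indexing <code>t = year - 1954</code> are guesses, chosen so that 2010 maps to the value 56 plugged into the fitted-value calculation in the results. The designated hitter dummy switches on in 1973, the first AL season with the DH.</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># Hypothetical season range; the real data come from Baseball-Reference
year &lt;- 1955:2010

al &lt;- data.frame(
  year = year,
  t    = year - 1954,                  # assumed indexing: 2010 is t = 56
  DH   = as.integer(year &gt;= 1973)      # 1 once the AL has the designated hitter
)
al$tsq &lt;- al$t^2                        # squared trend lets the curve bend

tail(al, 3)

# With a per-game response column (for example IBB) merged in, the fit would be:
# ibb.lm &lt;- lm(IBB ~ t + tsq + DH, data = al)</pre>
</div>
</div>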
<p><span style="text-decoration:underline;"><strong>First Assumption:</strong></span> Intentional walks should decrease.</p>
<p><strong><span style="text-decoration:underline;">Results:</span></strong></p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;">&gt; ibb.lm &lt;- <a href="http://inside-r.org/r-doc/stats/lm"><span style="color:#003399;font-weight:bold;">lm</span></a><span style="color:#009900;">(</span>IBB ~ <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> + tsq + DH<span style="color:#009900;">)</span>
&gt; <a href="http://inside-r.org/r-doc/base/summary"><span style="color:#003399;font-weight:bold;">summary</span></a><span style="color:#009900;">(</span>ibb.lm<span style="color:#009900;">)</span>
 
Call:
<a href="http://inside-r.org/r-doc/stats/lm"><span style="color:#003399;font-weight:bold;">lm</span></a><span style="color:#009900;">(</span><a href="http://inside-r.org/r-doc/stats/formula"><span style="color:#003399;font-weight:bold;">formula</span></a> = IBB ~ <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> + tsq + DH<span style="color:#009900;">)</span>
 
Residuals:
       Min         1Q     Median         3Q        Max
-<span style="color:#cc66cc;">0.1350376</span> -<span style="color:#cc66cc;">0.0261969</span>  <span style="color:#cc66cc;">0.0005516</span>  <span style="color:#cc66cc;">0.0294412</span>  <span style="color:#cc66cc;">0.1534536</span>
 
Coefficients:
              Estimate Std. Error <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> value Pr<span style="color:#009900;">(</span>&gt;|t|<span style="color:#009900;">)</span>
<span style="color:#009900;">(</span>Intercept<span style="color:#009900;">)</span>  2.656e-01  1.408e-02  <span style="color:#cc66cc;">18.870</span>  &lt; 2e-16 ***
<a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a>            8.037e-03  1.199e-03   <span style="color:#cc66cc;">6.706</span> 1.01e-09 ***
tsq         -1.393e-04  2.024e-05  -<span style="color:#cc66cc;">6.882</span> 4.30e-10 ***
DH          -1.140e-01  1.055e-02 -<span style="color:#cc66cc;">10.805</span>  &lt; 2e-16 ***
---
Signif. codes:  <span style="color:#cc66cc;">0</span> ‘***’ <span style="color:#cc66cc;">0.001</span> ‘**’ <span style="color:#cc66cc;">0.01</span> ‘*’ <span style="color:#cc66cc;">0.05</span> ‘.’ <span style="color:#cc66cc;">0.1</span> ‘ ’ <span style="color:#cc66cc;">1</span>
 
Residual standard error: <span style="color:#cc66cc;">0.04689</span> on <span style="color:#cc66cc;">106</span> degrees of freedom
Multiple R-squared: <span style="color:#cc66cc;">0.5961</span><span style="color:#339933;">,</span>     Adjusted R-squared: <span style="color:#cc66cc;">0.5847</span>
F-statistic: <span style="color:#cc66cc;">52.14</span> on <span style="color:#cc66cc;">3</span> and <span style="color:#cc66cc;">106</span> DF<span style="color:#339933;">,</span>  p-value: &lt; 2.2e-16
 
&gt; ibb.2010.fitted &lt;- <span style="color:#009900;">(</span>2.656e-01<span style="color:#009900;">)</span> + <span style="color:#009900;">(</span>8.037e-03<span style="color:#009900;">)</span>*<span style="color:#cc66cc;">56</span> + <span style="color:#009900;">(</span>-1.393e-04<span style="color:#009900;">)</span>*<span style="color:#009900;">(</span><span style="color:#cc66cc;">56</span>**<span style="color:#cc66cc;">2</span><span style="color:#009900;">)</span> + <span style="color:#009900;">(</span>-1.140e-01<span style="color:#009900;">)</span>
&gt; ibb.2010.obs &lt;- <span style="color:#cc66cc;">.2</span>
&gt; residual.ibb &lt;- ibb.2010.obs - ibb.2010.fitted
&gt; se.ibb &lt;- <span style="color:#cc66cc;">.04689</span>
&gt; residual.ibb/se.ibb
<span style="color:#009900;">[</span><span style="color:#cc66cc;">1</span><span style="color:#009900;">]</span> <span style="color:#cc66cc;">0.750113</span></pre>
</div>
</div>
<p><a title="Created by Pretty R at inside-R.org" href="http://www.inside-r.org/pretty-r">Created by Pretty R at inside-R.org</a></p>
<p>Intentional walks per game came in above the fitted value rather than below it, but by less than one standard error. Statistically, intentional walks were right where the trend says they should be.</p>
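<p>The hand calculation above can be wrapped in a small helper. This is only a sketch: the function name <code>std_resid</code> and its arguments are my own invention, and it takes the coefficients exactly as rounded in the <code>summary()</code> printout.</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># Standardized residual for a quadratic-trend model with a DH dummy.
# coefs is c(intercept, t, tsq, DH) as printed by summary();
# rse is the residual standard error. Names are illustrative only.
std_resid &lt;- function(coefs, t, dh, observed, rse) {
  fitted &lt;- sum(coefs * c(1, t, t^2, dh))
  (observed - fitted) / rse
}

# Intentional walks per game in 2010 (t = 56, DH in effect)
ibb_coefs &lt;- c(2.656e-01, 8.037e-03, -1.393e-04, -1.140e-01)
ibb_z &lt;- std_resid(ibb_coefs, t = 56, dh = 1, observed = 0.2, rse = 0.04689)
ibb_z   # about 0.75, matching the console output above</pre>
</div>
</div>
<p>Keeping the arithmetic in one function means the other three statistics only need a new coefficient vector, observed value, and standard error.</p>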
<p><strong><span style="text-decoration:underline;">Second Assumption:</span></strong> Unintentional walks should not change.</p>
<p><strong><span style="text-decoration:underline;">Results:</span></strong></p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;">&gt; uBB &lt;- <span style="color:#009900;">(</span>BB-IBB<span style="color:#009900;">)</span>
&gt; ubb.lm &lt;- <a href="http://inside-r.org/r-doc/stats/lm"><span style="color:#003399;font-weight:bold;">lm</span></a><span style="color:#009900;">(</span>uBB ~ <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> + tsq + DH<span style="color:#009900;">)</span>
&gt; <a href="http://inside-r.org/r-doc/base/summary"><span style="color:#003399;font-weight:bold;">summary</span></a><span style="color:#009900;">(</span>ubb.lm<span style="color:#009900;">)</span>
 
Call:
<a href="http://inside-r.org/r-doc/stats/lm"><span style="color:#003399;font-weight:bold;">lm</span></a><span style="color:#009900;">(</span><a href="http://inside-r.org/r-doc/stats/formula"><span style="color:#003399;font-weight:bold;">formula</span></a> = uBB ~ <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> + tsq + DH<span style="color:#009900;">)</span>
 
Residuals:
     Min       1Q   Median       3Q      Max
-<span style="color:#cc66cc;">0.69256</span> -<span style="color:#cc66cc;">0.12758</span> -<span style="color:#cc66cc;">0.01390</span>  <span style="color:#cc66cc;">0.13178</span>  <span style="color:#cc66cc;">0.77866</span>
 
Coefficients:
              Estimate Std. Error <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> value Pr<span style="color:#009900;">(</span>&gt;|t|<span style="color:#009900;">)</span>
<span style="color:#009900;">(</span>Intercept<span style="color:#009900;">)</span>  <span style="color:#cc66cc;">3.0879505</span>  <span style="color:#cc66cc;">0.0732669</span>  <span style="color:#cc66cc;">42.147</span>  &lt; 2e-16 ***
<a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a>           -<span style="color:#cc66cc;">0.0190285</span>  <span style="color:#cc66cc;">0.0062392</span>  -<span style="color:#cc66cc;">3.050</span> <span style="color:#cc66cc;">0.002892</span> **
tsq          <span style="color:#cc66cc;">0.0003623</span>  <span style="color:#cc66cc;">0.0001054</span>   <span style="color:#cc66cc;">3.439</span> <span style="color:#cc66cc;">0.000837</span> ***
DH           <span style="color:#cc66cc;">0.1812598</span>  <span style="color:#cc66cc;">0.0549094</span>   <span style="color:#cc66cc;">3.301</span> <span style="color:#cc66cc;">0.001313</span> **
---
Signif. codes:  <span style="color:#cc66cc;">0</span> ‘***’ <span style="color:#cc66cc;">0.001</span> ‘**’ <span style="color:#cc66cc;">0.01</span> ‘*’ <span style="color:#cc66cc;">0.05</span> ‘.’ <span style="color:#cc66cc;">0.1</span> ‘ ’ <span style="color:#cc66cc;">1</span>
 
Residual standard error: <span style="color:#cc66cc;">0.2441</span> on <span style="color:#cc66cc;">106</span> degrees of freedom
Multiple R-squared: <span style="color:#cc66cc;">0.1876</span><span style="color:#339933;">,</span>     Adjusted R-squared: <span style="color:#cc66cc;">0.1647</span>
F-statistic: <span style="color:#cc66cc;">8.162</span> on <span style="color:#cc66cc;">3</span> and <span style="color:#cc66cc;">106</span> DF<span style="color:#339933;">,</span>  p-value: 6.127e-05
 
&gt; ubb.2010.fitted &lt;- <span style="color:#cc66cc;">3.0879505</span> + <span style="color:#009900;">(</span>-<span style="color:#cc66cc;">.0190285</span><span style="color:#009900;">)</span>*<span style="color:#cc66cc;">56</span> + <span style="color:#009900;">(</span><span style="color:#cc66cc;">.0003623</span><span style="color:#009900;">)</span>*<span style="color:#009900;">(</span><span style="color:#cc66cc;">56</span>**<span style="color:#cc66cc;">2</span><span style="color:#009900;">)</span> + <span style="color:#cc66cc;">.1812598</span>
&gt; ubb.2010.obs &lt;- <span style="color:#cc66cc;">3.25</span> - <span style="color:#cc66cc;">.2</span>
&gt; residual.ubb &lt;- ubb.2010.obs - ubb.2010.fitted
&gt; se.ubb &lt;- <span style="color:#cc66cc;">.2441</span>
&gt; residual.ubb/se.ubb
<span style="color:#009900;">[</span><span style="color:#cc66cc;">1</span><span style="color:#009900;">]</span> -<span style="color:#cc66cc;">1.187166</span></pre>
</div>
</div>
<p>Unintentional walks came in a bit over one standard error below the fitted value. Again, that isn&#8217;t a big enough deviation to call it statistically different from our expectation.</p>
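<p>To put a number on &#8220;not big enough,&#8221; one option is a two-sided p-value from the t distribution on the model&#8217;s 106 residual degrees of freedom. This is a sketch under the same simplification used throughout, namely that the residual standard error stands in for the forecast standard error, which ignores uncertainty in the coefficients themselves.</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># Two-sided p-value for a standardized residual.
z_ubb    &lt;- -1.187166   # standardized residual reported above
df_resid &lt;- 106         # residual degrees of freedom from summary(ubb.lm)
p_ubb    &lt;- 2 * pt(-abs(z_ubb), df = df_resid)
p_ubb                   # well above the usual 0.05 cutoff</pre>
</div>
</div>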
<p><strong><span style="text-decoration:underline;">Third Assumption:</span></strong> OBP drops, but by somewhat less than the 3.4 standard errors by which home runs fell.</p>
<p><strong><span style="text-decoration:underline;">Results:</span></strong></p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;">&gt; obp.lm &lt;- <a href="http://inside-r.org/r-doc/stats/lm"><span style="color:#003399;font-weight:bold;">lm</span></a><span style="color:#009900;">(</span>OBP ~ <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> + tsq + DH<span style="color:#009900;">)</span>
&gt; <a href="http://inside-r.org/r-doc/base/summary"><span style="color:#003399;font-weight:bold;">summary</span></a><span style="color:#009900;">(</span>obp.lm<span style="color:#009900;">)</span>
 
Call:
<a href="http://inside-r.org/r-doc/stats/lm"><span style="color:#003399;font-weight:bold;">lm</span></a><span style="color:#009900;">(</span><a href="http://inside-r.org/r-doc/stats/formula"><span style="color:#003399;font-weight:bold;">formula</span></a> = OBP ~ <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> + tsq + DH<span style="color:#009900;">)</span>
 
Residuals:
       Min         1Q     Median         3Q        Max
-<span style="color:#cc66cc;">0.0217348</span> -<span style="color:#cc66cc;">0.0044903</span>  <span style="color:#cc66cc;">0.0002799</span>  <span style="color:#cc66cc;">0.0046695</span>  <span style="color:#cc66cc;">0.0182481</span>
 
Coefficients:
              Estimate Std. Error <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> value Pr<span style="color:#009900;">(</span>&gt;|t|<span style="color:#009900;">)</span>
<span style="color:#009900;">(</span>Intercept<span style="color:#009900;">)</span>  3.238e-01  2.230e-03 <span style="color:#cc66cc;">145.199</span>  &lt; 2e-16 ***
<a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a>           -5.703e-04  1.899e-04  -<span style="color:#cc66cc;">3.003</span>  <span style="color:#cc66cc;">0.00334</span> **
tsq          1.472e-05  3.207e-06   <span style="color:#cc66cc;">4.591</span> 1.22e-05 ***
DH           8.245e-03  1.671e-03   <span style="color:#cc66cc;">4.933</span> 3.02e-06 ***
---
Signif. codes:  <span style="color:#cc66cc;">0</span> ‘***’ <span style="color:#cc66cc;">0.001</span> ‘**’ <span style="color:#cc66cc;">0.01</span> ‘*’ <span style="color:#cc66cc;">0.05</span> ‘.’ <span style="color:#cc66cc;">0.1</span> ‘ ’ <span style="color:#cc66cc;">1</span>
 
Residual standard error: <span style="color:#cc66cc;">0.00743</span> on <span style="color:#cc66cc;">106</span> degrees of freedom
Multiple R-squared: <span style="color:#cc66cc;">0.487</span><span style="color:#339933;">,</span>      Adjusted R-squared: <span style="color:#cc66cc;">0.4724</span>
F-statistic: <span style="color:#cc66cc;">33.54</span> on <span style="color:#cc66cc;">3</span> and <span style="color:#cc66cc;">106</span> DF<span style="color:#339933;">,</span>  p-value: 2.532e-15
 
&gt; obp.2010.fitted &lt;- <span style="color:#009900;">(</span>3.238e-01<span style="color:#009900;">)</span> + <span style="color:#009900;">(</span>-5.703e-04<span style="color:#009900;">)</span>*<span style="color:#cc66cc;">56</span> + <span style="color:#009900;">(</span>1.472e-05<span style="color:#009900;">)</span>*<span style="color:#009900;">(</span><span style="color:#cc66cc;">56</span>**<span style="color:#cc66cc;">2</span><span style="color:#009900;">)</span> + 8.245e-03
&gt; obp.2010.obs &lt;- <span style="color:#cc66cc;">.327</span>
&gt; residual.obp &lt;- obp.2010.obs - obp.2010.fitted
&gt; se.obp &lt;- <span style="color:#cc66cc;">.00743</span>
&gt; residual.obp/se.obp
<span style="color:#009900;">[</span><span style="color:#cc66cc;">1</span><span style="color:#009900;">]</span> -<span style="color:#cc66cc;">2.593556</span></pre>
</div>
</div>
<p>OBP dropped by about 2.6 standard errors, which is less than the home run drop but still quite a bit. Without more information it&#8217;s hard to judge whether a change of this magnitude is due to better pitching or power being taken away from hitters.</p>
<p><strong><span style="text-decoration:underline;">Fourth Assumption:</span></strong> Slugging average will drop.</p>
<p><strong><span style="text-decoration:underline;">Results:</span></strong></p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;">&gt; slg.lm &lt;- <a href="http://inside-r.org/r-doc/stats/lm"><span style="color:#003399;font-weight:bold;">lm</span></a><span style="color:#009900;">(</span>SLG ~ <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> + tsq + DH<span style="color:#009900;">)</span>
&gt; <a href="http://inside-r.org/r-doc/base/summary"><span style="color:#003399;font-weight:bold;">summary</span></a><span style="color:#009900;">(</span>slg.lm<span style="color:#009900;">)</span>
 
Call:
<a href="http://inside-r.org/r-doc/stats/lm"><span style="color:#003399;font-weight:bold;">lm</span></a><span style="color:#009900;">(</span><a href="http://inside-r.org/r-doc/stats/formula"><span style="color:#003399;font-weight:bold;">formula</span></a> = SLG ~ <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> + tsq + DH<span style="color:#009900;">)</span>
 
Residuals:
       Min         1Q     Median         3Q        Max
-<span style="color:#cc66cc;">0.0357646</span> -<span style="color:#cc66cc;">0.0087050</span> -<span style="color:#cc66cc;">0.0007988</span>  <span style="color:#cc66cc;">0.0115133</span>  <span style="color:#cc66cc;">0.0317497</span>
 
Coefficients:
              Estimate Std. Error <a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a> value Pr<span style="color:#009900;">(</span>&gt;|t|<span style="color:#009900;">)</span>
<span style="color:#009900;">(</span>Intercept<span style="color:#009900;">)</span>  3.937e-01  4.471e-03  <span style="color:#cc66cc;">88.050</span>  &lt; 2e-16 ***
<a href="http://inside-r.org/r-doc/base/t"><span style="color:#003399;font-weight:bold;">t</span></a>           -2.058e-03  3.807e-04  -<span style="color:#cc66cc;">5.404</span> 4.04e-07 ***
tsq          5.049e-05  6.429e-06   <span style="color:#cc66cc;">7.853</span> 3.51e-12 ***
DH           1.693e-02  3.351e-03   <span style="color:#cc66cc;">5.054</span> 1.82e-06 ***
---
Signif. codes:  <span style="color:#cc66cc;">0</span> ‘***’ <span style="color:#cc66cc;">0.001</span> ‘**’ <span style="color:#cc66cc;">0.01</span> ‘*’ <span style="color:#cc66cc;">0.05</span> ‘.’ <span style="color:#cc66cc;">0.1</span> ‘ ’ <span style="color:#cc66cc;">1</span>
 
Residual standard error: <span style="color:#cc66cc;">0.01489</span> on <span style="color:#cc66cc;">106</span> degrees of freedom
Multiple R-squared: <span style="color:#cc66cc;">0.6452</span><span style="color:#339933;">,</span>     Adjusted R-squared: <span style="color:#cc66cc;">0.6352</span>
F-statistic: <span style="color:#cc66cc;">64.27</span> on <span style="color:#cc66cc;">3</span> and <span style="color:#cc66cc;">106</span> DF<span style="color:#339933;">,</span>  p-value: &lt; 2.2e-16
 
&gt; slg.2010.fitted &lt;- <span style="color:#009900;">(</span>3.937e-01<span style="color:#009900;">)</span> + <span style="color:#009900;">(</span>-2.058e-03<span style="color:#009900;">)</span>*<span style="color:#cc66cc;">56</span> + <span style="color:#009900;">(</span>5.049e-05<span style="color:#009900;">)</span>*<span style="color:#009900;">(</span><span style="color:#cc66cc;">56</span>**<span style="color:#cc66cc;">2</span><span style="color:#009900;">)</span> + <span style="color:#009900;">(</span>1.693e-02<span style="color:#009900;">)</span>
&gt; slg.2010.obs &lt;- <span style="color:#cc66cc;">.407</span>
&gt; residual.slg &lt;- slg.2010.obs - slg.2010.fitted
&gt; se.slg &lt;- <span style="color:#cc66cc;">.01489</span>
&gt; residual.slg/se.slg
<span style="color:#009900;">[</span><span style="color:#cc66cc;">1</span><span style="color:#009900;">]</span> -<span style="color:#cc66cc;">3.137585</span></pre>
</div>
</div>
<p>A drop in slugging average of over three standard errors suggests that something has either sapped hitters&#8217; power specifically or hurt their ability to hit in general. The results are consistent with both explanations.</p>
<p>This isn&#8217;t evidence of steroid use. In fact, the same results would be consistent with a shift toward pitching talent. More work needs to be done on this year&#8217;s data before conclusions can be drawn. However, it does seem to indicate that, at least in the American League, the Year of the Pitcher narrative has some statistical foundation.</p>
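<p>For a side-by-side view, the four checks can be recomputed in one pass. This is a recap sketch, not new analysis: the coefficients, observed 2010 values, and residual standard errors are copied from the printouts above, and the object names are mine.</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># Rows are models; columns are (Intercept), t, tsq, DH.
coefs &lt;- rbind(
  IBB = c(2.656e-01,  8.037e-03, -1.393e-04, -1.140e-01),
  uBB = c(3.0879505, -0.0190285,  0.0003623,  0.1812598),
  OBP = c(3.238e-01, -5.703e-04,  1.472e-05,  8.245e-03),
  SLG = c(3.937e-01, -2.058e-03,  5.049e-05,  1.693e-02)
)
x_2010   &lt;- c(1, 56, 56^2, 1)   # intercept, t, t squared, DH for 2010
observed &lt;- c(IBB = 0.2, uBB = 3.25 - 0.2, OBP = 0.327, SLG = 0.407)
rse      &lt;- c(IBB = 0.04689, uBB = 0.2441, OBP = 0.00743, SLG = 0.01489)

fitted &lt;- drop(coefs %*% x_2010)   # one fitted value per model
recap  &lt;- data.frame(fitted, observed,
                     std_resid = (observed - fitted) / rse)
round(recap, 3)</pre>
</div>
</div>
<p>The <code>std_resid</code> column reproduces the four ratios computed one at a time above: small for both kinds of walks, large and negative for OBP and slugging.</p>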
]]></content:encoded>
			<wfw:commentRss>http://tomflesher.com/2010/12/22/diagnosing-the-al/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4cc81c8ef60cdc1c146147aed58a6174?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Tom</media:title>
		</media:content>
	</item>
		<item>
		<title>What Happened to Home Runs This Year?</title>
		<link>http://tomflesher.com/2010/12/22/what-happened-to-home-runs-this-year/</link>
		<comments>http://tomflesher.com/2010/12/22/what-happened-to-home-runs-this-year/#comments</comments>
		<pubDate>Wed, 22 Dec 2010 17:18:46 +0000</pubDate>
		<dc:creator>tomflesher</dc:creator>
				<category><![CDATA[Baseball]]></category>
		<category><![CDATA[Economics]]></category>
		<category><![CDATA[baseball-reference.com]]></category>
		<category><![CDATA[forecasting]]></category>
		<category><![CDATA[home runs]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[regression]]></category>
		<category><![CDATA[standard error]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[time series]]></category>
		<category><![CDATA[Year of the Pitcher]]></category>

		<guid isPermaLink="false">http://tomflesher.com/?p=458</guid>
		<description><![CDATA[I was talking to Jim, the writer behind Apparently, I&#8217;m An Angels Fan, who&#8217;s gamely trying to learn baseball because he wants to be just like me. Jim wondered aloud how much the vaunted &#8220;Year of the Pitcher&#8221; has affected home run production. Sure enough, on checking the AL Batting Encyclopedia at Baseball-Reference.com, production dropped [...]]]></description>
				<content:encoded><![CDATA[<p>I was talking to Jim, the writer behind <a href="http://apparentlyanangelsfan.wordpress.com">Apparently, I&#8217;m An Angels Fan</a>, who&#8217;s gamely trying to learn baseball because he wants to be just like me. Jim wondered aloud how much the vaunted &#8220;Year of the Pitcher&#8221; has affected home run production. Sure enough, on checking the <a href="http://www.baseball-reference.com/leagues/AL/bat.shtml">AL Batting Encyclopedia</a> at <a href="http://www.baseball-reference.com">Baseball-Reference.com</a>, production dropped by about .16 home runs per game (from 1.13 to .97). Is that normal statistical variation or does it show that this year was really different?</p>
<p>In two previous posts, I <a title="Back when it was hard to hit 55…" href="http://worldsworstsportsblog.com/2010/07/08/back-when-it-was-hard-to-hit-55/">looked at the trend of home runs per game to examine Stuff Keith Hernandez Says</a> and then <a title="More on Home Runs Per Game" href="http://worldsworstsportsblog.com/2010/07/09/more-on-home-runs-per-game/">examined Japanese baseball&#8217;s data for evidence of a structural break</a>. I used the Batting Encyclopedia to run a time-series regression for a quadratic trend and added a dummy variable for the Designated Hitter. I found that the time trend and DH control account for approximately 56% of the variation in home runs per game, and that the functional form is</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Chat%7BHR%7D+%3D+.957+-+.0188+%5Ctimes+t+%2B+.0004+%5Ctimes+t%5E2+%2B+.0911++%5Ctimes+DH+&amp;bg=ffffff&amp;fg=666666&amp;s=0' alt='&#92;hat{HR} = .957 - .0188 &#92;times t + .0004 &#92;times t^2 + .0911  &#92;times DH ' title='&#92;hat{HR} = .957 - .0188 &#92;times t + .0004 &#92;times t^2 + .0911  &#92;times DH ' class='latex' /></p>
<p>with t=1 in 1955, t=2 in 1956, and so on. That means t=56 in 2010. Consequently, we&#8217;d expect home run production per game in 2010 in the American League to be approximately</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Chat%7BHR%7D+%3D+.957+-+.0188+%5Ctimes+56+%2B+.0004+%5Ctimes+3136+%2B+.0911+%5Capprox+1.25+&amp;bg=ffffff&amp;fg=666666&amp;s=0' alt='&#92;hat{HR} = .957 - .0188 &#92;times 56 + .0004 &#92;times 3136 + .0911 &#92;approx 1.25 ' title='&#92;hat{HR} = .957 - .0188 &#92;times 56 + .0004 &#92;times 3136 + .0911 &#92;approx 1.25 ' class='latex' /></p>
<p>That means we expected production to increase this year, and instead it dropped precipitously, for a residual of -.28. The residual standard error on the original regression was .1092 on 106 degrees of freedom, so the t-value using <a href="http://www.stat.tamu.edu/stat30x/zttables.php">Texas A&amp;M&#8217;s table</a> is 1.984 (approximating using 100 df). That means we can be 95% confident that the actual number of home runs per game should fall within .1092*1.984, or about .2167, of the expected value. The lower bound would be about 1.03, meaning we&#8217;re still significantly below what we&#8217;d expect. In fact, the observed number is about 3.4 standard errors below the expected number. In other words, we&#8217;d expect that to happen by chance less than .1% (that is, less than one tenth of one percent) of the time.</p>
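<p>The interval can also be computed directly in R, using <code>qt()</code> for the critical value at 106 degrees of freedom instead of reading a table at 100. This is a sketch that uses the coefficients as rounded in the equation above; the variable names are mine, and quantities computed from rounded coefficients can differ somewhat from ones taken from the full-precision fit.</p>
<div style="overflow:auto;">
<div class="geshifilter">
<pre class="r geshifilter-R" style="font-family:monospace;"># Coefficients as rounded in the equation: (Intercept), t, t^2, DH
hr_coefs  &lt;- c(0.957, -0.0188, 0.0004, 0.0911)
hr_fitted &lt;- sum(hr_coefs * c(1, 56, 56^2, 1))   # about 1.25

rse    &lt;- 0.1092                 # residual standard error quoted above
t_crit &lt;- qt(0.975, df = 106)    # two-sided 95% critical value
margin &lt;- t_crit * rse

c(fitted = hr_fitted,
  lower  = hr_fitted - margin,
  upper  = hr_fitted + margin)

hr_observed &lt;- 0.97
hr_observed &lt; hr_fitted - margin   # TRUE: observed is below the lower bound</pre>
</div>
</div>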
<p>Clearly, something else is in play.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomflesher.com/2010/12/22/what-happened-to-home-runs-this-year/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4cc81c8ef60cdc1c146147aed58a6174?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Tom</media:title>
		</media:content>
	</item>
	</channel>
</rss>
