<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-6773098339388257491</id><updated>2011-10-08T06:05:43.455-07:00</updated><title type='text'>AggPro: Aggregate Projection System</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>14</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-7895227938026892840</id><published>2011-01-09T11:45:00.001-08:00</published><updated>2011-01-09T11:47:22.568-08:00</updated><title type='text'>Swing Quality Implementation Assumptions</title><content type='html'>Cameron and I are sending a paper to the Sloan Sports Analytic Conference in Boston, Mass. We made several assumptions about some of the data that Cameron has nicely explained &lt;a href="http://www.cs.virginia.edu/~rjg7v/AssumptionsMadeInQualitySwingImpl.pdf"&gt;here&lt;/a&gt;. I needed a website to reference this document and thought the AggPro blog could use the extra 1 hit by the Sloan paper referees.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-7895227938026892840?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/7895227938026892840/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2011/01/swing-quality-implementation.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/7895227938026892840'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/7895227938026892840'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2011/01/swing-quality-implementation.html' title='Swing Quality Implementation Assumptions'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-3662080151214208690</id><published>2010-11-25T03:53:00.000-08:00</published><updated>2010-11-25T03:55:33.967-08:00</updated><title type='text'>Analysis of the hire of Orioles new hitting coach: Jim Presley</title><content type='html'>I have a &lt;a href="http://baltimoresportsreport.com/jim-presley-orioles-new-hitting-coach-by-the-stats-9836.html"&gt;piece up on Baltimore Sports Report&lt;/a&gt; which takes a look at how Jim Presley fits into the development of Adam Jones and Matt Wieters. Feel free to check it out or ignore.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-3662080151214208690?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/3662080151214208690/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2010/11/analysis-of-hire-of-orioles-new-hitting.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/3662080151214208690'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/3662080151214208690'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2010/11/analysis-of-hire-of-orioles-new-hitting.html' title='Analysis of the hire of Orioles new hitting coach: Jim Presley'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-7797385150089550810</id><published>2010-11-16T10:47:00.000-08:00</published><updated>2010-11-16T10:52:46.981-08:00</updated><title type='text'>Analysis of the hire of Orioles new pitching coach Mark Connor</title><content type='html'>I took a &lt;a href="http://baltimoresportsreport.com/mark-connor-orioles-new-pitching-coach-by-the-stats-9652.html#idc-container"&gt;closer look&lt;/a&gt; at the Orioles new pitching coach Mark Connor over at &lt;a href="http://baltimoresportsreport.com/"&gt;Baltimore Sports Report&lt;/a&gt;. Please mock, enjoy or ignore if you get a chance.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-7797385150089550810?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/7797385150089550810/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2010/11/analysis-of-hire-of-orioles-new.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/7797385150089550810'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/7797385150089550810'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2010/11/analysis-of-hire-of-orioles-new.html' title='Analysis of the hire of Orioles new pitching coach Mark Connor'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-2883029335885344457</id><published>2010-10-07T11:29:00.001-07:00</published><updated>2010-10-07T11:33:56.810-07:00</updated><title type='text'>Juan Samuel Piece on BSR</title><content type='html'>I wrote a &lt;a href="http://baltimoresportsreport.com/the-juan-samuel-era-retrospective-in-pictures-and-prose-9083.html"&gt;retrospective piece&lt;/a&gt; on BSR about the 2010 Juan Samuel Era in Baltimore. Its available &lt;a href="http://baltimoresportsreport.com/the-juan-samuel-era-retrospective-in-pictures-and-prose-9083.html"&gt;here&lt;/a&gt;. I tried to have fun with it but noticed at least one glaring typo afterwards. Regardless I enjoyed taking a break from the more technical writing I have previously done. Per usual, feel free to mock or ignore.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-2883029335885344457?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/2883029335885344457/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2010/10/juan-samuel-piece-on-bsr.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/2883029335885344457'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/2883029335885344457'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2010/10/juan-samuel-piece-on-bsr.html' title='Juan Samuel Piece on BSR'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-8724915595686797962</id><published>2010-09-15T14:53:00.000-07:00</published><updated>2010-09-15T15:00:05.642-07:00</updated><title type='text'>Is this the new and improved Chris Tillman?</title><content type='html'>I have a piece up on &lt;a href="www.baltimoresportsreport.com"&gt;Baltimore Sports Report&lt;/a&gt; about the Orioles' Chris Tillman. &lt;a href="http://baltimoresportsreport.com/is-this-the-new-and-improved-chris-tillman-8795.html"&gt;The article &lt;/a&gt;examines Tillman's two good starts since his Sept. 1 call up to see if his recent success is due to improved skills or good fortune experienced during a  small sample size.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-8724915595686797962?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/8724915595686797962/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2010/09/is-this-new-and-improved-chris-tillman.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/8724915595686797962'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/8724915595686797962'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2010/09/is-this-new-and-improved-chris-tillman.html' title='Is this the new and improved Chris Tillman?'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-4709339464255211402</id><published>2010-07-28T07:25:00.000-07:00</published><updated>2010-07-28T07:31:12.289-07:00</updated><title type='text'>2010 Orioles Starting Rotation - Part 2</title><content type='html'>&lt;a href="http://baltimoresportsreport.com/the-o%E2%80%99s-starting-rotation-in-the-year-of-the-pitcher-part-2-7724.html"&gt;Part 2&lt;/a&gt; of my examination of the Orioles starting rotation in pitcher friendly 2010 is up at &lt;a href="http://www.baltimoresportsreport.com"&gt;baltimoresportsreport.com&lt;/a&gt;. If you get a chance head on over and check it out.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-4709339464255211402?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/4709339464255211402/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2010/07/2010-orioles-starting-rotation-part-2.html#comment-form' title='11 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/4709339464255211402'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/4709339464255211402'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2010/07/2010-orioles-starting-rotation-part-2.html' title='2010 Orioles Starting Rotation - Part 2'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>11</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-3879208361930082131</id><published>2010-07-26T07:28:00.000-07:00</published><updated>2010-07-26T07:33:25.846-07:00</updated><title type='text'>The 2010 Orioles Starting Rotation</title><content type='html'>I've started writing over at &lt;a href="http://baltimoresportsreport.com/"&gt;BaltimoreSportsReport.com&lt;/a&gt;. My first &lt;a href="http://baltimoresportsreport.com/the-o%E2%80%99s-starting-rotation-in-the-year-of-the-pitcher-part-1-7671.html"&gt;article&lt;/a&gt; is a two part series on how the Orioles starting rotation has performed in the year of the pitcher. Its one of my first forays into non academic writing so bear with me. I promise it will get better.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-3879208361930082131?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/3879208361930082131/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2010/07/2010-orioles-starting-rotation.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/3879208361930082131'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/3879208361930082131'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2010/07/2010-orioles-starting-rotation.html' title='The 2010 Orioles Starting Rotation'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-5077723960945063744</id><published>2010-04-20T12:48:00.000-07:00</published><updated>2010-04-27T14:11:22.429-07:00</updated><title type='text'>Fantasy Valuation</title><content type='html'>Abstract— Currently when drafting fantasy players, managers are reliant upon the rankings as they exist in their platform (ex: Yahoo, ESPN, CBSSports), regardless of the league’s actual format or statistic categories.  Managers must then use a variety of other information to pre-rank players for a customized league.  As an additional analysis step to “AggPro: The Aggregate Projection System” we set out to find a way to rank hitters and pitchers across any league format, with a strict mathematical basis.  The following article proposes a valuation metric applicable to any Rotisserie style league.&lt;br /&gt;&lt;br /&gt;I. INTRODUCTION&lt;br /&gt;Pre-ranking fantasy players for an annual draft is an overwhelming task.  Managers  can consume a wealth of information, such as ESPN’s annual draft kit[1], Yahoo’s position primers[2], CBSSports’ fantasy depth charts[3], MockDraftCentral’s Average Draft Pick (ADP) data[4], Athlon Sports’ annual baseball preview magazine[5]  and countless other sources of fantasy material.  These resources provide analysis and rankings of players based on projected performance.  In addition, managers also consider the projection systems for the statistics themselves.  As discussed in “AggPro: The Aggregate Projection System”[6], there are several projection systems available, thus bestowing managers with the task of choosing which valuation system to follow.  In our development of the AggPro category weights, we expanded our research, creating a standard way to turn these projections into a single value for each player which can then be used to pre-rank players for a draft.&lt;br /&gt; There are three core problems with the existing valuation systems: 1) Ranking systems are inconsistent across platforms.  2) Projections themselves need to be balanced against valuation information.  3) Subjective projections (source data systems discussed in AggPro) should be objectively weighted for valuation.  Using AggPro to produce a more accurate 2010 projections, it is possible to evaluate all players objectively for any league format.  Our solution as discussed in this paper is also threefold: 1) Apply a valuation metric, based on standard deviation, to determine hitter and pitcher value in each projection category.  2) Determine the “overall” value by using the customized league scoring settings.  3) Apply ADP information to suggest draft value coupled with statistical value.&lt;br /&gt;With this valuation formula, managers can use any projections in tandem with the specific categories for the league, to objectively define how valuable players are for that league format.  These values when ordered descending produce a pre-ranked list of players, but not necessarily in the best order to draft them.  Leveraging ADP data in conjunction with these values will best prepare managers for draft day.  This analysis is not new; many have contributed content across the web with articles discussing this process.  The remainder of this paper focuses on how to 1) gather the information 2) calculate the valuation metric and 3) organize these values in conjunction with ADP data to have a more precise list of pre-ranked players for their specific league.&lt;br /&gt;&lt;br /&gt;II. COLLECTING PROJECTION DATA&lt;br /&gt;This research uses the AggPro projections, but any source can be used for this process, provided that it projects enough players for the draft and stats for each of the categories.  Some systems neglect to project ERA, but do provide ERs and IPs, so ERA can be easily calculated.  For more unique leagues that use categories such as fielding errors or grounded into double plays, finding a projection system that has this data may be more difficult.  The analysis uses the following 8x8 rotisserie categories:&lt;br /&gt;Hitters R HR RBI SB K Avg. OBP. SLG.&lt;br /&gt;Pitchers IP W L Sv K ERA WHIP K/BB&lt;br /&gt;Because this 8x8 league uses additional categories, the generic rankings that Yahoo! or ESPN provide do not provide a full picture of player value.  &lt;br /&gt;After finding a usable projection data set, store the values in a spreadsheet or database.  The next section will detail the additional mathematical steps to creating the valuation metric.  The valuation metric will change depending on the field of players considered, so it is best to create the valuation for all players regardless of league size.  For this analysis, the minimum benchmarks were hitters projecting to more than 200 at-bats and pitchers with at least 100 innings pitched or at least 4 saves recorded.  This provided ample data for most league formats, while also focusing on players who are fantasy relevant.&lt;br /&gt;&lt;br /&gt;III. DETERMINING EACH PLAYER/CATEGORY VALUE&lt;br /&gt;For each category, create an output column that calculates the following:&lt;br /&gt;Player’s Projected Stat - Average (All Players’ Projections)  &lt;br /&gt;Standard Deviation (All Players’ Projections)&lt;br /&gt;&lt;br /&gt;In Excel: =(&lt;cell&gt; - AVERAGE($&lt;column&gt;$1:$&lt;column&gt;$&lt;final row&gt;)) / STDEV($&lt;column&gt;$1:$&lt;column&gt;$&lt;final row&gt;)&lt;br /&gt;&lt;br /&gt;The result of calculating how far above or below a player’s stat is from the average, then compared to the standard deviation, provides a positive or negative distance from zero, which translates to that player’s value for that statistic.  The larger the value, the greater the player’s impact on that rotisserie category.  Next, create this stat value for each player and stat category in the league.  Summing these creates a single value that translates into a player’s value in the league.  ESPN’s top five 2010 ranked players are shown below[7]:&lt;br /&gt;Projections:&lt;br /&gt;Player R HR RBI K SB AVG OBP SLG&lt;br /&gt;Albert Pujols 114 43 122 62 11 0.337 0.452 0.647&lt;br /&gt;Hanley Ramirez 113 29 91 106 34 0.324 0.4 0.546&lt;br /&gt;Alex Rodriguez 110 39 125 124 21 0.3 0.402 0.563&lt;br /&gt;Ryan Braun 110 37 116 125 17 0.315 0.38 0.576&lt;br /&gt;Chase Utley 114 30 97 111 18 0.29 0.396 0.52&lt;br /&gt;&lt;br /&gt;Values:&lt;br /&gt;Player Run Val HR Val RBI Val K Val SB Val Avg Val OBP Val SLG Val&lt;br /&gt;Albert Pujols 1.95 2.20 1.98 1.32 -0.17 2.82 3.75 3.37&lt;br /&gt;Hanley Ramirez 1.88 0.71 0.41 -0.12 1.61 2.09 1.57 1.31&lt;br /&gt;Alex Rodriguez 1.67 1.78 2.13 -0.70 0.60 0.75 1.66 1.66&lt;br /&gt;Ryan Braun 1.67 1.56 1.67 -0.74 0.29 1.59 0.74 1.92&lt;br /&gt;Chase Utley 1.95 0.82 0.71 -0.28 0.37 0.18 1.41 0.78&lt;br /&gt;&lt;br /&gt;Totals:&lt;br /&gt;Player 8x8 Value 5x5 Value&lt;br /&gt;Albert Pujols 17.22 8.78&lt;br /&gt;Hanley Ramirez 9.47 6.70&lt;br /&gt;Alex Rodriguez 9.54 6.93&lt;br /&gt;Ryan Braun 8.71 6.79&lt;br /&gt;Chase Utley 5.94 4.03&lt;br /&gt;&lt;br /&gt;In looking at the resulting data set, for the 8x8 (and a standard 5x5) league, Alex Rodriguez is slightly better than Hanley Ramirez, but four other players actually come in higher than Utley (data not shown).  The most notable finding is that in the 8x8, Albert Pujols is almost twice as valuable as the next best player.  &lt;br /&gt;&lt;br /&gt;IV. RANKING THE VALUE&lt;br /&gt;When evaluating this valuation process, we found that value of the projections varies in accuracy depending on 1) size of the dataset, 2) degree of range between projections for the same stat, 3) a source system’s favoring of a certain statistic.  While these factors impacted comparing the calculated value, ranking players 1 to n based on this value proved to be more accurate than any system in any year.  The resulting new column of data represents the order of value, which is can now be used in conjunction with the ADP data to provide the final pre-ranked list.&lt;br /&gt; We also chose to add a “rank round” column which is simply the round in which that player should go according to the rank of the value.  The formula to calculate a round based on some rank:&lt;br /&gt;&lt;br /&gt;((Rank – (Rank MOD  x )) / x ) + 1&lt;br /&gt;In Excel: =(&lt;cell&gt;-MOD(&lt;cell&gt;,  x ))/ x +1&lt;br /&gt;&lt;br /&gt;In the above,  x represents the number of teams drafting.&lt;br /&gt;&lt;br /&gt;V. ADDING IN ADP&lt;br /&gt;Like projection data, ADP data is readily available online from a variety of sources.  For our research we combined the 2010 ADP from Yahoo[8], ESPN[9], and KFFL[10] to obtain an average ADP.  This ADP data is most useful when comparing a ranked value.  To compare the ranked rounds and the ADP rounds, calculate the ADP round using the same formula as in section IV.  With the addition of this information, the full data set output includes seven columns:&lt;br /&gt;Player Position Value Rank Rank Round ADP ADP Round&lt;br /&gt;&lt;br /&gt;With this dataset, it is then easy to highlight players with a net difference in projected rank round to ADP round &gt; 1.  Those with a positive difference are good values for drafting (have a value higher than that of the ADP).  Those with a negative difference are not as valuable (are being drafter sooner in ADP than their value projects).&lt;br /&gt; With this indicator in place, it is thus easier to rank players in an order based on 1) his value to the league 2) how soon he will commonly be drafted in a league.  For our 2010 data, it’s obvious that Albert Pujols and Ryan Braun will go in the first round.  However, the interesting comparisons come in for those that meet the criteria above:&lt;br /&gt;Player Position Value Rank Rank Round ADP ADP Round&lt;br /&gt;Haren SP 7.85 9 1 50.2 6&lt;br /&gt;Sandoval 3B/1B 5.415 18 2 33 4&lt;br /&gt;Kemp OF 4.331 26 3 8 1&lt;br /&gt;Roberts 2B 0.911 81 9 43 5&lt;br /&gt;&lt;br /&gt;This table shows that Dan Haren and Pablo Sandoval are good draft values, especially at their ADP levels, but Matt Kemp and Brian Roberts are commonly taken before their value dictates.  Obviously, this is due to the custom league format.  In a standard 5x5, this data value data will be different (stolen bases are not as valuable in an 8x8).  However, this difference is the purpose of research, to break down the barriers of generic value rankings and create a more quantitatively sound and accurate valuation applicable to any league.&lt;br /&gt;&lt;br /&gt;VI. LIMITATIONS&lt;br /&gt;There are a few important points to make regarding limitations of the process.  This process does not consider health concerns, supporting teammates, or potential upside/”breakout” players.  Such measures are usually applied to the projection data source.  This process does not consider position scarcity.  Other systems, in which analysts rank players manually, likely take this into account.  Admittedly, this is likely a factor as to why Utley is ranked 4th in section III but projects slightly lower.  This process does not necessarily dictate a manager’s draft.  The use of this indicator still requires subjective choices.  For example, managers should not draft Dan Haren 9th overall because he projects that high in terms of value.  However, using this process, a manager can make an educated decision in drafting him in the fifth round, just before most draft him in the sixth.  Thus, that manager gets first round value in the fifth round.  Likewise, waiting until the ninth round to select Brian Roberts will result in him not being available.  Managers may choose to draft him sooner because of position scarcity, this research provides an indicator of what his true value is in this league format, rather than assuming it is high because common platforms say so.  Even though the process thus far has focused on custom formats, it can be repeated for a standard  5x5 league to provide a sanity check against the default rankings of a drafting platform.  &lt;br /&gt;&lt;br /&gt;VII. CONCLUSION&lt;br /&gt;By starting with a set of projection data, managers can evaluate players based on the statistics relevant to any custom league using a series of valuation metrics.  Ranking players according to a common value metric provides useful insight for fantasy drafts.  This ranking is further enhanced by including a check against ADP data to highlight players who project above or below their ADP.&lt;br /&gt;&lt;br /&gt;VIII. REFERENCES&lt;br /&gt;[1] http://games.espn.go.com/frontpage/flbdraftkit.&lt;br /&gt;[2] http://sports.yahoo.com/fantasy/mlb/news?slug=be-posprimer08-rp&lt;br /&gt;[3] http://cbssports.com/mlb/depth-charts&lt;br /&gt;[4] http://www.mockdraftcentral.com/report_adp.jsp?period=0&amp;sport=1&amp;type=218&amp;color=1&lt;br /&gt;[5] http://www.athlonsports.com/store/index.php?cpath=33_274&lt;br /&gt;[6] http://aggpro.blogspot.com&lt;br /&gt;[7] At time of authoring, AggPro data for 2010 was not yet available.  ESPN projections available:  Valuation calculations may differ due to size of sample set.&lt;br /&gt;[8] http://baseball.fantasysports.yahoo.com/b1/&lt;leagueid&gt;/draftanalysis&lt;br /&gt;[9] http://games.espn.go.com/flb/livedraftresults&lt;br /&gt;[10]http://www.kffl.com/static/programs/baseball/compilations/mlb_adp_report.php&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-5077723960945063744?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/5077723960945063744/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2010/04/fantasy-valuation.html#comment-form' title='22 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/5077723960945063744'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/5077723960945063744'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2010/04/fantasy-valuation.html' title='Fantasy Valuation'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>22</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-2786081433333820845</id><published>2010-03-08T18:08:00.000-08:00</published><updated>2010-03-09T18:05:10.994-08:00</updated><title type='text'>2010 AggPro Weights</title><content type='html'>For those of you playing at home, your AggPro weights for 2010 are as follows:&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"   style="  border-collapse: collapse; font-family:arial, sans-serif;font-size:13px;"&gt;&lt;div&gt;Bill James = .4266&lt;/div&gt;&lt;div&gt;CHONE = 0.0&lt;/div&gt;&lt;div&gt;Marcel = .2851&lt;/div&gt;&lt;div&gt;PECOTA = .2833&lt;/div&gt;ZiPS = 0.0&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"   style="font-family:arial, sans-serif;font-size:100%;"&gt;&lt;span class="Apple-style-span"  style="border-collapse: collapse; font-size:13px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"   style="font-family:arial, sans-serif;font-size:100%;"&gt;&lt;span class="Apple-style-span"  style="border-collapse: collapse; font-size:13px;"&gt;&lt;span class="Apple-style-span"   style="border-collapse: separate;   font-family:Georgia, serif;font-size:16px;"&gt;To review, in 2009 the weights were:&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"   style="  border-collapse: collapse; font-family:arial, sans-serif;font-size:13px;"&gt;&lt;div&gt;Bill James = .37&lt;/div&gt;&lt;div&gt;CHONE = 0.0&lt;/div&gt;&lt;div&gt;Marcel = .35&lt;/div&gt;&lt;div&gt;PECOTA = .28&lt;/div&gt;ZiPS = 0.0&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"   style="font-family:arial, sans-serif;font-size:100%;"&gt;&lt;span class="Apple-style-span"  style="border-collapse: collapse; font-size:13px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"   style="font-family:arial, sans-serif;font-size:100%;"&gt;&lt;span class="Apple-style-span"  style="border-collapse: collapse; font-size:13px;"&gt;&lt;span class="Apple-style-span"   style="border-collapse: separate;   font-family:Georgia, serif;font-size:16px;"&gt;And in 2008 the weights were:&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"   style="  border-collapse: collapse; font-family:arial, sans-serif;font-size:13px;"&gt;&lt;div&gt;Bill James = .56&lt;/div&gt;&lt;div&gt;CHONE = 0.0&lt;/div&gt;&lt;div&gt;Marcel = .15&lt;/div&gt;&lt;div&gt;PECOTA = .29&lt;/div&gt;ZiPS = 0.0&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"   style="font-family:arial, sans-serif;font-size:100%;"&gt;&lt;span class="Apple-style-span"  style="border-collapse: collapse; font-size:13px;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;We adapted a polynomial time &lt;a href="http://www.mathworks.com/access/helpdesk/help/toolbox/optim/ug/lsqlin.html"&gt;matrix least-squares minimization algorithm&lt;/a&gt; to solve the AggPro problem this year instead of simply performing a brute force search across the search space. The new algorithm has allowed for more precise weight sets (4 digits vs. 2 digits) and a two order of magnitude improvement in speed (seconds vs. hours).  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We are experimenting with finer grained weight sets this year. For example, instead of only allowing one weight for all the categories what if we allowed a separate weight set for each category? for each player? for each category for each player? Our instinct is that the most accurate results will come from allowing each system to have a separate weight set for each category for the upcoming year but at this point we don't have any empirical evidence to support it. Results will be posted here as they come in.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-2786081433333820845?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/2786081433333820845/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2010/03/2010-aggpro-weights.html#comment-form' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/2786081433333820845'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/2786081433333820845'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2010/03/2010-aggpro-weights.html' title='2010 AggPro Weights'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-6092094598400457808</id><published>2009-07-30T10:11:00.000-07:00</published><updated>2009-07-30T10:16:37.590-07:00</updated><title type='text'>Official Writeup of AggPro</title><content type='html'>Tomorrow Cameron and I will be presenting AggPro at &lt;a href="http://convention.sabr.org/"&gt;SABR 39&lt;/a&gt;. The official writeup of AggPro complete with evaluation of the AggPro projections for the 2006, 2007, 2008 and 2009 MLB seasons is availalbe &lt;a href="http://www.cs.virginia.edu/~rjg7v/AggPro.pdf"&gt;here&lt;/a&gt;. The slides we are going to present are available &lt;a href="http://www.cs.virginia.edu/~rjg7v/aggpro.ppt"&gt;here&lt;/a&gt;. AggPro has been a fun project but like most of my other research once I hit the final revision stage of the writeup I really start to HATE the project.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-6092094598400457808?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/6092094598400457808/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2009/07/official-writeup-of-aggpro.html#comment-form' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/6092094598400457808'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/6092094598400457808'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2009/07/official-writeup-of-aggpro.html' title='Official Writeup of AggPro'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-959087957572321167</id><published>2009-07-01T17:45:00.000-07:00</published><updated>2009-07-01T19:01:42.598-07:00</updated><title type='text'>AggPro Final Results</title><content type='html'>The final AggPro results are listed in the table below. They are significantly better than I expected. In each year AggPro is more accurate than any of the constituent systems. Furthermore, each year it more accurate than a theoretical projection system that would be composed of the best constituent projection for each statistical category. Finally, AggPro's projection in every category for every year is more accurate than the any constituent system's projection for a given category in a given year except for one category in one year. (&lt;i&gt;As of June 29th PECOTA's projections for this year (2009) for triples are less than 1% more accurate than AggPro's 2009 projections for triples.) &lt;/i&gt;These observations seem to show that these different constituent projection systems work in a complementary nature (i.e. the overall aggregate projection is more than just the sum of its best parts.) &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-style: normal;"&gt;Recall, the AggPro projections were formed by determining the weights for the constituent systems that minimized the standard error from the actual Major League Baseball data for 2007 and 2008. We were able to evaluate the AggPro projections for 2006 because AggPro does not weight the CHONE projections at all. The CHONE projections were the only constituent projections missing for 2006.&lt;/span&gt;&lt;div&gt;&lt;div&gt;&lt;i&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;table border="1" style="text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;&lt;b&gt;Year&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;AggPro Improvement over BCPS&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;AggPro Improvement over TPS&lt;/b&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;&lt;b&gt;2006&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;5.25%&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;5.00%&lt;/b&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;&lt;b&gt;2007&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;4.25%&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;2.65%&lt;/b&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;&lt;b&gt;2008&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;5.90%&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;4.60%&lt;/b&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;&lt;b&gt;2009 (through 06/29/09)&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;2.4%&lt;/b&gt;&lt;/td&gt;&lt;td&gt;&lt;b&gt;2.0%&lt;/b&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;div style="text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;/i&gt;&lt;div style="text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;&lt;span class="Apple-style-span" style="font-weight: bold; "&gt;Table Legend: BCPS = Best Constituent Projection System; TPS = Theoretical Projection System&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/i&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-959087957572321167?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/959087957572321167/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2009/07/aggpro-final-results.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/959087957572321167'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/959087957572321167'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2009/07/aggpro-final-results.html' title='AggPro Final Results'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-7494685135691357404</id><published>2009-06-29T18:23:00.001-07:00</published><updated>2009-06-30T03:49:27.401-07:00</updated><title type='text'>AggPro Early Results</title><content type='html'>I will be finishing the AggPro Projection Research this week. Most of the work has been completed and the results look good. The weights for all constituent projection systems (Bill James, CHONE, Marcel, PECOTA and ZiPS) are as follows: 0.56 = Bill James, 0.00 = CHONE, 0.21 = Marcel, 0.23 = PECOTA and 0.00 = ZiPS. Recall, these are the weights that when applied to the respective constituent projection systems for the years 2007 and 2008 mimize the standard error of the resulting aggregate projection from the actual 2007 and 2008 Major League Baseball data. &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;How well did we do?&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;The previous post described how AggPro would be measured. In short, AggPro will be considered successful if 1) the resulting aggregate projections are more accurate than any of the constituent projections for 2007, 2008, and 2009 and 2) if AggPro projections for each statistical category, for each year are more accurate than any constituent projection for the given category in the given year. The first evaluation criterion is almost trivial. Obviously, AggPro can be as accurate as the most accurate constituent projection system simply by giving a weight of 1.00 to the most accurate projeciton system. I was slightly skeptical if AggPro would be able to meet the second measure of success. It seemed possible a constituent projection system might be very accurate for predicting a statistic in a single statistical year but might mispredict almost every other category in that year or other years. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;It seems as if AggPro has met this measures of success.  The AggPro projections are more accurate in every statistical category for 2007 and 2008 MLB seasons than any of the constituent projection systems. In 2007 the AggPro projections are a 4.25% improvement over the leading constituent projection system (Bill James). In 2008 the AggPro projections are a 5.9% improvement over the leading constituent projection system (again Bill James). Furthermore, if one was to form a theoretical projection system by taking the most accurate constituent projection for a statistical category each year AggPro would still be more accurate in 2007 and 2008. This system is described in the AggPro baselines table in the previous post. In 2007 AggPro would be a 2.65% improvement over the theoretical system. In 2008 AggPro would be a 4.6% improvement over the theoretical system.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt; I have several theories as to why the Bill James predictions were so accurate (given AggPro's measures of accuracy) for these years that I may go into in a seperate post. I'll be doing a pro-rated evaluation for the 2009 season later this week. This will be a true test of AggPro's accuracy as the optimization which chose the weights did not consider any 2009 data.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-7494685135691357404?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/7494685135691357404/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2009/06/aggpro-early-results.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/7494685135691357404'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/7494685135691357404'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2009/06/aggpro-early-results.html' title='AggPro Early Results'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-4140040554857058970</id><published>2009-06-15T10:34:00.000-07:00</published><updated>2009-07-01T19:03:23.742-07:00</updated><title type='text'>Projected Categories, Players and Measures of Success</title><content type='html'>Cameron and I have spent the last several weeks acquiring and formatting the projections from Bill James, CHONE, Marcel, PECOTA and ZiPS. We are going to be using the 2007 and 2008 projections from these systems. I will refer to these projections as the &lt;i&gt;constituent projections&lt;/i&gt;. Originally, we had planned on using the 2006 projections from these systems too, but as Chone Smith nicely explained in &lt;a href="http://aggpro.blogspot.com/2009/05/welcome-to-aggpro.html#comments"&gt;this&lt;/a&gt; comment the 2006 projections from CHONE were more experimental than anything else and probably would not be particurally useful. Since we need projections for all systems for all the years that we include in the AggPro analysis we had to eliminate the 2006 projections.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Projected Categories&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We will be using the following player performance categories:&lt;/div&gt;&lt;div&gt;Hitters: At Bats, Hits, Runs, Doubles, Triples, Home Runs, RBIs, Strikeouts, Walks and Stolen Bases. &lt;/div&gt;&lt;div&gt;Pitchers: Innings Pitched, Earned Runs, Strikeouts, Walks and Hits. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;This is the set of hitter and pitcher performance categories that is common to all the constituent projection systems.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Projected Players&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The following lists contain the players AggPro included in the AggPro analysis: &lt;a href="http://www.cs.virginia.edu/~rjg7v/2007PlayersUsed.txt"&gt;2007&lt;/a&gt;, &lt;a href="http://www.cs.virginia.edu/~rjg7v/2008PlayersUsed.txt"&gt;2008&lt;/a&gt;. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The player list for a given year is the list of players that are common to all the constituent projection systems for the year.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;AggPro Measures of Success&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I also have determined the standard error of the 2007 and 2008 constituent projections from the actual Major League Baseball player performance data in each of the categories. The standard error, as a percentage of the actual population of each category, for each system, is listed below. Double click on the table to enlarge it.&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_tJWtajT3WzA/SjaWueO3hhI/AAAAAAAAAsM/ZLh6oYGvehQ/s1600-h/website_Variance.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 218px;" src="http://1.bp.blogspot.com/_tJWtajT3WzA/SjaWueO3hhI/AAAAAAAAAsM/ZLh6oYGvehQ/s400/website_Variance.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5347627332682614290" /&gt;&lt;/a&gt;AggPro will be successful if it identifies a single weight to apply to each projection system such that the resulting AggPro projections have less standard error for a given statistical category in a given year than the best constituent system projection for that category, for that year. For example, each category in the AggPro projections for 2007 must have less standard error than corresponding value for the category identified in the righthand most column in the top half of the table. Based on the early results from the simulated annealing optimization it looks like an AggPro cocktail consisting of 2 parts Bill James, 1 part Marcel and 1 part PECOTA comes very close to meeting this goal. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-4140040554857058970?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/4140040554857058970/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2009/06/projected-categories-players-and.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/4140040554857058970'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/4140040554857058970'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2009/06/projected-categories-players-and.html' title='Projected Categories, Players and Measures of Success'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_tJWtajT3WzA/SjaWueO3hhI/AAAAAAAAAsM/ZLh6oYGvehQ/s72-c/website_Variance.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-6773098339388257491.post-7205290562707643089</id><published>2009-05-22T09:01:00.000-07:00</published><updated>2009-07-01T19:07:03.280-07:00</updated><title type='text'>Welcome to AggPro</title><content type='html'>Welcome to the home of AggPro: The Aggregate Projection System. I have tried to anticipate commonly asked questions about AggPro and provide answers to them below. As AggPro is developed, status updates and results will be posted on this blog. If you have any questions regarding AggPro feel free to &lt;a href="http://www.cs.virginia.edu/~rjg7v"&gt;contact me&lt;/a&gt;. &lt;a href="http://www.linkedin.com/pub/cameron-snapp/3/b19/991"&gt;Cameron Snapp&lt;/a&gt; will be assisting me with the development of AggPro.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;What is AggPro?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;AggPro is a proposed Major League Baseball (MLB) player projection system. Currently there exist many different projection systems that predict the performance of MLB players in a variety of different statistical categories. The goal of AggPro is to aggregate the MLB player predictions from the existing systems into a single more accurate prediction. AggPro will use the following projection systems, &lt;a href="http://bis-store.stores.yahoo.net/index.html"&gt;Bill James Handbook&lt;/a&gt;, &lt;a href="http://www.baseballprojection.com/"&gt;CHONE&lt;/a&gt;, &lt;a href="http://www.tangotiger.net/marcel/"&gt;Marcel&lt;/a&gt;, &lt;a href="http://www.baseballprospectus.com/"&gt;PECOTA&lt;/a&gt; and &lt;a href="http://www.baseballthinkfactory.org/"&gt;ZiPS,&lt;/a&gt; for the years 2006, 2007, and 2008 to form the aggregated projections.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Why do we need yet another projection system?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;It is important to note that AggPro is not just another projection system. Instead it is a methodology for aggregating effective projections from different systems into a single more accurate projection. That said, we probably don't need another projection system. It appears we are reaching the limit of the accuracy that can be expected from projection systems. While AggPro may improve the accuracy of existing systems I do not expect it to improve the state of the art significantly. However, &lt;a href="http://baseballanalysts.com/archives/2009/02/2009_projection.php"&gt;Greg Rybarczyk&lt;/a&gt; believes paradigm shifts that will improve the accuracy of projection systems are on the horizon. If paradigm shifting projection systems are developed, the AggPro methodology will be applicable to improve these systems as well.&lt;br /&gt;&lt;br /&gt;My interest in aggregating predictions from effective projection systems to form a better prediction stems from &lt;a href="http://www.research.att.com/~volinsky/netflix/"&gt;BellKor&lt;/a&gt;, the leading solution to the &lt;a href="http://en.wikipedia.org/wiki/Netflix_Prize"&gt;Netflix Prize&lt;/a&gt;. In October, 2006 Netflix released a dataset of anonymous movie ratings and challenged researchers to develop systems that could beat the accuracy of its recommendation system, Cinematch. A grand prize, known as the Netflix Prize, of $1,000,000 will be awarded to the first system to beat Cinematch by 10%. The BellKor prediction system, with 8.26% improvement over Cinematch, is the leading solution. BellKor employs 107 different models of varying approaches and uses mathematical optimization methods to weight each prediction. From the weighted predictions of the 107 models BellKor forms its aggregate recommendation. I was curious how well this strategy would work when applied to MLB player projection systems, thus AggPro was born.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;How will AggPro work and how will it be evaluated?&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;AggPro will employ a combination of mathematical optimization methods including &lt;a href="http://en.wikipedia.org/wiki/Hill_climbing"&gt;Hill climbing&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/Genetic_Algorithms"&gt;Genetic algorithms&lt;/a&gt;, and &lt;a href="http://en.wikipedia.org/wiki/Simulated_Annealing"&gt;Simulated annealing&lt;/a&gt; to determine the weight for each projected statistic in each system that yields the AggPro projections with the least root mean square error (RMSE) from the MLB players’ actual performance in the 2006, 2007, and 2008 seasons. Using the same weights AggPro will provide projections for the 2009 MLB season.&lt;br /&gt;&lt;br /&gt;All the  projection systems (AggPro, &lt;a href="http://bis-store.stores.yahoo.net/index.html"&gt;Bill James Handbook&lt;/a&gt;, &lt;a href="http://www.baseballprojection.com/"&gt;CHONE&lt;/a&gt;, &lt;a href="http://www.tangotiger.net/marcel/"&gt;Marcel&lt;/a&gt;, &lt;a href="http://www.baseballprospectus.com/"&gt;PECOTA&lt;/a&gt; and &lt;a href="http://www.baseballthinkfactory.org/"&gt;ZiPS&lt;/a&gt;) will be evaluated against the root mean square error from the actual player performance for the 2006, 2007, 2008 and 2009 MLB seasons.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;When will AggPro be presented?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;AggPro has been selected to be presented at the 2009 Society for American Baseball Research Convention (&lt;a href="http://convention.sabr.org/"&gt;SABR 39&lt;/a&gt;) in Washington, D.C. on July 29th - August 2nd. The abstract proposing AggPro is available &lt;a href="http://www.cs.virginia.edu/~rjg7v/aggpro.pdf"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6773098339388257491-7205290562707643089?l=aggpro.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://aggpro.blogspot.com/feeds/7205290562707643089/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://aggpro.blogspot.com/2009/05/welcome-to-aggpro.html#comment-form' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/7205290562707643089'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6773098339388257491/posts/default/7205290562707643089'/><link rel='alternate' type='text/html' href='http://aggpro.blogspot.com/2009/05/welcome-to-aggpro.html' title='Welcome to AggPro'/><author><name>Ross</name><uri>http://www.blogger.com/profile/17359824128401198785</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>3</thr:total></entry></feed>
