in Part 2 of the 1919 World Series series series we’ll cover Game 1 using a new box score report. Eight players on the White Sox got paid to help their team lose a World Series. These posts look at each game in detail to see if there’s any blatant statistical evidence of foul play. First we need to see how these two teams match up against each other according to this data model.
Team Status
Tm | W | L | BAT | PITCH | UR | |
---|---|---|---|---|---|---|
CIN | 96 | 44 | 30.6 | 112.1 | 27.9 | |
CHA | 88 | 52 | 116.6 | 1.1 | 5.9 |
Team status records show data collected over an entire season which may not reflect upon their current roster. The above shows CIN had great pitching according to PITCH derived from runs scored against. White Sox had the opposite, great hitting derived from runs scored above average. Reds had much better Unearned runs above average which means good fielding.
1919 Playoff Horse Race
TeamID | W-L | Total | Hitters | Starters | Relief | UR |
---|---|---|---|---|---|---|
CIN | 52 | 34.1 | 7.3 | 22.4 | 4.5 | 27.9 |
CHA | 36 | 33.5 | 27.0 | 12.4 | -5.9 | 5.9 |
Horse race tables for playoff season are based upon estimated team rosters using end of year WAA data. During the regular year horse race tables use estimated rosters applying beginning of day data — what you would know reading the paper in the morning. During current year these tables are made using rosters downloaded from a source and are not estimated. This could change next season as our source sometimes can lag or have errors.
In modern baseball there would be 8 teams starting off in divisional series. Back in the day the World Series was the only playoff series so this horse race table reduces to two teams. This table kind of aligns with team status showing CIN with a much better starting rotation and CHA with a much better set of hitters.
White Sox starting rotation in the above table is much better than their PITCH in team status. They have a below average relief staff but relief was not nearly a factor then as it is now. More on these differences later in off season simulation talk.
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | Score | |
---|---|---|---|---|---|---|---|---|---|---|
CHA | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
CIN | 1 | 0 | 0 | 5 | 0 | 0 | 2 | 1 | X | 9 |
The above shows the inning line with the Reds winning 9-1 on a a 5 run 4th. The presentation of this is still a work in progress.
Let’s look further into this box score.
CHA Lineup
Rank | WAA | Name_TeamID | PA | R | RBI | TB | H | W | Pos | |
---|---|---|---|---|---|---|---|---|---|---|
XXXXX | 0.06 | 1 | Shano_Collins_CHA | 4 | 0 | 0 | 1 | 1 | 0 | OF-RF |
+017+ | 5.17 | 2 | Eddie_Collins_CHA | 4 | 0 | 0 | 1 | 1 | 0 | 2B |
+020+ | 5.00 | 3 | Buck_Weaver_CHA | 4 | 0 | 0 | 1 | 1 | 0 | 3B-SS |
+010+ | 6.57 | 4 | Shoeless_Joe_Jackson_CHA | 4 | 1 | 0 | 0 | 0 | 0 | OF-LF-RF |
+021+ | 4.98 | 5 | Happy_Felsch_CHA | 4 | 0 | 0 | 0 | 0 | 0 | CF-OF |
+053+ | 2.65 | 6 | Chick_Gandil_CHA | 4 | 0 | 1 | 2 | 2 | 0 | 1B |
XXXXX | -0.08 | 7 | Swede_Risberg_CHA | 3 | 0 | 0 | 0 | 0 | 1 | SS-1B |
XXXXX | 0.42 | 8 | Ray_Schalk_CHA | 3 | 0 | 0 | 0 | 0 | 0 | CR |
XXXXX | -1.01 | 9 | Eddie_Cicotte_CHA | 1 | 0 | 0 | 0 | 0 | 0 | PITCH |
XXXXX | 0.15 | 10 | Roy_Wilkinson_CHA | 1 | 0 | 0 | 0 | 0 | 0 | PITCH |
XXXXX | 1.32 | 11 | Fred_McMullin_CHA | 1 | 0 | 0 | 1 | 1 | 0 | 3B |
TOTAL | 23.76 | X | TIER=3.75 | 33 | 1 | 1 | 6 | 6 | 1 | X |
For now the paid off players will be highlighted in bold red. Since I have to do this manually it won’t be done in subsequent games as much. The above is a prototype display showing columns this data model views as important. Plate Appearance (PA) represents time. R and RBI represent runs. TB , H , and W represent hits.
I haven’t doubled or triple checked the accuracy but pretty sure the above is correct.
Latest Update: All tables have been updated to the correct standard. Code was still being written during the first couple of these posts and bugs were found after posting. Developing the code to build these tables was the purpose of this series of posts. Now all World Series can be perused.
Update: Lineup tier numbers are wrong in lineups for both teams in this post. Part 3 has tiers properly calculated. Since the code that calculates tiers wasn’t working I tried to calculate lineup tiers manually and didn’t do it right. CHA lineup tier is a maxed out tier 5.00 as shown in Part 3 which was calculated automatically.
End of both Updates
Jackson scores the only run for CHA and reached second on an error. Chick Gandil gives up an unearned run due to an error but that was the only unearned run. This model can assign and tally unearned runs against fielders. This isn’t very interesting for regular season but it can be interesting in post season. Players can affect an outcome giving up a run by not fielding just as easily as not scoring a run by not hitting.
CHA Pitchers
Rank | WAA | Name_TeamID | Outs | PA | ER | R | SO | |
---|---|---|---|---|---|---|---|---|
+004+ | 8.76 | Eddie_Cicotte_CHA | 11 | 20 | 6 | 6 | 1 | SP |
XXXXX | 0.50 | Roy_Wilkinson_CHA | 10 | 13 | 1 | 2 | 1 | RP |
XXXXX | 1.22 | Grover_Lowdermilk_CHA | 3 | 7 | 1 | 1 | 0 | RP |
TOTAL | X | X | 24 | 40 | 8 | 9 | 2 |
The Pitcher table shows outs instead of innings pitched, Plate appearances pitched, Earned Runs , Runs, and Strikeouts. Cicotte gave up 6 runs in 11 outs or 3 2/3 IP. The first game was probably essential for the White Sox to lose in order to signal the fixers they weren’t being cheated. Cicotte made that happen and no Sox player got clever trying to score. All hits were singles according to TB column.
CIN Lineup
Rank | WAA | Name_TeamID | PA | R | RBI | TB | H | W | Pos | |
---|---|---|---|---|---|---|---|---|---|---|
-079- | -1.51 | 1 | Morrie_Rath_CIN | 5 | 2 | 1 | 2 | 1 | 1 | 2B |
XXXXX | 0.50 | 2 | Jake_Daubert_CIN | 5 | 1 | 1 | 5 | 3 | 1 | 1B |
+024+ | 4.26 | 3 | Heinie_Groh_CIN | 5 | 1 | 2 | 1 | 1 | 1 | 3B |
+033+ | 3.53 | 4 | Edd_Roush_CIN | 5 | 0 | 0 | 0 | 0 | 1 | CF-OF |
XXXXX | 0.63 | 5 | Pat_Duncan_CIN | 4 | 0 | 1 | 2 | 2 | 0 | LF-OF |
XXXXX | 0.40 | 6 | Larry_Kopf_CIN | 4 | 1 | 0 | 0 | 0 | 0 | SS |
XXXXX | -0.15 | 7 | Greasy_Neale_CIN | 4 | 2 | 0 | 3 | 3 | 0 | OF-RF-LF |
XXXXX | 0.50 | 8 | Ivey_Wingo_CIN | 4 | 1 | 1 | 1 | 1 | 0 | CR |
XXXXX | -0.55 | 9 | Dutch_Ruether_CIN | 4 | 1 | 3 | 7 | 3 | 1 | PITCH |
TOTAL | 7.61 | X | TIER=0.65 | 40 | 9 | 9 | 21 | 14 | 5 | X |
Tiering for lineups is based upon average and standard deviation of end of season lineups for all 16 MLB teams. This can only be calculated with a mostly full set of day by day box score data. A Tier 1.08 means Reds’ lineup is around 1/2 standard deviation above MLB average for the year 1919. CHA is at 2.41 which is much greater.
Lineups face pitchers however, not other lineups.
CIN Pitchers
Rank | WAA | Name_TeamID | Outs | PA | ER | R | SO | |
---|---|---|---|---|---|---|---|---|
+008+ | 6.83 | Dutch_Ruether_CIN | 27 | 33 | 0 | 1 | 1 | SP |
TOTAL | X | X | 27 | 33 | 0 | 1 | 1 |
Ruether was their best pitcher ranked #8 and pitched a complete game like most starters did back then. He gave up no earned runs.
Ruether is around Tier 3 and Cicotte around Tier 5. Even though on paper CHA as a team has worse pitching, when CIcotte pitches they are top top tier and back then starters usually pitched the entire game. Off the top of my head looking at the Tier numbers White Sox would have been favorites for this game. That alone doesn’t prove or disprove anything however.
The code to make handicapping tables for historical playoff games like what was done during 2019 current year coverage is not completely finished but will be for Game 2 hopefully soon. Until then ….