|
I've been following along with Python using the latest rush data and it's been fun to poke this stuff, and great for working out how to do similar analyses in Python. EDIT: David Johnson, in red EDIT 2: OH MY GOD TODD GURLEY EDIT #3: Here's the direction I'm thinking of exploring: run direction!. Stay tuned! Ghost of Reagan Past fucked around with this message at 04:31 on Feb 13, 2017 |
# ¿ Feb 12, 2017 04:30 |
|
|
# ¿ May 8, 2024 09:17 |
|
Here's some stuff on run direction. First up, the Executive Summary! 1. Running behind the right guard is the worst direction to run. 2. The best direction to run is outside to the left. 3. Most runs in the NFL are in the middle, which is actually surprisingly effective. Anyway, let's dig in. The Average Running Back Here's the average running back's run direction distribution. We can glean a few things from this. First, inside runs dominate. Second, see those dips behind the guards? These runs, as we'll see, are less successful than every other kind of run, so presumably NFL teams understand this. But the odder thing is the dropoff on outside runs. These are actually pretty successful, averaging more than 4.3 ypc--but they fail more often than other runs. Is this a smart strategy by coaches, or are they being too conservative? Here's the average yardage for each direction. Now, how do we want to measure 'run failure'? Let's consider a run below 2 yards to be a failure--ignoring yards to go, of course, which would make some runs of 2 yards or less be successes. This is just to help us get a grip on what we're looking at here. This is super interesting. Green is a successful run, blue is a failed run. This is the average yardage for each success and each direction. Note that the failures for the left and right outside runs are pretty big! This may explain the conservatism above. What proportion of runs for each type are failed runs, though? Do outside runs fail more often than inside runs? Cross-tabbed and normalized, as well: code:
Comparing Running Backs So here are some comparison charts between backs. Adrian Peterson Darren Sproles Todd Gurley Noted Laughingstock Trent Richardson David Johnson (this is loving weird man) Stay tuned for better success metrics and random forests. I can make you charts of any backs you'd like. Code will eventually be up somewhere once I figure out where to drop Jupyter notebooks. Ghost of Reagan Past fucked around with this message at 21:29 on Feb 26, 2017 |
# ¿ Feb 26, 2017 21:01 |
|
pmchem posted:I have a NFL data science question and this seems like the most appropriate place. Let's talk raw data sources. Ground Control uses nfldb. That data is likely available on a source like Pro Football Reference but I can't be 100% sure. It wouldn't be easily importable but you should be able to get it if you're dedicated. This is actually the hardest part of doing data science. Ghost of Reagan Past fucked around with this message at 14:31 on Aug 6, 2017 |
# ¿ Aug 6, 2017 14:28 |