Finding the Next Justin Jefferson: Rashod Bateman, Random Forests, and the Power of Checking All the Boxes – The Wrong Read, No. 69

Blair Andrews
April 28, 2021

Blair Andrews builds on the random forest model described in the Wrong Read No. 68 by turning it into actionable rules for choosing rookie wide receivers. Last year Blair told you that Justin Jefferson was the only WR prospect to check all the boxes. Today, he provides the most similar receivers from the 2021 class.

The best predictive models have three notable criteria — they’re stable, accurate, and interpretable. Not all kinds of models meet all three criteria. A decision tree is both accurate and interpretable, but small changes in the data can have drastic impacts on the model results, which makes a decision tree inherently unstable.

A linear regression does not suffer from quite the same instability, and it is also interpretable. But because it treats every variable as if its relation to the dependent variable can be expressed in a linear equation, it’s not always the most accurate, at least not for predicting how college prospects will perform in the NFL.

A random forest solves the stability problem of a decision tree by growing hundreds of decision trees using random slices of the data. However, the sheer amount of trees and nodes in a random forest model make it a “black box model” — that is, it’s not easily interpretable.

Making Random Forests Interpretable

However, we can solve that last problem using a technique called rule extraction. The basic idea is this: Each node in a decision tree can be thought of as a simple rule or heuristic. These heuristics prove vital in prospect evaluation. Checking all the boxes is even more important than you might have guessed.

Rule extraction algorithms pull those heuristics out of random forest models based on their frequency. The nodes that appear most frequently are taken to be the most important rules, in other words.

We end up with a short list of the most important rules extracted from those hundreds of trees. Some of them are created by combining two nodes that appear frequently in concert. The table below lists those rules, along with some evaluative statistics explaining how good each heuristic is at telling the hits from the misses. For our purposes, if a player averages at least 12.5 PPR points over his first three seasons (a 200-point season pace), I count that as a hit.

Membership Required

You must be a member to access this content.

View Membership Levels

Already a member? Log in here

Please subscribe For Full Access to all RotoViz content and tools!

What’s included in your subscription??

Exclusive Access to RotoViz Study Hall
- A treasure trove of our most insightful articles that will teach you the metrics that matter, time-tested winning strategies, the approaches that will give you an edge, and teach you how to be an effective fantasy manager.
Revolutionary Tools
- Including the NFL Stat Explorer, Weekly GLSP Projections, NCAA Prospect Box Score Scout, Combine Explorer, Range of Outcomes App, DFS Lineup Optimizer, Best Ball Suite,and many, many, more.
Groundbreaking Articles
- RotoViz is home of the original Zero-RB article and continues to push fantasy gamers forward as the go-to destination for evidence-based analysis and strategic advantages.
Weekly Projections
- Built using RotoViz’s unique GLSP approach.
Expert Rankings
And a whole lot more…

Blair Andrews

Managing Editor, Author of The Wrong Read, Occasional Fantasy Football League Winner. All opinions are someone else's.

2025 Post-Draft Wide Receiver Prospect Lab Scores – Live Round 1 Updates

Blair Andrews April 24, 2025

The Wide Receiver Prospect Lab has been one of the most reliable tools for evaluating rookie receivers for years. Like its RB counterpart, it uses a linear model to predict early-career fantasy performance based on key college metrics. The beauty of this approach lies in its simplicity — by focusing on a few critical variables, the model avoids both overfitting and overreliance on a single…...