Data Guru Ian Graham on the analytics behind Liverpoolâs title win & Salah and VVDâs new contracts.
Liverpoolâs original data analytics wizard, Ian Graham, joins Rog for an in-depth discussion on how Liverpool used data analytics to build a Premier League and Champions League winning team. Ian also discusses the boom in analytics in football and talks about the future of data analytics in the sport and why some teams are still hesitant to embrace it.
I was reading some background on vector databases last week. On the face of it, it doesnât look radically different to things I was doing 40 years ago (at least mathematically) but I was curious how these sorts of things can be applied to football.
For example, is it possible to spot a player that is underperforming because they are being played out of position but might fit in elsewhere?
I can recall several occasions when Klopp said that he had âtweakedâ his tactics to suit one of his players.
Youâre absolutely right, mathematically, the core concepts behind vector databases and embeddings (like measuring similarity in high-dimensional space) are not new. What is new is the scale, speed, and flexibility with which we can now compute and store this information, especially thanks to advances in machine learning, vector search, and high-performance storage systems.
Now, to your core question: Can these ideas be applied to football to spot misused players or optimize their positioning?
Using Vectors in Football Analytics
At the heart of modern football analytics is player representationâturning a playerâs behavior, performance, or style into a vector (i.e., a list of numbers). Once you do that, you can:
⢠Compare players using distance/similarity measures.
⢠Cluster players into roles or styles.
⢠Predict compatibility with systems or positions.
- Player Embeddings
You can embed players based on various data types:
⢠Positional data: heatmaps, movement patterns, average location per phase.
⢠Event data: passes, tackles, dribbles, pressures, etc.
⢠Tracking data: acceleration, spacing, off-ball runs.
⢠Tactical fit: how a player behaves in certain formations/phases.
These embeddings can be compared using cosine similarity or nearest neighbor search via a vector database like FAISS or Pinecone.
Practical Application: Positional Misuse
Letâs say you have a winger whoâs underperforming. Hereâs how embeddings could help:
Step 1: Build a Vector Profile
Collect in-game data for the player and represent their play style as a vector.
Step 2: Search for Similar Vectors
Use a vector database to find players with similar vectors but in different positionsâmaybe theyâre all inside forwards or attacking midfielders.
Step 3: Tactical Matching
Cross-reference the playerâs vector with tactical setups (e.g. Kloppâs 4-3-3 vs 4-2-3-1) to see which system styles historically matched players like him.
Step 4: Recommend Role Shift
If embedding similarities point strongly toward a different role, it gives evidence to suggest trying the player elsewhere.
Kloppâs Tactical âTweaksâ
When Klopp says he âtweakedâ his system, he might be:
⢠Moving a player inside/outside
⢠Shifting pressing triggers
⢠Adjusting build-up shape to allow someone to receive more comfortably
Vector models could be used to simulate these tweaks:
⢠You run a simulation of âif this player is given a different heatmap role, what does his similarity to other successful players look like?â
⢠Or ask, âdoes his vector move closer to high-performing players in a slightly different position?â
Real-World Examples
⢠Trent Alexander-Arnoldâs hybrid role: Early models based on heatmaps and passing vectors suggested he resembled midfield playmakers more than traditional full-backs. Klopp eventually trialed him in a hybrid role.
⢠Kai Havertz at Arsenal: Struggled as a false 9, improved as a left 8. Vector models mightâve suggested this based on positional comparisons.
If you had this data at scale, across thousands of players, a vector database (like Pinecone or FAISS) would:
⢠Store each playerâs vector.
⢠Let you instantly query for âmost similar playersâ to anyone.
⢠Filter by role, league, team, etc.
So you could ask: âFind me players in the Bundesliga whose style matches this underperforming winger in the Eredivisie,â and it could help with scouting or repurposing.
Thanks for the reply. I suspect that I will be disappearing down a statistical rabbit hole on Monday.
FIFY