Network Science
Network science is an academic field which studies complex networks such as telecommunication networks, computer networks, biological networks, Cognitive network, cognitive and semantic networks, and social networks, considering distinct eleme ...
based
basketball
Basketball is a team sport in which two teams, most commonly of five players each, opposing one another on a rectangular Basketball court, court, compete with the primary objective of #Shooting, shooting a basketball (ball), basketball (appro ...
analytics comprise a various recent attempts to apply the perspective of networks to the analysis of basketball.
Overview
Traditional
basketball statistics
Statistics in basketball are kept to evaluate a player's or a team's performance.
Examples
Examples of basketball statistics include:
* GM, GP; GS: games played; games starting lineup, started
* PTS: point (basketball), points
* FGM, FGA, FG%: Fi ...
analyze individuals independently of their teammates or competitors and traditional player
positions are determined by individual attributes. In contrast, these network based analytics are obtained by constructing a team or league level player networks, where individual players are
nodes connected by the ball movement or by some
measure of similarity
In statistics and related fields, a similarity measure or similarity function or similarity metric is a real-valued function that quantifies the similarity between two objects. Although no single definition of a similarity exists, usually such mea ...
. Then, the metrics are obtained by calculating network properties, such as
degree,
density
Density (volumetric mass density or specific mass) is the ratio of a substance's mass to its volume. The symbol most often used for density is ''ρ'' (the lower case Greek letter rho), although the Latin letter ''D'' (or ''d'') can also be u ...
,
centrality
In graph theory and network analysis, indicators of centrality assign numbers or rankings to nodes within a graph corresponding to their network position. Applications include identifying the most influential person(s) in a social network, ke ...
,
clustering,
distance
Distance is a numerical or occasionally qualitative measurement of how far apart objects, points, people, or ideas are. In physics or everyday usage, distance may refer to a physical length or an estimation based on other criteria (e.g. "two co ...
etc. This approach enriches the analysis of basketball with new individual and team level statistics and offers a new way of assigning position to a player.
Team level statistics
The biggest contribution to the team level metrics came from
Arizona State University
Arizona State University (Arizona State or ASU) is a public university, public research university in Tempe, Arizona, United States. Founded in 1885 as Territorial Normal School by the 13th Arizona Territorial Legislature, the university is o ...
researchers led by Jennifer H. Fewell. Using 2010 NBA first round playoff data, they constructed the networks for each team using players as nodes and ball movement between them as links. They distinguish the trade-off between not necessarily mutually exclusive division of labor and team's unpredictability that are measured by Uphill downhill flux and Team entropy respectively.
Team entropy - A measure of unpredictability and variation in teams offense, higher entropy meaning more variation. It is calculated as aggregated individual
Shannon entropies, where unpredictability is measured as uncertainty of the ball movement between any two nodes.
[Fewell J.H., Armbruster D, Ingraham J, Petersen A, Waters JS (2012) Basketball Teams as Strategic Networks. PLoS ONE 7(11): e47445. doi:10.1371/journal.pone.0047445 http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0047445]
Uphill downhill flux - Measures the division of labor, or the expertise in moving the ball to the player with the best shooting percentage. According to Fewell et al. It can be interpreted as an average change in potential shooting percentage per pass.
The metric is calculated as a sum of the differences between the shooting percentages of the nodes at the ends of each edge
:
,
where p
''ij'' is the probability of the link between players i and j, x
''i'' and x
''j'' are their shooting percentages.
Other measures include:
Team clustering coefficient - A direct application of a
clustering coefficient
In graph theory, a clustering coefficient is a measure of the degree to which nodes in a graph tend to cluster together. Evidence suggests that in most real-world networks, and in particular social networks, nodes tend to create tightly knit groups ...
. It measures how interconnected are the players, whether the ball moves via one node or whether in many ways between all the players.
Team degree centrality - Similarly to the previous metric, it measures if there is one dominant player in the team. It is calculated by the formula
where deg(v) is the degree of the node v, deg(v
*) is the highest degree node, V is the number of nodes.
Combined low clustering and high degree centrality mean that the defense can put double team on the dominant player, since without him ball team experiences problems in moving the ball.
Average path length - Number of passes per play.
Path flow rate - Number of passes per unit time. It measures how quickly the team moves the ball.
Deviation from maximum operating potential - Using players as the nodes and ball movement and links and
true shooting percentage
In basketball, true shooting percentage is an advanced statistic that measures a player's efficiency at shooting the ball. It is intended to more accurately calculate a player's shooting than field goal percentage, free throw percentage, and thr ...
as efficiency, analogy can by made to the
traffic network. Each individual is assumed to have a skill curve f(x), which is declining in the number shots taken. Individual maximization of the efficiency yield
whereas to maximum efficiency is achieved by solving
, where
The difference between these two constitute the teams deviation from the maximum potential.
[Brian Skinner (2011) The Price of Anarchy in Basketball, Journal of Quantitative Analysis in Sports 6(1), 3 (2010), https://arxiv.org/abs/0908.1801v4]
Individual statistics
Success/Failure Ratio - The number of times the player (node) was involved in the successful play divided by the number of times the player was involved in the unsuccessful play. The metric is obtained from the team play by play network.
Under/over performance - The metric is calculated by mapping the bipartite player network. Players are connected if they were a part of one team. The links are weighted by how successful was the team, where the players played together. Then node centrality measures are compared to the reference centrality distributions for each node obtained by
bootstrap - based randomization procedures and p - values are calculated. For example p - value of player i is given by :
, where π
i* is the reference centrality score, π
i0 is the calculated centrality score, J - number of iterations. High p - values indicate under-performance, low - over-performance.
[Piette, J, Pham, L. and Anand, S. (2011) “Evaluating Basketball Player Performance via Statistical Network Modeling,” in Sloan Sports Analytics Conference, (Boston, U.S.A.), http://www.sloansportsconference.com/wp-content/uploads/2011/08/Evaluating-Basketball-Player-Performance-via-Statistical-Network-Modeling.pdf]
Under - utilization - A player is under-utilized by the time if he has a low degree centrality, but is over-performing
Player positions
New basketball positions were classified by Stanford University student
Muthu Alagappan
Muthu Alagappan (born ) is a former medical resident known for his professional basketball analytics. He was born in England and raised in Texas. During college at Stanford University, he began an internship at big data startup company Ayasdi, ...
, who while working for the data visualization company Ayasdi, mapped the network of one season NBA players linking them by the similarity of their statistics. Then, based on node clusters players were grouped into 13 positions.
[Jeff Beckha]
"Analytics Reveal 13 New Basketball" Positions
"https://www.wired.com" 04.30.2012
Offensive Ball-Handler
Player that specializes in
scoring SCORE may refer to:
*SCORE (software), a music scorewriter program
* SCORE (television), a weekend sports service of the defunct Financial News Network
*SCORE! Educational Centers
*SCORE International, an offroad racing organization
*Sarawak Corrido ...
and ball handling, but has low averages of
steals and
blocks.
Defensive Ball-Handler
Player who specialized in
assisting and stealing the ball, but is average in scoring and shooting.
Combo Ball-Handler
Player who is above average in both offense and defense, but doesn't excel in any.
Shooting Ball-Handler
Player that is above average in shot attempts and points scored per game.
Role-Playing Ball-Handler
Those who play few minutes and don't have large impact on the team.
3-Point Rebounder
A big man and a ball handler with above average
rebounds
'Rebound' is a term used in sports to describe the ball (or puck or other object of play) becoming available for possession by either opponent after an attempt to put the ball or puck into the goal has been unsuccessful. Rebounds are generally ...
and
three point shots attempted and made.
Scoring Rebounder
Player with high scoring and rebound averages.
Paint Protector
Those valued for blocking and rebounding, but with low average points scored.
Scoring Paint Protector
Players that at both good and offense and defense in the paint.
NBA 1st-Team
Those with above averages in most of the statistical categories.
NBA 2nd-Team
Similar, but a bit worse than NBA 1st-Team players.
Role Player
Similar, but worse than NBA 2nd-Team players.
One-of-a-Kind
Ones that are so good and exceptional that could not be categorized.
See also
Network science
Network science is an academic field which studies complex networks such as telecommunication networks, computer networks, biological networks, Cognitive network, cognitive and semantic networks, and social networks, considering distinct eleme ...
Graph theory
In mathematics and computer science, graph theory is the study of ''graph (discrete mathematics), graphs'', which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of ''Vertex (graph ...
Muthu Alagappan
Muthu Alagappan (born ) is a former medical resident known for his professional basketball analytics. He was born in England and raised in Texas. During college at Stanford University, he began an internship at big data startup company Ayasdi, ...
APBRmetrics
External links
* https://www.wired.com/2012/04/analytics-basketball/
* https://www.youtube.com/watch?v=oz1uQi_epAo
* https://www.wired.com/2012/12/basketball-network-analysis/
References
Basketball statistics
Network analysis