INDEX
Explanations
specific names of individuals and related actions within contexts, particularly in sports and professional settings
New Auto-Interp
Negative Logits
_MC
-0.16
rts
-0.15
MC
-0.15
akk
-0.15
kk
-0.14
Fro
-0.14
rophe
-0.14
Sebast
-0.14
Matth
-0.14
ero
-0.14
POSITIVE LOGITS
Anderson
0.27
Thompson
0.27
ompson
0.25
Johnson
0.25
Armstrong
0.24
Anderson
0.23
Johnson
0.20
Davis
0.19
Wright
0.19
Lopez
0.18
Activations Density 0.170%