INDEX
Explanations
statistics about appearances or performances
the term "appearances" used in the context of sports or performances
Negative Logits
Token       Logit
xon         -0.71
rals        -0.70
yout        -0.69
Mos         -0.69
anim        -0.68
tones       -0.66
reaction    -0.66
Reviewer    -0.65
behavior    -0.65
Cruise      -0.63
Positive Logits
Token       Logit
unbeaten    0.91
Ago         0.85
nikov       0.82
Played      0.81
bley        0.78
opener      0.76
played      0.70
liga        0.69
strugg      0.69
orial       0.67
Activations Density: 0.080%